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Award for Distinguished Service to Professor Murray Klamkin 
GERALD L. ALEXANDERSON 


If you think of problems and of problem competitions, one person is sure to 
come quickly to mind: Murray Klamkin. In collections of problems, journal 
problem sections, or surveys of Putnam or Olympiad problems, Professor Klamkin’s 
name is everywhere. For his contributions to the realm of problem solving, for his 
inspiring influence on young problem solvers, and for his many other contributions 
to mathematics, the Association presents this year’s Award for Distinguished 
Service to Mathematics to Professor Murray S. Klamkin. 

Professor Klamkin was born on March 5, 1921, in New York. He attended 
schools there, receiving a B. Ch.E. from the Cooper Union in 1942 and an M.Sc. in 
physics from the Polytechnic Institute of New York in 1947. Between 1948 and 1957 
he moved from the rank of instructor to that of associate professor at the 
Polytechnic. From there he went on to a long and distinguished career in applied 
mathematics, first at AVCO Research and Advanced Development Division, then to 
a professorship at the State University of New York at Buffalo from 1962 to 1964. 
He then joined the Scientific Research Staff at the Ford Motor Company, 1965 to 
1976. During that period he held a visiting professorship at the University of 
Minnesota and, after leaving Ford, he was appointed to a professorship in applied 
mathematics at the University of Waterloo. From 1976 until 1981 he was professor 
and chair of the Department of Mathematics at the University of Alberta, where he 
remains as professor of mathematics, now emeritus. 

He has served as problem editor of many journals including SIAM Review, The 
American Mathematical Monthly, Pi Mu Epsilon Journal, Mathematical Intelligencer. 
He has edited the Olympiad Corner of Crux Mathematicorum, the Classroom Notes 
in Applied Mathematics of SIAM Review, and served as Associate Editor of 
Computers and Mathematics with Applications and of Mathematical Modelling. 

Professor Klamkin has served the Association as a visiting lecturer between 1962 
and 1970, as member of the Board of Governors from 1971 to 1974, and as member 
of numerous committees. In 1970 he became a member of the Putnam Prize 
Committee (the committee that each year makes up the problems for the William 
Lowell Putnam Mathematical Competition), a task for which he is ideally suited; he 
became chairman of that committee in 1972. The MAA presented him with a 
Certificate of Merit in 1978. 

He has served other organizations as well including the American Association for 
the Advancement of Science of which he became a Fellow in 1982, the American 
Mathematical Society, and the Council of the Society for Industrial and Applied 
Mathematics. In 1987 the University of Waterloo conferred on him the degree, 
Doctor of Mathematics, honoris causa. 

In recent years Professor Klamkin has been deeply involved in the Canadian, the 
U. S., and the International Olympiads. He has tirelessly coached the high school 
contestants, has contributed to the construction of competitions, and has written 
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extensively on the Olympiads in Crux Mathematicorum. He served on the U. S. A. 
Mathematical Olympiad Committee as well as on its Canadian counterpart. He 
chaired the U. S. A. Committee between 1983 and 1985, coached the U. S. A. Team 
between 1975 and 1984, and coaches the mathematics team of university students 
on his own campus, simultaneously maintaining active involvement in mathematical 
competitions in the province of Alberta and in the city of Edmonton. He does all of 
this while carrying on extensive correspondence with a worldwide network of 
problem posers and problem solvers. 

He has compiled and written solutions for the International Mathematical 
Olympiads 1978-1985 which appeared as volume 31 of the NML series in 1986, and 
he is performing the same task for the U. S. A. Mathematical Olympiads 1972-1986, 
also to appear soon in the NML. 

Professor Klamkin has published over 150 papers in professional journals and 
has authored 1001 Problems in High School Mathematics. Over the years he has 
given invited addresses around the world: in Sydney, Haifa, Suva, and Halifax, to 
name a few. He has almost certainly contributed problems to every mathematical 
journal with a problem section published in the English language, and to some that 
are not: Elemente der Mathematik and Archief voor Wiskunde among them. 

Professor Klamkin’s colleague, Andy Liu, has written: “I have greatly benefitted 
from my association with Murray Klamkin. He has generously shared with me 
many worthwhile and exciting experiences. What I value most are our common 
interest and close collaboration in problem solving. We both prefer working in the 
wee hours, with our telephones buzzing when some progress is made. His mathe- 
matical insight is uncanny.” 

And Professor Klamkin’s longtime friend and collaborator, D. J. Newman has 
written: “Some of us remember when Mathematics was full of marvels and mystery. 
It was an adventure and a constant challenge. So it was with Murray Klamkin. A 
walk with Murray, something which I have often enjoyed, was always punctuated 
with a phrase such as, ‘Yes, but is it true for tetrahedrons?’ 

“Whether or not we share his love for tetrahedra and such, very few of us could 
resist his excitement and love for the ‘thing mathematical.’ 

“TI do not want to give the impression that Murray Klamkin was just the 
mathematical ‘enfant-terrible,’ however. He is quite the scholar, in his own way, and 
can usually be counted on for the incisive reference, and the complete history 
behind a vast number of results.” 

“Of course his teaching, and especially coaching abilities are legend. I remember 
how delighted I always was when I heard his lectures or just his casual explanations.” 

We look forward to being challenged by Murray’s ingenious problems for many 
years to come. 


Isonemal Fabrics * 


BRANKO GRUNBAUM and G. C. SHEPHARD 


BRANKO GRUNBAUM and G. C. SHEPHARD teach mathematics at the Univer- 
sity of Washington in Seattle, and the University of East Anglia in Norwich, 
respectively. They have published extensively on various questions in geome- 
try, combinatorics, and group theory. During the last ten years they have 
worked on a systematic development of the theory of tilings and related 
topics; many of the results of this collaboration appear in the recently 
published book Tilings and Patterns (reviewed in this MONTHLY). 


1. Introduction. The beginnings of geometry were stimulated by practical prob- 
lems of building, surveying, and decorating. Throughout history, internally moti- 
vated developments as much as the needs of other fields helped steer geometry into 
many new directions, and endow it with a surprising variety of flavors. Foundations, 
algebraic geometry, and differential geometry are among the major and most widely 
studied branches of geometry; each is now a vast body of knowledge and insights. 
But geometry comprises a large number of other fields—smaller but no less 
interesting. Many of these disciplines (such as convexity, theory of symmetry, 
combinatorial geometry, computational geometry) arose, like geometry itself, from 
simple problems or observations, which often are visually and intuitively easy to 
grasp. Elementary beginnings often lead to deeper results, to applications in 
mathematics and outside of it, and to additional attractive problems; all this 
showing the versatility of the geometric approach, and its continuing vitality. As an 
added point of interest we may note that in many cases, the ease with which a 
problem may be formulated and understood is in marked contrast to the difficulties 
encountered in attempting its solution. 

To illustrate this point, we find it surprising and even humbling to realize that we 
have no idea what is the maximum number of (two-dimensional) faces possible in a 


*Research supported by NSF Grants MCS77-01629-A01, MCS8001570 and MCS8301971, and by a 
Fellowship from the John Simon Guggenheim Memorial Foundation. 
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convex polyhedron of which congruent copies will tile three-dimensional space—nor 
do we even known if any such maximum exists! Examples of “tiling polyhedra” 
with 38 faces were found by Engel [8]; additional information and references on this 
topic can be found in Griinbaum and Shephard [16]. 

Many other questions arise (through some abstraction) from everyday phenom- 
ena and lead to similarly baffling problems; often enough, they also lead to 
interesting mathematical developments. In a number of papers we have discussed 
several topics of this nature—the classification of plane patterns, symmetries in 
ornamental art, tilings of space, colorings of patterns—but the field is still wide 
open. We hope that increasing numbers of our colleagues will be attracted to study 
these elegant and challenging questions. 

In this article we are concerned with combinatorial, geometric, and group-theo- 
retic problems related to woven fabrics. Two aspects of this topic may be as 
unexpected to the reader as they were to us. The first surprise was the existence of 
many nontrivial mathematical questions that arise in this context. The second 
surprise was our finding that over the centuries practical weavers have produced 
many fabric designs which are mathematically interesting. In [15], [19], and [20] we 
have given accounts of some of the questions and of the basic results concerning 
woven fabrics of the usual kind, in which the strands lie in two perpendicular 
directions; bibliographic references were also included, hence we do not repeat them 
here except as needed. In the present article we extend the investigation and 
consider fabrics in which the strands can lie in more than two directions. Some 
“fabrics” of these kinds have excellent mechanical features and have been produced 
since ancient times—not by weavers but by basket makers; see Section 8 for more 
details and for references. Recently we learned that a company concerned with the 
manufacture of textile machinery has produced a loom that weaves fabrics with 
strands in three directions (or “triaxial fabrics,” as they call them), see [41]. We have 
also heard that such fabrics are being used in certain applications where non- 
stretchability is essential (as for sails or parachutes). 

The organization of the article is as follows. In Section 2 we review the necessary 
terminology. Our treatment is self-contained but is necessarily brief; the reader is 
urged to consult [15] and [19] for additional information and illustrative examples. 
At the end of the section, the main result is formulated; the following four sections 
are devoted to its proof. In Section 7 we discuss various generalizations and open 
problems. Additional information on the practical aspects of the more general kinds 
of fabrics is given in the last section, together with some historical remarks. 

Some of the fabrics we shall describe here are quite complicated, and verbal 
descriptions or even diagrams often prove inadequate for a detailed understanding 
of their structure. In order to help readers visualize some of the three-dimensional 
structures, in a number of cases (see Section 5) we have found it convenient to draw 
two parts of a fabric separately; the reader should fold a copy of the page so that 
one part lies on top of the other. We hope that this will indicate how the two parts 
are to be “interlaced.” But for a full understanding of the material presented it is 
advisable to construct “models” of the fabrics. This can be easily done using long 
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narrow Strips of colored paper to represent the strands; it will be found convenient 
to hold the strips in position by temporarily pinning or taping them to a suitable 
board. 


2. Terminology and the statement of the main result. A fabric, as we use the 
term, consists of two or more sets of “strands” (or “ threads”) which are “woven” 
together subject to certain conditions. Our first aim will be to reformulate this 
statement as a rigorous mathematical definition. 

By a strand we mean a doubly infinite, straight, open strip of constant width, that 
is, the set of points of a plane that lie strictly between two parallel lines. A layer is a 
set of parallel, congruent and disjoint strands such that every point of the plane 
belongs to (the interior of) exactly one of the strands, or else lies on the boundaries 
of two adjacent strands (see [15, FIGURE 1]). Since all the strands in a layer are 
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Fic. 1. A design (a) and a sketch (b) of a 2-way, 2-fold isonemal fabric; this is an example of a “sponge 
weave.” Here and in the other illustrations we use A, B, etc. as abbreviations for the permutations which 
indicate the rankings of the strands. 


parallel, there is a unique direction associated with the layer, and it is meaningful to 
speak of parallel or nonparallel layers. A prefabric F consists of two or more 
congruent layers of strands in the same plane F together with a preferential ranking 
or ordering of the layers at every point P of EF that does not lie on the boundary of 
a strand. This ranking must satisfy the condition R1 stated below, which formalizes 
the idea that the strands represent material objects which cannot penetrate each 


8 B. GRUNBAUM AND G. C. SHEPHARD [January 


other. The ranking is conveniently represented by a permutation of the numbers 
1,2,..., f, where f is the number of layers in the prefabric. Thus in a prefabric with 
three layers, the permutation (312) assigned to a point P means that at P we regard 
the strand belonging to layer 3 as taking a precedence over those belonging to layers 
1 and 2, and the strand belonging to layer 1 as taking precedence over that 
belonging to layer 2. In accordance with the practical interpretation we say that, at 
P, the strand of layer 3 passes over the other two strands, or is the uppermost (or 
top) strand; the strand of layer 2 passes under the other two strands, or is the 
lowermost (or bottom) strand; and the strand of layer 1 passes between the other two 
strands, or is the middle strand. Obvious modifications of this terminology will be 
made when, for example, a prefabric contains more than three layers. 

In order to come closer to the objects of practical use we shall in general restrict 
attention to fabrics, by which we mean prefabrics that satisfy the common-sense 
requirement R2 expressing the idea that a fabric should “hang together.” 

The restrictions on the rankings are as follows. 

R1. Suppose that the point P belongs to a strand 7, of layer i and also to a 
strand T; of layer j (where i # j). Then if layer 7 is ranked before layer j at P, layer 
i must be ranked before layer j at every point of T,M T;,. 


R2. It is impossible to partition the set of all strands, belonging to all the layers, 
into two nonempty subsets so that each strand in the first subset passes over (is 
ranked before, or takes precedence over) every strand in the second subset. 


A prefabric that fails to satisfy R2 is said to “fall apart.” Examples of such 
prefabrics with two layers were given in [15, FIGURE 5] and [19, FIGURE 2]. An 
example of a prefabric with three layers will be given shortly (FIGURE 4). Criteria 
for testing whether a given prefabric satisfies condition R2 were given by Clapham 
[2], [3] and [4]; see also Enns [9]. For a practical procedure see [19]; although the 
formulation there is for prefabrics with just two layers, it is of general applicability. 

If the strands of a fabric F lie in w different directions, then F will be called a 
w-way fabric, and if F contains f layers then F will be called an f-fold fabric. (It is 
clear that these definitions make sense for prefabrics as well; if convenient, we shall 
use them in this more general sense.) If no further restrictions are imposed on the 
fabrics under consideration, it is easy to see that w-way, f-fold fabrics exist for all 
integer values of w and f satisfying f > w > 2. In [15], [19], and [20] we considered 
the “minimal” case w = f = 2; here we shall be mainly concerned with larger values 
of w and f. 

In order to specify a w-way, f-fold fabric we shall employ a graphical representa- 
tion that generalizes the traditional designs of black and white squares (introduced 
by weavers long ago, and used in most mathematical treatments as well). On the 
plane E of the fabric F we use sets of equidistant parallel lines to represent the 
edges (boundaries) of the strands; a w-way fabric leads to lines lying in w different 
directions. These lines divide F into a set of polygonal regions or tiles, each of 
which is labelled with a permutation indicating the ranking of the layers at one 
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(and, therefore, by R1 at every) point of the region. The resulting labelled tiling is 
called a design for the fabric. Examples of designs are given in FIGURES 1, 2, 3 and 
subsequent diagrams. FIGURE 1 shows a 2-way, 2-fold fabric known traditionally as 
a sponge weave. FIGURES 2 and 3 show 3-way, 3-fold fabrics, and they illustrate the 
fact that if w > 3, the relative positions of the strands can differ in such a way as to 
lead to infinitely many different tilings of the plane. 
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Fic. 2. A design and a sketch of a 3-way, 3-fold fabric. This fabric is not isonemal. The positions of the 
layers can be moved relative to one another to form a continuous family of distinct fabrics. 


Many of the illustrations show, besides the designs, also sketches of the fabrics. 
In these, shading is used to indicate for each tile the layer to which the uppermost 
strand belongs. Hence a sketch approximates the “appearance” of a model of the 
fabric when it is looked at from above. It should be noted that, unlike a design 
which specifies the structure of a fabric completely, a sketch gives much less 
information about the fabric when f> 3. This is illustrated by FIGURE 7; the 
fabrics whose designs are shown in FIGURES 7(a) and 7(c) are distinct yet their 
sketches are the same. 
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Fic. 3. A design and a sketch of a 3-way, 3-fold isonemal fabric known as the “mad weave.” 


Given any tiling defined by sets of parallel and equidistant straight lines, various 
restrictions on the allocation of permutations to the tiles (implied by conditions R1 
and R2) have to be satisfied if the resulting labelled tiling is to be the design of a 
fabric. In particular, as a consequence of condition R1, if the common edge of two 
adjacent tiles S, and S, is parallel to the strands of layer i, then the permutations 
allocated to S, and S, can differ only in the position of the integer i. We also note 
that this implies, in the case of w-way fabrics with w > 3, that the permutation 
allocated to any one tile is uniquely determined by those allocated to all its adjacent 
tiles. “Designs” which violate condition R2 are more difficult to recognize on sight. 
An example is shown in FIGURES 4(a) and 4(b). Here the “design” does not 
correspond to a fabric since the strands labelled * in the diagram pass over all the 
remaining strands. In other words, a prefabric constructed according to this 
“design” would separate into the two parts shown in FIGURES 4(c) and 4(d). See 
‘Section 7 for further comments on the topic of such “ weaves.” 

A symmetry of a fabric F is defined to be any isometry of the plane E containing 
F which maps each strand of F onto a strand of F and either preserves the rankings 
at each point of £, or else reverses all the rankings. A symmetry which reverses the 
rankings is said to interchange the sides of F, and one that leaves the rankings 
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Fic. 4. The design in part (a) appears to be that of a fabric, but is not since it “falls apart,” that is, 
violates condition R2. In the sketch (b) we have indicated by asterisks a subset of the strands that pass 
over all the remaining strands and so can be “lifted off.” In (c) and (d) we show sketches of the resulting 
pseudofabrics. 


unchanged is said to preserve the sides of F. The group of all symmetries of F (with 
the operation of composition) is denoted by S(F) and is called the symmetry group 
of F. We use S,(F) to denote the normal subgroup of S(F) consisting of those 
symmetries that preserve the sides of F. There are two possibilities: either S)(F) = 
S(F), or S)(F) is of index 2 in S(F). In the latter case there must be some element o 
in S(F) which interchanges the sides of the fabric, and then S(F) = S,(F) U oS, (F). 
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A fabric is called periodic if Sj(F), and, therefore, S(F), contains translations in 
at least two nonparallel directions. The design of a periodic fabric can be obtained 
from a fundamental block (usually a parallelogram called a period parallelogram, but 
for groups with 3-fold rotations frequently chosen as a regular hexagon) by applying 
to it the translations of S(F). 

A fabric F is called isonemal (pronounced i-so-nee’-mal, from the Greek word for 
thread) if S(F) is transitive on the strands of F. Thus an isonemal fabric is one with 
the property that every strand is the image of one chosen strand under a symmetry 
of the fabric or, equivalently, the strands form one orbit under the group S(F). For 
example, the 3-way, 3-fold fabric of FIGURE 3 is isonemal—in this case even the 
proper subgroup S,(F) of S(F) is transitive on the strands. On the other hand the 
3-way, 3-fold fabric of FIGURE 2 is not isonemal. 

Throughout the discussion, we consider two fabrics as identical if one can be 
mapped onto the other by a similarity transformation, possibly combined with an 
interchange of the two sides of the fabric (that is, a reversal of all the rankings in the 
design of the fabric). Clearly, fabrics with the general appearance of that in FIGURE 
2 but corresponding to different ratios of size of the small triangles to the large 
ones, are different under this interpretation. In the case of isonemal fabrics, to say 
that two fabrics are identical can be shown to be equivalent to saying that they are 
homeomeric. A coarser classification, into henomeric types, is also possible and 
seems to be the appropriate one in some cases. (The meaning of these terms is, in 
essence, that two objects are homeomeric if there is a homeomorphism mapping one 
onto the other and compatible with all the symmetries involved; henomeric is 
defined similarly, but using a bijection between parts of the object instead of the 
homeomorphism. For detailed explanations see [17] or [21], where examples of 
applications of these concepts to many classes of geometric structures are given as 
well.) 


Using the terminology introduced above we are now able to state the basic result: 


THEOREM. If w and f are such that a w-way, f-fold periodic isonemal fabric exists, 
then the pair (w, f) is one of the following six: (2,2), (2,4), (4,4), (3,3), (3,6) or 
(6,6). Moreover, for each of these pairs there exist infinitely many different w-way, 
f-fold periodic isonemal fabrics. 


The proof of the theorem falls into two parts. First we shall prove the last 
(existential) assertion. We already know that infinitely many different 2-way, 2-fold 
isonemal fabrics exist (see [15]), and the next three sections will establish the 
corresponding facts for each of the other five pairs (w, f). In Section 6 we shall 
complete the proof by showing that no pairs (w, f) can occur other than the six 
listed in the statement of the theorem. In Sections 3 through 6 all the isonemal 
fabrics are periodic, and we shall not repeat this each time. 


3. Construction of 3-way, 3-fold isonemal fabrics. It is clear that if a 3-way fabric 
is to be isonemal, the directions of the layers must be equally inclined (at 60° or 
120° to each other). Hence the edges of the strands must be arranged as illustrated 
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in FIGURES 2 or 3. However, it is easy to verify that the situation in FIGURE 2, where 
the edges of the tiles do not pass completely across strands, is incompatible with 
isonemality. Thus only those fabrics can be isonemal in which some points (the 
centers of 3-fold rotational symmetry) are on the boundaries of strands of all three 
directions. 

The fabric of FIGURE 3 is the “simplest” 3-way, 3-fold isonemal fabric, in the 
sense that it is periodic with a fundamental block of the smallest area. This fabric is 
known as the “mad weave,” see Section 8. Infinitely many other isonemal fabrics 
can be derived from the “mad weave” by the procedure of transposing the rankings 
of pairs of strands in suitable parts of the fabric. If this is done in a sufficiently 
“orderly” way, isonemality will be preserved. An example is shown in FIGURE 5. In 
each of the rhombs outlined by a heavy line in the design, the rankings of two 
strands have been interchanged (compare FIGURE 3). It is clear that infinitely many 
fabrics of this kind can be constructed in a similar manner. 


EYNIRLINIIN. T, VNC 
VAT AININ, |Z 
DSRINISINPTS TSN NY 
ADINSININ, SPT NAIA I 
ISIRINISDDSIST. YXCIN NTS 
IVAN AVN AV NVA CEVA AV 
LSI ISSRITS DW RL AT 


1 3 A: (123) B: (231) C: (312) 
, D: (132) E: (213) F: (321) 
(a) (b) 


Fic. 5. The construction of a 3-way, 3-fold isonemal fabric by transpositions. Starting from the “mad 
weave” (FIGURE 3), we transpose two of the strands in each of the rhombs indicated by thickened lines in 
the design (a). If such transpositions are carried out in a sufficiently orderly way, as done here, the 
resulting fabric is isonemal. A hexagonal fundamental block is indicated by dotted lines. A sketch of this 
fabric is shown in (b). 


We shall now describe some further families of 3-way, 3-fold isonemal fabrics, in 
order to illustrate the wealth of possibilities. In FIGURE 6 we show two members of 
the first family, which we propose to call hexagonal fabrics. Let us denote by F,(p) 
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Fic. 6. Designs and sket f two 3-way, 3-fold isonemal fabrics for which we propose the name 
“hexagonal fabric acne (and) show 2) and pr » 6) and (show Ene) These are the first two 
members of a a infini amily o F(p) w m 4 # 1 (mod 3). In ine design * (and 
in several of the following oe some not the tes a are “hicks ned to facilitate comparison between the 
design and the corresponding sketch. 
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Fic. 7. Designs and sketches of two 3-way, 3-fold isonemal fabrics for which we propose the name 
“triangular fabrics.” Parts (a) and (b) show F,(1) and parts (d) and (e) show F,(2). The triangular fabrics 
F,(p) (with p > 1) form an infinite family of isonemal fabrics. Part (c) shows the design of another 3-way, 
3-fold isonemal fabric, which is obtained from F,(1) by transposing certain pairs of strands. A sketch of 
this fabric is identical with that shown in (b), thus illustrating the inadequacy of sketches in specifying 
w-way fabrics when w > 3. 
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Fic. 7. 


the hexagonal fabric in which the hexagonal “patches” are each built up from six 
equilateral triangles which appear in the sketch as made up of p parallel strands. 
The hexagonal patches are separated by floating strands, that is, by strands which 
run for an appreciable distance along one side of the fabric without interweaving 
with strands from the other two layers. The prefabric of FIGURE 4 appears to belong 
to this family, but as already mentioned, it is not a fabric since it does not hang 
together. In general, it is not hard to see that for p > 1 the prefabrics F,(p) are 
indeed fabrics, and that they are isonemal if and only if p # 1 (mod 3). 

In FIGURES 7(a) and 7(c) we show designs of two members of a second family of 
3-way, 3-fold fabrics. These we call triangular fabrics, and we denote F,(p) the 
triangular fabric in which the floating strands separate triangular patches each of 
edge-length 3p. It can be shown that F\(p) is an isonemal fabric for each p > 1. The 
case p = 0, in which the triangles disappear completely, can be interpreted as 
meaning the “mad weave” of FIGURE 3. The “mad weave” may therefore be 
denoted by F,(0); alternatively, with equal justification it may be denoted by F,,(0). 

Hexagonal and triangular fabrics can be modified and combined in various ways. 
An example is shown in FIGURES 8(a) and 8(b). Here the hexagonal patches have 
edge-length 2, the triangular patches have edge-length 3, and they are separated by 
floating strands as before. More generally, it can be shown that if a fabric of this 
kind has hexagonal patches of edge-length p and triangular patches of edge-length 
q, then it is isonemal if and only if p #1 (mod 3) and the integers 2p + 1 and 
2q +1 are relatively prime. Another example of a fabric which results from 
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Fic. 8. Designs and sketches of two 3-way, 3-fold isonemal fabrics that result from combining the 


triangular and hexagonal fabrics shown in the previous 


igures. 


two f 
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combining a hexagonal and a triangular fabric is shown in FIGURES 8(c) and 8(d). 
Here the patches are separated by pairs of floating strands. It is clear that there are 
many other possibilities. 


4. Construction of 2-way, 4-fold and 3-way, 6-fold isonemal fabrics. The simplest 
way to construct a w-way, 2w-fold fabric F* (w = 2 or 3) is by taking any w-way, 
w-fold fabric F and replacing every strand by a pair of strands whose boundaries 
coincide (in other words, by two strands one of which lies on top of the other). If F 
is such that 

(i) the group S,(F) of symmetries that preserve the sides of the fabric is transitive 
on the strands, and 

(11) the group S(F) contains a symmetry which interchanges the sides of the 
fabric, 
then the corresponding fabric F* will be isonemal. It is clear how a design for F* 
can be derived from that of F, and the sketches of the two fabrics will be identical. 

More interesting isonemal fabrics can be obtained by a procedure we call 
interlacing. In practical weaver’s terminology this is known as “stitching” or “tying” 
the two fabrics together, see [13, Chapter 7]. It can be explained as follows. Let F be 
a w-way, w-fold fabric (w = 2 or 3) and lay one copy of F, say F,, on another copy 
of F, say F,, facing in the other direction and in such a way that the boundaries of 
the strands of F, and of F, coincide. Then clearly the prefabric F, U F, is not a 
fabric since it does not hang together, but under certain circumstances it may be 
transformed into a fabric (even an isonemal fabric) by reversing the ranking of the 
top strand of F, and the bottom strand of F, at various points of the plane. If S9(F) 
is transitive on the strands, and the reversal of rankings or interlacing is carried out 
in some “orderly” way, then an isonemal fabric may be obtained. Examples starting 
from 2-way, 2-fold fabrics are shown in FIGURE 9. In FIGURE 9(a) we begin with 
two copies of the common “plain” or “tabby” weave (one with strands in layers 1 
and 2, the other with strands in layers 3 and 4) and interlace the strands at the 
positions indicated by heavy lines in the design. In FIGURE 9(b) we show the result 
of applying a similar procedure to the fabric traditionally known as the “basket 


3 
weave’, and in FIGURE 9c) to a 3 twill. (For an explanation of these terms see 


[15].) All of these fabrics are isonemal. 

Exactly similar considerations apply to 3-way, 3-fold fabrics. We give an example 
in FIGURE 10. Here two copies of the “mad weave” of FIGURE 3 are placed one on 
top of the other, and interlacing takes place at all points inside the hexagons 
indicated by heavy lines. The precise method of doing this is shown by the 
permutations in the design. 

The construction of fabrics obtained by interlacing is not completely trivial, since 
it is often difficult to determine, from a given design, when interlacing is or is not 
possible and if it leads to an isonemal fabric. In all the examples given here the 
results have been carefully checked, theoretically and by using models as suggested 
in the introductory remarks. The “doubling up” of all strands, described at the start 
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Fic. 9. Designs and sketches of three 2-way, 4-fold isonemal fabrics that are obtained from 2-way, 2-fold 
fabrics by the process of “interlacing.” In part (a) we show how two copies of the “plain” or “tabby” 
weave can be laid one on top of the other so that the boundaries of the strands coincide, and then 
interlaced at the positions shown by the thickened squares in the design. In (b) we show a similar process 
applied to a “basket weave,” and in (c) to a twill. It should be noticed that the sketch of each fabric 
coincides with that of each of its component 2-way, 2-fold parts. 
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Fic. 10. Interlacing two copies of the “mad weave” (FIGURE 3) produces a 3-way, 6-fold isonemal fabric 
F. The sketch of this fabric coincides with that of the “mad weave,” and the thickened hexagons indicate 
the places where the strands are interlaced. The dotted hexagon indicates a fundamental block of the 
fabric F. 


of this section, can obviously be considered as an extreme special case of interlacing. 

It should be noticed that the method of interlacing can also be used to construct 
different kinds of 2-way, 4-fold isonemal fabrics, for which the boundaries of the 
strands in the two copies of F are distinct (that is, parallel strands partly overlap). 
The practical uses of 2-way, 4-fold fabrics appear in most cases to have the strands 
of the two layers shifted a half-width with respect to each other; see, for example 
[13, Figure 7.11], for such an isonemal fabric obtained by interlacing two copies of a 


a4 twill. (Due to differences in the methods of presentation used, the drawing of 


this fabric in [13] resembles a sketch of a 2-fold fabric.) 

On the other hand, it can be shown that in the case of 3-way, 6-fold fabrics, the 
interlacing procedure can yield isonemal fabrics only if the boundaries of the two 
copies of F coincide. 


5. Construction of 4-way, 4-fold and 6-way, 6-fold isonemal fabrics. The only 
method known to us of constructing these fabrics is by a variation of the interlacing 
procedure described in the previous section. For w = 2 or 3, we place two copies of 
a w-way, w-fold fabric F together as in Section 4, but in such a way that the 
corresponding layers of the two copies are not parallel. If we interlace the copies so 
that they hang together then we obtain a 2w-way, 2w-fold fabric, and it may be 
possible to arrange for this to be isonemal. 
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Fic. 11. A 4-way, 4-fold isonemal fabric. Two copies of a “sponge weave” (FIGURE 1) are laid one on 
top of the other in such a way that the strands indicated by asterisks can be interlaced. The reader is 
invited to fold the page along the dashed line, to see that then the asterisks in the two copies of the 
“sponge weave” coincide. This representation has been chosen since a design for the fabric would be so 
complicated as to be unintelligible. 


Fabrics produced in this way are difficult to describe since their designs are 
usually so complicated as to be unintelligible, and sketches do not suffice. We, 
therefore, adopt the method indicated in the examples of FiGuREs 11 and 12. In 
FIGURE 11 we show sketches of two copies of the “sponge weave” of FIGURE 1, and 
indicate how the page is to be folded so that the two sketches lie one on top of the 
other. If this is done carefully, 1t will be noted that the long floating strands marked 
with asterisks cross each other in such a way that they can be interlaced. If each 
such pair is interlaced then it is easily verified that the resulting 4-way, 4-fold fabric 
is isonemal. In FIGURE 12 we give a second example. Here a 6-way, 6-fold isonemal 
fabric is obtained in a similar manner by interlacing two copies of the 3-way, 3-fold 
hexagonal fabric F,(2) of FIGURE 6. 


RID 


Fic. 12. A 6-way, 6-fold isonemal fabric obtained by interlacing two copies of the hexagonal fabric F,(2) 
(see FIGURES 6(a) and (b)). The representation is the same as in the previous figure. 
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The same procedure may be applied to many 2-way, 2-fold fabrics with suffi- 
ciently long floating strands, such as other “sponge weaves” (see [15, Figure 6] for 
examples), and to many 3-way, 3-fold fabrics such as the hexagonal and triangular 
fabrics described in Section 3. Other fabrics of these kinds may be obtained by 
interlacing in FIGURES 11 and 12 only suitable proper subsets of the strands marked 
by asterisks. We deduce that infinitely many 4-way, 4-fold and 6-way, 6-fold 
isonemal fabrics exist, as claimed. 


6. Characterization of the (w, f) pairs. From [15] we know that infinitely many 
2-way, 2-fold isonemal fabrics exist, and in the previous three sections we have 
established the existence of infinitely many w-way, f-fold fabrics where (w, f) is 
one of the pairs (2, 4), (4, 4), (3, 3), (3, 6), or (6,6). To complete the proof of the 
theorem it is, therefore, only necessary to show that no other pairs (w, f) can occur 
for isonemal fabrics. 

We shall accomplish this by considering the properties of the group S,(F); this 
group must be one of the 17 crystallographic plane groups (for a description of these 
groups see, for example, [6], [10], [21], [30], or [38)]). 

If S)(F) contains a reflection in a line L (lying in the plane of the fabric) then 
every strand of F must be either parallel to L or else perpendicular to it. Hence, if 
S(F) contains lines of reflection lying in more than one direction, then these lines 
must be in two perpendicular directions. In any case, whether all the lines of 
reflection are parallel or not, all images of any given strand T of F under the group 
S,(F) lie in either one or two directions. 

If S,(F) contains no reflections but contains glides (glide-reflections), then So(F) 
must be either the group pg (in which all the glides are along parallel axes) or the 
group pgg (in which the glides are along axes in two mutually perpendicular sets of 
lines parallel to each other). In either case all images of any given strand T of F 
under the group S,(F) must lie in either one or two directions. It should be noted 
that in the latter case the two directions do not have to be perpendicular; see, for 
example [15, Figure 22]. 

If So(F) contains neither reflections nor glides then it must be one of the groups 
pl, p2, p3, p4, or p6. It follows from this that all images of a given strand T of F 
under the symmetries of S,(F) must lie in 1, 1, 3, 2, or 3 directions, respectively. 

We deduce from the above that in all cases the images of T under S,(F) must lie 
in 1, 2, or 3 directions. We now ask in how many directions will the images of T lie 
when we apply the whole symmetry group S(F) instead of S,(F)? The question 1s 
trivial if S(F) = S,(F); if the two groups are not equal, the fact that S,(F) is a 
normal subgroup of index 2 in S(F) shows that the number of directions either 
remains the same or doubles. The latter occurs when the image of a strand T under 
some symmetry in S(F) is not parallel to the image of T under any symmetry in 
So(F). We deduce that every isonemal fabric is w-way for w = 2, 3, 4, or 6; the 
value w = 1 is clearly inadmissible. 

We now consider the possible values of f for an isonemal fabric to be f-fold. If 
all the strands visible on the upper side of F lie in w directions (where w = 2 or 3) 
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then so must all the strands visible on the lower side of F. The latter may be in 
different directions from those on the upper side, or they may be in the same 
directions but distinct from them, or they even may be the same strands. These three 
possibilities lead (for w = 2) to 4-way, 4-fold; 2-way, 4-fold; or 2-way, 2-fold 
fabrics. Or (for w = 3), to 6-way, 6-fold; 3-way, 6-fold; or 3-way, 3-fold fabrics, 


respectively. 
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Fic. 13. A sketch of a 4-way, 4-fold isonemal fabric which illustrates the fact that it is possible for 
strands in all four directions to be visible on one side of the fabric. The construction is similar to that in 
FIGURE 11. Here two copies of a “sponge weave” with long “floating strands” are used; the upper copy 
is shown and the lower has strands in the directions indicated by the arrows under the diagram. The two 
copies are placed in such a way that sets of five adjacent parallel strands from the lower fabric can be 
interlaced with sets of five adjacent parallel strands from the upper fabric; this is not visible from the top, 
but is indicated by the shaded areas in the sketch. The middle strand of each of these interlaces can then 
be interlaced again— those from the bottom fabric with the topmost layer of the upper fabric as indicated 
by black rhombs. In the same way, strands from the upper fabric are interlaced with the lowermost layer 
of the bottom fabric. It is easily verified that the fabric so produced is a 4-way, 4-fold isonemal fabric 
with strands of all four directions visible on each side of the fabric. The small black squares mark centers 
of 4-fold rotational symmetries of the resulting fabric. 
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In the remaining case, the strands visible on the upper side of F lie in 2w 
directions (where w = 2 or 3). Now the strands visible on the lower side of F must 
be the same as those visible on the upper side. This follows since, by the earlier 
arguments, half the strands visible on the upper side are images of a strand T under 
symmetries of S(F) which do not belong to S,(F), and which, therefore, interchange 
the sides of the fabric. Hence, we obtain, in this case, either a 4-way, 4-fold fabric 
(for w = 2), or a 6-way, 6-fold fabric (for w = 3). 

This completes the characterization of the possible values of w and f given by 
the Theorem. 

The possibility discussed at the end of this proof can actually occur. This is 
illustrated by the 4-way, 4-fold fabric sketched in FIGURE 13; analogous 6-way, 
6-fold fabrics can also be constructed. It will be noted that the fundamental block of 
the fabric in FIGURE 13 is exceedingly large; it is not known whether such fabrics 
exist with reasonably small fundamental blocks. 


7. Comments. (a) Clearly the symmetry group of an isonemal fabric must 
contain translations. If these translations lie in at least two nonparallel directions 
then we have called the fabric periodic, and our main theorem relates to such 
fabrics. However, there exist isonemal fabrics in which all the translations are 
parallel, and we shall refer to these as nonperiodic. A sketch of such a fabric appears 
in FIGURE 14. In fact, the theorem remains valid even for nonperiodic isonemal 
fabrics since it can be shown that every nonperiodic isonemal fabric must be either 


Fic. 14. A sketch of a nonperiodic 2-way, 2-fold isonemal fabric. The diagonal “stripe” continues in 
both directions, and the rest of the fabric is the same as in the “tabby weave.” The only translational 
symmetries are parallel to the diagonal, hence the fabric is ‘not periodic according to the definitions 
adopted. A 2-way, 4-fold isonemal fabric can be obtained, for example, by replacing each strand by two 
strands whose boundaries coincide. 
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2-way, 2-fold or 2-way, 4-fold. The sketch in FIGURE 14 may be regarded as 
representing a fabric of either kind—for the 4-fold fabric the strands in both 
directions should be doubled, as explained at the beginning of Section 4. 

(b) A fabric F is said to be strongly mononemal if, in the design D of F, every 
strip that corresponds to a strand of F is an isometric image of every other strip, 
under an isometry that either preserves all the rankings or else reverses them all. 
(The isometry in question is not required to be a symmetry of the design D.) We 
conjecture that our Theorem remains valid for strongly mononemal fabrics. 

A weaker restriction applies if the fabric F is weakly mononemal. For that, in the 
design D of F, every strip that corresponds to a strand T of F must again be an 
isometric image of the strip that corresponds to any other strand T *. However, the 
isometry that accomplishes this is not required to preserve (or reverse) all the 
rankings, only the position of the layer to which the strands T and T* belong. In 
other words, if T is on top in one tile, then‘ 7* should be on top of the 
corresponding tile regardless of the rankings of the other strands in these tiles. 

Clearly, for 2-way fabrics the concepts of weakly and strongly mononemal fabrics 
coincide. It is not known whether for 3-way fabrics either of the two mononemality 
properties implies isonemality. 

It seems likely that the Theorem remains valid even for weakly mononemal 
fabrics. 

(c) In Section 4 it was shown how 2-way, 4-fold and 3-way, 6-fold isonemal 
fabrics can be obtained by interlacing two 2-way, 2-fold or 3-way, 3-fold fabrics. 
Using the criteria for a fabric to hang together given in [3] or [26], it is easy to show 
that, conversely, every 2-way, 4-fold isonemal fabric can be obtained from two 
2-way, 2-fold fabrics by interlacing. It is not known whether an analogous converse 
is valid for 3-way, 6-fold; 4-way, 4-fold; and 6-way, 6-fold isonemal fabrics, or in 
the case that all the fabrics involved are assumed to be (strongly, or else weakly) 
mononemal. 

(d) It seems likely that for isonemal 3-way prefabrics condition R2 is automati- 
cally fulfilled. It even may be that the same is true for (strongly or weakly) 
mononemal prefabrics. If either of these possibilities is valid, then it would easily 
follow that all (isonemal, or mononemal) 3-way, 6-fold fabrics arise by interlacing. 

(e) The concept of a fabric can be generalized in various ways. For example, we 
may modify the definition of a layer of strands by deleting the condition that every 
point of the plane belongs to the closure of a strand. In other words, we now allow 
the possibility that the (congruent and parallel) strands of a layer may be separated 
by gaps. A “fabric,” defined exactly as in Section 2 but using these modified layers, 
will be called a pseudofabric or a weave. Examples of pseudofabrics appear in 
FIGURES 4(c), 4(d), 15, and 16. (The weave of FIGURE 4(c) is frequently used in 
basketry, especially in the manufacture of chair seats. It is one of the weaves that 
can be woven mechanically, see [41].) It is clear that if F is a 2-way pseudofabric 
then it is possible to obtain a fabric F* by widening the strands of F; moreover, if F 
is isonemal, it is possible to do this in such a way that F* is isonemal as well. Such 
replacement of w-way pseudofabrics by fabrics is clearly not always possible if 
w > 3; see, for example, the weave in FIGURE 4(c). 
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Fic. 15. A design (a) and sketch (b) of a 3-way isonemal pseudofabric. In each of the three directions the 
strands are separated by a gap equal in width to half that of the strand. 


A simple but interesting 3-way, 3-fold pseudofabric is shown in FIGURE 16. We 
have not found it mentioned anywhere, although similar— but not isonemal— weaves 
are of common use in basketry. 

We believe that mathematically meaningful results on pseudofabrics are possible, 
and present here some relevant ideas. 

Starting from a given isonemal fabric, it is sometimes possible to produce a 
pseudofabric by omitting every alternate strand (or every k > 1 consecutive strands 
out of every k + 1 or k + 2) in every layer. Examples are easy to find, but we have 
been unable to formulate necessary and sufficient conditions for this procedure to 
produce an isonemal weave. 

For any point P of the plane let s( P) denote the number of strands of a weave F 
that contain P. Let f= maxs(P), the maximum being over all points P of the 
plane which are not on the boundary of any strand. Then the weave F 1s said to be 
f-fold. Most of the other definitions and properties of fabrics carry over to 
pseudofabrics without any modifications. For example, an isonemal pseudofabric 
may be defined as one whose symmetry group is transitive on its strands. The 
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Fic. 16. A 3-way, 3-fold isonemal pseudofabric. Despite differences in overall appearance, it is not hard 
to verify that it is of the same henomeric type as the pseudofabric in FiGURE 15 and the “mad weave” 
fabric of FIGURE 3. The homeomeric types of these three weaves are all different. 


weaves of FIGURES 4(c), 4(d), 15, and 16 are all isonemal; moreover, it is easy to 
verify that those shown in FIGURES 4(c), 15, and 16 are of the same henomeric type. 

The main result of this article has an analogue for pseudofabrics. It can be 
proved that w-way, f-fold isonemal pseudofabrics exist if and only if 2 < f < w for 
w = 2,3,4, 6, or if (w, f) is one of the pairs (2, 4), (3, 4), (3, 5), (3, 6). In each case 
there are infinitely many fabrics of different henomeric types. 

(f) Another generalization of fabrics, due to J. J. Pedersen, is to “fabrics” on 
polyhedral surfaces. A discussion of isonemal fabrics on such surfaces appears in 
[36]. 

(g) The first version of this article, entitled “The Geometry of Fabrics,” was 
written in 1978 and presented at the Summer Meeting of the American Mathemati- 
cal Society [14]; about a hundred copies were distributed at the meeting, or in 
response to requests. The publication was delayed, in part because small sections of 
that first version grew, on elaboration, into articles of considerable size; this led to 
publications [15], [18], [19], and [20]. The main result of the present paper (Theorem 
5 of the 1978 version) has been described in several publications, in particular [14], 
[17], [18], and [36]; for related results concerning transitive families of straight lines 
see [21, Section 7.7]. Several other versions of the manuscript were prepared over the 
years, but at each stage there seemed to exist some reason for delaying the 
publication. 
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(h) The terminology of the various kinds of objects considered here has not been 
standardized so far; therefore, we have found it worthwhile to introduce different 
words for the different concepts of prefabrics, fabrics, and weaves (pseudofabrics). 
In all the cases we have tried to combine mathematical precision with common 
sense. The word “isonemal,” introduced in [14], appears to have been universally 
accepted. Other terms (such as mononemal, twillin, w-way, /-fold, etc.) which we 
introduced in the various papers seem also to have met acceptance, or at least 
aquiescence. The word “fabric” has also been accepted quite generally. In some 
publications (see [23], [24]) the authors use “fabric” for what we call prefabrics. 
There is clearly no objection to authors using words in whatever manner they find 
convenient, but we strongly deplore the fact that they make no distinction between 
fabrics and prefabrics, and even attempt to justify the fact that no such distinction is 
made. (Our criticism in [19] that this gave rise to unnecessary confusion led to the 
polemic in [25], but the statements made there are fallacious besides misrepresenting 
the facts and the sources quoted.) We hope that the present paper will contribute to 
an accepted terminology. 

(i) Several recent papers deal with 3-way fabrics. As already mentioned, [4] and 
[9] consider criteria for deciding whether a given 3-way prefabric hangs together. In 
[34] a study is begun of 3-way weaves which are the analogues of twills. In [11] 
several 3-way weaves are constructed, and the effects of using strands of various 
colors in the “mad weave” are investigated. 


8. Practical uses of 3-way fabrics. Until very recently there seem to be no records 
or samples of woven textiles with strands in three or more directions; for informa- 
tion about the present situation see [5, p. 172], [39], [41]. But in related 
fields—basketry and plaiting in their different forms—the uses of several 3-way 
weaves are widespread, and go back to very ancient times. (More complex weaves 
are also quite common; for example, a 4-way weave is often used for seats and 
backs of cane chairs. However, since these weaves are not isonemal, we shall not 
discuss them here.) 

The isonemal 3-way weave most frequently used in basketry is the pseudofabric 
shown in FIGURE 4(c). It has been called “hexagonal plaiting” or “hexagonal work” 
((31], [33]), but these terms have also been applied to more complicated weaves. Its 
popularity is probably a result of its great strength and rigidity combined with light 
weight. For uses of this weave (and related ones) in various cultures, in objects 
ranging from cradles and crabpots to hats and snowshoes, see, for example [7], [27], 
[28], [29], [35], [37]. 

Another very popular 3-way isonemal weave (actually, a fabric) is the “mad 
weave.” It is, apparently, quite old in various cultures of Southeast Asia. The first 
description of it in publications accessible to the West is by Bland [1] in 1906. The 
name of that weave seems to be related to the difficulties one may experience in 
practical work with it. These difficulties led to several detailed “how-to” descrip- 
tions [12, pp. 111-133], [22, pp. 172-183], [32], [40]. In fact, the “mad weave” as 
such is not all that hard—practical difficulties arise only if it is desired to enclose a 
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volume as for a basket, or to weave a polyhedral surface like one of those described 
in [36]. It appears that in the original version described in [1], the final product was 
a 3-way, 6-fold fabric! Moreover, although this 6-fold fabric is of the simple type, 
obtained by using a double thickness of each strand, the actual weaving was done 
the “hard way”: the material used had one shiny side, and the first step consisted in 
producing a 3-fold weave with one shiny side; then the other three layers were 
worked into this weave in such a way that the other side also became shiny! This 
seems to be the only kind of isonemal 6-fold weave which has been found in objects 
of practical use. 
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A Dozen Difficult Diophantine Dilemmas 


ANDREW BREMNER 
Department of Mathematics, Arizona State University, Tempe, AZ 85287 


RICHARD K. Guy 
Department of Mathematics & Statistics, The University of Calgary, Calgary, Alberta, Canada T2N 1N4 


Most diophantine equations present a dilemma. Do you reach for a computer or 
a cold towel? Do you try to prove nonexistence of solutions or do you try to find the 
general solution, or at least an infinite family of solutions? Diophantine equations 
illustrate the dilemma of mathematical research in a very pure form: Do you try to 
prove a theorem, or do you search for a counterexample? 
E. T. Prothro (1120 S. Duck, Stillwater, OK 74074) asks us to show that the 
equation 
xy(x4 — y4) = 22w(24 — w4) (1) 


has no nontrivial solution in integers, or to exhibit such a solution. 

By coincidence, in Crux Mathematicorum 10 (1984) Problem 994, Ernest J. Eckert 
asked if there are two distinct primitive Pythagorean triples whose products are 
equal, i.e., a nontrivial solution of 


xy(x4 — y*) = zw(z* - w4). (2) 
Computer searches over a small range failed to find any solution to (1) or (2). 
One feels instinctively that it is a difficult problem to decide whether there do 
indeed exist solutions. In geometric language, each of the equations (1) and (2) 
represents a sextic surface in three-dimensional projective space. There is reasonable 
knowledge of the arithmetic-geometric properties of surfaces of degrees 2, 3, and 4; 
but for higher degrees relatively little can be said. It is true, for instance, that on a 
nonsingular surface of degree greater than 4, there can exist only finitely many 
curves of genus zero. 
It seems necessary to treat each such equation on its own merits, though we 
cannot think of any equation representing such a surface that has been satisfactorily 
solved by nontrivial methods. Perhaps the simplest example of the family is 


x + y®& = 7° + w® (3) 
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for which, again, a complete solution is not known. By analogy with the recent 
proof by Faltings [11] of the Mordell conjecture, it may be that where some 
geometric invariants of a surface pass beyond a certain bound, there can be at most 
finitely many rational points on that surface. 

Similarly, no nontrivial solution of 

x°+yp=2+wW? (4) 
is known [D1 in 12]. The paper [14] of Lander, Parkin, and Selfridge provides many 
other examples concerning sums of like powers: see also [2], [3], [6], [7]. 

No note on difficult diophantine equations is complete without a reference to 

Fermat’s equation [D2 in 12], 
x" + y"= 2", n>3 (5) 
and to the equation* 
xi¢tyt+zt=w4 (6) 
associated with a conjecture of Euler [D1 in 12]. 

A host of problems arises from Pythagorean triangles and their areas and 
medians. Dickson [9, p. 208] states that it “remains in doubt that no Heron triangle 
has more than one rational median.” A Heron triangle is one with rational sides and 
area. Notice that the rationality of the area is what makes this problem difficult: for 
example, Euler noted that the triangle with sides 136,174, and 170 has medians 
158, 127,131 (but area 240/2002). We have four simultaneous equations, three 
applications of Apollonius’s theorem and one of Heron’s formula: 

2b? + 2c? =a? + 4x? 
2c7 + 2a*=b* + 4y” 
2a* + 2b? = c* + 42? 
2b7c? + 2c2a* + 2a*b* = at + b4+ 4 + 16A?. 

Recently, Ralph Heiner Buchholz has dispelled Dickson’s doubt by finding a 
triangle with sides 146,102,52, area 1680 and medians 35,97 and 4/949, ie., a 
Heron triangle (ABC in Fic. 1) with two rational medians. This implies the 
existence of a triangle (AMG in Fic. 1) with medians 73,51, 26, area 560 and two 
rational sides. Randall L. Rathbun also found the triangle ABC, and another Heron 
triangle with sides 1750, 1252, 582, area 221760, and two rational medians, 1144 and 
433. So the status of the problem is very similar to that of the rational cuboid [15]: 
we have seven quantities (here 3 sides, 3 medians, and the area; for the cuboid, 3 
edge-lengths, 3 face-diagonals, and the body-diagonal) of which any six may be 
rational, but no one knows if all seven can be. 

A further geometrically inspired problem is to find seven points, no four on a 


(7) 


circle, no three in line, that define (7) rational distances, 


(x, — x,)° + (y;, - y;)° = 4? (l<i<j<7). (8) 


*Noam Elkies, of Harvard University, has found an infinity of solutions of equation (6). 
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Fic. 1. Triangle whose sides are two-thirds of the medians of another triangle. 


Here, only the d;, are required to be rational, not necessarily the x,, ¥;. 

It seems particularly difficult to determine if there is a point at integer distances 
a, b,c, d from the corners of a square of integer side s. Find solutions to the pair of 
simultaneous diophantine equations [D19 in 12]: 


(s? + b? — a?) + (5? + b? - c?)° = 467s? 
aa+c?=b*+d’ 
or prove that there aren’t any. It is known [5], [10] how to construct infinitely many 
parametric solutions to the first of these, but no solution has yet been found which 
also satisfies the second. 
Find all points at integer distances from the vertices of an integer sided 


equilateral triangle. If s is the side of the triangle, and x, y, z the distances, then the 
cosine formula yields 


(9) 


stat xt tyttzt=s2x2 + s2y? + 5227 + p22? + 27x? + xy? (10) 
and symmetry shows that solutions come in sets of 24. It is not hard, using 


Ptolemy’s theorem [8, p. 42], to find an infinite number of solutions with the points 
on the sides, or on the circumcircle, of the equilateral triangle (Fic. 2). 


Fic. 2. Points at integer distances from the corners of an equilateral triangle. 
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However, Arnfried Kemnitz notes that x = m? + n*; y,z =m? + mn + n? with 
m = 2(u? — v*), n =u? + 4uv + v* gives s = 8(u* — v*)(u2 + w+ v") and an 
infinity of solutions in the which the point lies in general position with regard to the 
triangle. A computer search showed that (57,65,73,112) was the smallest such 
solution (FIG. 3). 


112 112 


112 


Fic. 3 


Since we produced an earlier draft of this article, John Leech has sent us a neat 
and elementary proof ‘of the fact that the points at rational distances from the 
vertices of any triangle with rational sides are dense in the plane of the triangle. 
This result was earlier proved by Almering [1]. 

Leech also observed that this problem has a sort of dual: find scalene triangles 
with integer sides, whose Fermat point (where the sides each subtend an angle of 
120°) is at integer distances from the vertices. This requires that each of 


yrtyzt2z? z7+2x4+x7 x*4+xy+y? 


be made a square. Dickson [9, p. 511] indicates that J. Cunliffe did this in 1809 by 
equating the first two expressions to (y + z — m)?,(z + x — n)’, giving x and y in 
terms of z (and m and n), whereupon the third expression gives a quartic in z, 
which is “made square in a classical manner.” There is a strong similarity to the 
rational cuboid problem [15], [17]. We are dealing with the intersection of three 
quadrics in five-dimensional projective space. Such an object is a surface (in general, 
of type K3), but here, as in the case of the rational cuboid, the surface possesses 
singular points. At present, the general solution of such equations seems to be 
beyond reach. 
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Leech also asked (85-05-20) for triads of squares with equal sum and equal 
product: 


xe+y?tzr7=ar*+hb?4+7,  x*y2z?2 = abc?. (11) 
For example, 37,87, 267 and 27, 137,247. He writes, “I have a few others, but no 
systematic method.” 
Mark Samuelovitch Weissmann writes from Minsk to observe that (1, —1, 
a + 1, —a) gives an infinity of solutions of Mordell’s equation [D28 in 12], 
1 1 1 21 1 


—~+—-+-4+-—4 = 0 (12) 
xX yp 2 Ww wxyz 


and that there are others, e.g., (—2,3,7,41). In fact, if we write x =u — z, 
y=v-—2z,w=t—z, then2z? -— z*(t+u+v)+ tw +1 =O. There are plenty of 
solutions, e.g., x =1—z; y,w = 2z* — 2z +1, or, more generally, if z7 -— z - 1 
=ay,z7—-z+1= Bb, y =2z(z — 1) + a8, w = 2(z — 1) + 6. For example, each 
of z = 173, 488, 668, and 983 gives 128 solutions. 

As a warning not to reach too rapidly for your computer, glance at the paper [4]. 
A more recently worked example in similar vein is that the smallest rational solution 
of the equation y* = x(x? + 1877) is 

488 114890 132279 420338 923511 111187 227812 402249 922481 

* “1170 488036 631991 182951 963428 713037 871725 514476 328100 
1 120418 474814 606797 599253 948625 207508 935768 951951 287850 506482 622440 320026 076879 

40045 200163 862032 520789 539685 815389 316574 590961 010977 921502 391797 012063 929000 © 
Clearly one sometimes needs to think before rushing to the computer! 


Send single solutions to any of equations (1) to (9), or general solutions to 
equations (10), (11), and (12). 


yr 
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Miscellanea 


To be wholly devoted to some intellectual exercise is to have succeeded in life; 


and perhaps only in law and the higher mathematics may this devotion be 
maintained, suffice to itself without reaction, and find continual rewards without 
excitement. 


from Weir of Hermiston, by Robert Louis Stevenson 
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A Characterization of Graphs of Diameter Two 


Gary S. BLoom! 
Computer Science Department, City College, CUNY, New York, NY 10031 


JoHN W. KENNEDY’ AND Louis V. QUINTAS? 
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In a recent Note [4] and a Letter to the Editor [5] the relationship between the 
diameter of a graph and the diameter of its complement was considered. The 
authors of [4] noted that the simplicity of their method of establishing the Ringel- 
Sachs relation was tantamount to including it in an ultimately understandable, 
apocalyptic “Divine Book of Proofs.” 

We propose another related entry in the awaited tome that shows clearly how 
graphs of diameter two can be characterized by a simple condition on their 
complements. This simple condition, in turn, implies several computationally con- 
venient conditions which are sufficient to show that a graph has diameter two. 
When considered in appropriate fashion, these results are all easily grasped; 
nevertheless, we have found that they have considerable utility as lemmas to 
simplify potentially complex arguments (as exemplified in [1] and [2]). With the 
exception of our Theorem 2a, which is proved in [3] using five cases and subcases, 
we have not seen these results proved elsewhere. Parenthetically, we note that the 
theorem in [5] also appears as a lemma in [3]. 

The graph theory terminology we use is as follows. The complete graph K, has p. 
vertices with every two vertices adjacent (i.e., forming an edge). The complement G 
of a graph G is the graph with the same vertex set as G and precisely those edges 
that are not in G. A graph with no edges is called an empty graph. The distance 
between vertices x and y in G is the minimum number of edges possible in a path 
from x to y. The diameter diam(G) is the maximum distance which occurs in G, 
with the convention that diam(G) = oo when G is not connected. 

Let x be a vertex in G. A star centered at x consists of any set of edges in G 
which have x as a vertex. Let uv be an edge in G. A (maximal) double-star on uv is a 
maximal tree in G which is the union of stars centered at u or v such that each star 
contains the edge wv. A subgraph H of G is said to span G if H has the same vertex 
set as G. 


‘Research partially supported by PSC-CUNY Research Award 6-63230 of the City University of New 
York. 


*-3Research partially supported by Pace University Scholarly Research Committee and Dyson 
College Faculty Research Grants. 
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THEOREM 1. A graph G has diameter two if and only if G is not empty and G is not 
spanned by a double-star. 


Proof. G is empty if and only if G = K » and has diameter 0 or 1. 

If G is not spanned by a double-star, then in G, for every edge wv there is a 
vertex x that is neither adjacent to u nor to v. Thus, in G, x is the intermediate 
vertex in a path of length 2 between u and v. On the other hand, if u and v are not 
adjacent in G, then the edge wu is present in G. Consequently, the distance between 
any pair of vertices in G is no more than 2. _ 

If G is spanned by a double-star on edge wv, then in G every vertex x is adjacent 
to at least one of the points u or v. Thus, in G, every path between u and v must 
have length at least 3.0 

A vertex x has degree r if x is the center of a maximal star with r edges. A graph 
is r-regular if each vertex of the graph has degree r. 


THEOREM 2. A graph G on p vertices has diam(G) = 2, if G # K, and any one of 
the following conditions holds: 

(a) diam(G) > 4. _ 

(b) diam(G) = 3 and G is r-regular. _ 

(c) p> d,+d,; where d, >d,> :-: 2d, with d, the degree of vertex i in G. 

Two observations are worth noting here: When G is r-regular, condition (c) 
reduces to “p > 2r.” Secondly, when diam(G) = 2, only condition (c) of Theorem 2 
can apply. These observations can be used to infer, for example, that the diameter 
of the complement of the well-known Petersen graph is two, since the Petersen 
graph, which has diameter two, is 3-regular and has p = 10. 


Proof. (a) Use Theorem 1 to observe that diam(G) > 4 implies that there is no 
double-star which spans G. _ 

(b) Pick vertices x and y that are at distance 3 in G. Thus, stars centered at x 
and at y are disjoint. Since G is r-regular, the number of vertices in each of the 
maximal stars centered at x and y is r+ 1. Thus, G has at least 2r + 2 vertices. 
Since a double-star in G has no greater than 2r vertices, a double-star cannot 
span G. 

(c) Every double-star in G has at most d, + d, vertices. Thus, p > d, + d, 
implies that G is not spanned by any double-star. 0 
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The Jordan-Brouwer Separation Theorem for Smooth Hypersurfaces 


ELON L. Lima 
I.M.P.A., Estrada Dona Castorina 110, 22460 Rio de Janeiro, Brazil 


We give here a simple proof of the following: 


JORDAN-BROUWER SEPARATION THEOREM. Let MC R”™ be a connected, com- 
pact, orientable smooth hypersurface. Its complement R™— M has two connected 
components, each of which has M as its point set boundary. 


A subset MCR” is called a smooth hypersurface when every point x € M 
belongs to an open set U, on which is defined a smooth function m: U > R with the 
following properties: i) grad@(x) ¥ 0; ii) py 1(0) = MOU. A vector v € R™ is 
said to be normal to M at x when vu is a multiple of grady(x). The tangent space to 
M at x is the set TM of all vectors in R” that are perpendicular to grady(x). A 
map f: M — R” is called smooth when, for every x € M, f is the restriction to 
UM of a smooth map F: U > R", defined on an open set U containing x. The 
derivative of a smooth map f: M — R” is the linear map /f’(x): T7,M— R’”, 
obtained by restriction of F’(x): R™ — R”. (Recall that the matrix of F’(x) is the 
Jacobian matrix of F.) A diffeomorphism is a smooth map with a smooth inverse. 
The Inverse Mapping Theorem says that if f’(x): T.M—> RR”! is a linear 
isomorphism then f, restricted to some neighborhood V of x in M, gives a 
diffeomorphism of V onto an open subset of R”~+. (For more details, see Thorpe 
[2].) 

A smooth hypersurface M C R” is said to be orientable when it admits a smooth 
field of normal unit vectors, i.e., when there exists a smooth map v: M > R™ such 
that |v(x)| = 1 and v(x) is normal to M at x, for every x € M. 

The assumption of orientability in Theorem A is redundant: any compact 
hypersurface must be orientable. (See Samelson [1] for a short proof of the smooth 
case.) Its presence, however, makes possible an easy proof. In many cases (for 
instance, when M = S”~') orientability is known a priori. 

By “smooth” we mean C®. The proof holds verbatim for C surfaces and, with a 
small technical modification (transverse, instead of normal fields) it would apply for 
C! surfaces as well. 

Samelson’s method of proof may also be used to get Theorem A, but we believe 
that our approach is more elementary. Instead of the Transversality Theorem and 
the classification of one-dimensional manifolds, we use the well-known fact that any 
smooth vector field X: R” > R”, X(x) = (a,(%),..., @,,(x)), which fulfills the 
integrability conditions da,/0x, = 0a,/0x; (i, j= 1,...,m), is the gradient of a 
smooth function »: R” > R. 

Once and for all, we fix a smooth field of unit normal vectors v: M — R”™ and 
define a smooth map h: M X R > R”™ by A(x, t) = x + t- v(x). For any given 
e > 0, we denote by h,: M X (~«, €) > R”™ the restriction of h. 

The following standard result is included here for completeness. 
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LEMMA. Let MC R™ be a compact, orientable, smooth hypersurface. For some 
e>O0,h,: M X (~e6, €) > R”™ is a diffeomorphism onto an open subset of R”™. 


Proof. For every x © M, the derivative h’(x,0): T,.M <x R > R”™ is a linear 
isomorphism, since it sends each horizontal vector (w,0) into w and each vertical 
vector (0, ¢) into t - v(x) (which is perpendicular to w). By the Inverse Mapping 
Theorem, we may find 6, > 0 and an open neighborhood V, of x in M such that h 
maps V,. x (—6,, 6,) diffeomorphically onto an open neighborhood of x in R”. It 
remains only to show that, for some e > 0, A, is injective. Assuming otherwise, we 
would find, for each n EN, distinct pairs (x,, 5,),(¥,,t,) in M X (-—1/n,1/n) 
such that h(x,, 5,) =h(y,, t,). Since M X [—1,1] is compact we may assume (by 
taking subsequences, if necessary) that x, ~x EM, y,>yEM, s,—0 and 
t, 2 0. Then 


x = h(x,0) = limhA(x,, s,) = limhA(y,, t,) = h(y,0) = y. 


Hence lim(x,, s,,) = lim(),, t,,) = (x,0). For all large values of n, (x,,5,,) and 
(y,.t,) would belong to V, x (—6,,6,) and then h(x,,s,) #h(),,t,). This 
contradiction proves the lemma. 


We denote the image of h, by V.(M) and call it a tubular neighborhood of M. We 
also write V,[M] = h(M xX [~g, €]). 


Proof of the Theorem. We begin by showing that R™ — M is disconnected. More 
precisely, we define a smooth function »: R™— R such that M = qm ‘(0) and 
grad m(x) # 0 for every x € M. Then the open sets A = {x € R”™; p(x) > 0} and 
B= {x © R”™; p(x) < 0} are nonempty, disjoint, with R”-— M=AUB. The 
definition of is as follows. 

Let V,,(/) be a tubular neighborhood of M. Take a smooth function f: R > R 
such that f(0) = 0, f’(t) > O when —e <t <e, f(t) =e fort > «and f(t) = —« 
fort < —e. 


Any point in V,,(M) may be written uniquely as x + ¢- v(x), with x © M and 
\t} < 2e. Define a function g: V,,(M) > R by g(x + t- v(x)) = f(t). Then g 1s 
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smooth, M = g~\(0), g(x + t- v(x)) =e for t € [e,2e) and g(x + t- v(x)) = —e 
for t © (—2e, —e]. Let X: R”™ > R”™ be the vector field which equals gradgg on 
V,.(M) and vanishes outside this tubular neighborhood. The set V,[M] is compact, 
hence closed in R™. X is smooth on each of the open sets V,.(M) and R” — V[M] 
(in fact, identically zero on the latter). So, X is a smooth vector field in R™, which 
clearly fulfills the integrability conditions. Hence we may find a smooth function ¢: 
R™” — R such that grado = X. 

By adding a constant to g, if necessary, we may assume that p(x) = g(x) for 
every x in V,,(M), since this is a connected open set on which, and g have the 
same gradient. Moreover, @ is constant on every connected component of R™ — 
V.(M ), since its gradient vanishes there. Now, every such component meets V,,(M). 
[Given any y € V,,(M), let p be a point in the closed set V.[M] that minimizes 
distance from y. The line segment [y, p] hes in R” — V,(M) (hence in the 
component of y in this set) and meets V,,(M/) at all points near p.] Then g = +e 
outside V(M) and M=q71(0). If x © M then gradg(x) =c- v(x), cER, 
because the gradient is perpendicular to the level surface M. This gives 


c= (grad (x), 0(2)) = SLelx + ¢-0(x))] 0 = 1) = 0, 


hence grad : g(x) # 0 at each point x € M. 

Next, we show that the open sets A, B, defined at the beginning of the proof, are 
connected. In fact, A contains the connected set P= h(M X (0,2e)) = {x +t: 
v(x); x € M, 0 <t < 2e}. Moreover, every y € A is either in P or may be joined 
to a point p € P by a line segment [y, p] C A: just take p, (as before) as a point in 
the closed set h(M X [0, e]) that minimizes distance from y. A similar argument 
proves that B is connected. 

Finally, we prove that M is the common point set boundary of both connected 
components A, B of R” — M. 

For any x € M, the point x + t- v(x) belongs to A when 0 <¢ <e and to B 
when —e <t¢t <0. Let fr- S denote the point set boundary of a set S. This shows 
that x E fr: Afr: B for all x © M. On the other hand, if x © fr- A then 
m(x) > 0 because x € A, but (x) < 0 because x € A, so y(x) = 0. Therefore 
fr: AC M, that is, fr: A = M. Similarly, fr - B = M and the proof is finished. 


REMARK. The same kind of argument applies when the hypersurface M, instead 
of compact, is assumed only to be a closed subset of R™. One needs only to change 
the Lemma, where instead of a constant e > 0, a continuous positive function 
e: M —R must be found with the following property: if x|y are in M then 
x + s+ v(x)|y + t.v(y) for all s € (—e(x), e(x)) and t € (—e(y), €(y)). One may 
even go one step further and replace R™” by any simply-connected m-dimensional 
surface N containing the (m — 1)-dimensional surface M as a closed subset. The 
same idea still applies, except that the construction of the tubular neighborhood 
V(M) C N is more subtle. (Instead of straight line segments one may use geodesics.) 
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The Konigsberg Bridges— 250 Years Later 


PETER A. FOWLER 
Department of Mathematics, California State University, Hayward, CA 94542 


The year 1736 is generally accepted as marking the beginning of graph theory. In 
that year Euler published his article [2] on the K6nigsberg bridges problem and its 
generalizations, in which he demonstrated the necessity that every vertex have even 
degree. Remarkably, he dismissed the more difficult sufficiency argument as an 
“easy task... after a little thought.” Not until 1873 was a sufficiency proof 
published [3]. Since that time there have been many proofs published. (See [1] for 
translations of [2] and [3] and for an interesting history of the problem.) The usual 
approach considers an Euler tour as a circuit to be traced out dynamically, ever 
enlarging until the graph is covered. In honor of the 250th anniversary of Euler’s 
paper, we give a new proof in which we view the Euler tour as a static object in a 
graph and employ induction. 


THEOREM. A connected multigraph in which each vertex is of even degree has an 
Euler tour, that is, a circuit containing all the edges of the graph. 


Proof. We proceed by induction on the number of edges. If a graph has fewer 
than three vertices, the result is true by inspection, so consider a graph G with at 
least three vertices and n edges. Clearly, n is at least three. 

Assume that an Euler tour exists in any connected graph with all even degree 
vertices and fewer than n edges. Select any vertex v of G. Since G is connected and 
all vertices have even degree, there exist two distinct edges which join v to (not 
necessarily distinct) vertices u and w. Delete these two edges. If u = w, insert a loop 
at u; otherwise, join u to w by an edge. The resulting graph has all vertices of even 
degree and has n—1 edges. If it is connected, it follows from the inductive 
hypothesis that it has an Euler tour. An Euler tour for G can be obtained by 
replacing u, uw, w by u, wv, v, vw, w. If the new graph is disconnected, then it has 
exactly two components one of which contains v and the other of which contains u 
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and w. Both components satisfy the conditions and have fewer than n edges, hence 
they contain Euler tours, say, 71 and T2. An Euler tour for G can be constructed 
by replacing u, uw, w in T1 by u, wv, T2, vw, w. 

It is worth remarking that essentially the same proof can be given by using 
induction on the number of vertices. 

The author thanks David Gale and Kenneth Rebman for their comments. 
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THE TEACHING OF MATHEMATICS 


EDITED BY JOAN P. HUTCHINSON AND STAN WAGON 


We were deeply saddened to learn of the death of Tory Parsons on April 2, 1987. His enthusiasm and 
energy for mathematics made it a pleasure and a privilege to have worked with him. 


The Editors 


ON MONOTONE SUBSEQUENCES 


DONALD J. NEWMAN 
Department of Mathematics, Temple University, Philadelphia, PA 19122 


T. D. PARSONS* 
Department of Mathematics, California State University, Chico, CA 95929 


We will give a very short proof of the well-known fact that every infinite 
sequence { a, } of real numbers contains an infinite monotone subsequence. We will 
then comment on how this can best be used to advantage in teaching elementary 
courses in real analysis. Lastly, we will discuss connections between the proof and 
Ramsey’s Theorem. : 

Let S = {i|for all 7 > i, a; < a,}. If S is infinite, with elements i, <i, < ---, 
then a; <a,,< --- is monotone strictly increasing. Otherwise, S is finite. In that 
case, there is a least index i, such that for all i => i) we have i € S. Since iy & S, 
there exists i, > ig such that a; >a,. And i, € S so there exists i, > i, such that 
;, 2 4;, and so forth. Then a, >a; >a,,= -:- is monotone nonincreasing, 
completing the proof. 

Courses in elementary real analysis usually begin with the properties of the real 
number field, including the “least upper bound property” or one of its equivalents. 
It is then natural and transparently easy to prove that every bounded monotone 
sequence of real numbers converges to its least upper bound (for nondecreasing 
sequences) or its greatest lower bound (for nonincreasing sequences). The above 
proof then quickly implies the Bolzano-Weierstrass Theorem and the convergence of 
Cauchy sequences. The former follows because every bounded sequence contains a 
(bounded) monotone subsequence that must converge. The latter follows because 
every Cauchy sequence is bounded, and must converge to the same limit as any one 
of its (bounded) monotone subsequences. 

Our proof of the existence of monotone subsequences appears not to be well 
known, though it has appeared in a problem book [2], and it was independently 
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rediscovered by Parsons while working on some problems related to Ramsey’s 
Theorem. We have been unable to find it elsewhere, though we would expect that 
such a simple proof must have been discovered long ago. Our purpose in presenting 
it here is to publicize it to teachers of elementary analysis, and to point out its 
interesting connections with variants of Ramsey’s Theorem. 

The infinite version of Ramsey’s Theorem states that in any coloring, with two 
colors, of the edges of the complete graph with vertices the positive integers, there 
must exist an infinite monochromatic complete subgraph. This implies the existence 
of a monotone subsequence in any sequence of reals: a sequence { a, } induces the 
coloring in which edge {i, j}, for i < j, is colored red if a; < a, or blue if a, 2 a,. 
A monochromatic infinite complete subgraph with vertices i; <i, < --- then 
yields the monotone subsequence a, ,-.., which is increasing or nonincreasing 
according as the subgraph has its odes ‘all red or all blue. (See [1, p. 17].) 

There is a simple proof [3] of Ramsey’s Theorem that is only slightly more 
complicated than our proof above for the existence of monotone subsequences. Our 
proof corresponds to a weaker type of Ramsey theorem, involving ordered sets. 
Namely, let K be the infinite directed complete graph with vertex-set the positive 
integers and with arcs the ordered pairs (i, /) such that i < 7. Then every coloring 
(with colors red and blue) of the arcs of K contains either an infinite (directed) 
complete subgraph (isomorphic to K itself) with all its arcs colored red or else an 
infinite directed path with all its arcs blue. 

The proof of this is essentially the same as that given in the second paragraph of 
this note: in the definition of the set S, just replace “a; <a,” by “arc (i, /) is 
colored red” and consider the cases where S is infinite or finite. 

This Ramsey variant clearly implies the monotone subsequence theorem as a 
corollary. It also implies an undirected version in which the arrows on the edges are 
disregarded, so that a coloring then gives either an infinite red complete subgraph or 
an infinite blue path. And that version implies one in which we get an infinite 
monochromatic path. But these undirected versions are apparently each insufficient 
to prove the monotone subsequence theorem. 

Finally, we note that our proof used only the property that the reals are a linearly 
ordered set. Therefore, any infinite sequence of elements from any linearly ordered 
set has a monotone subsequence. 


We are grateful to Simon Goberstein, Neville Robbins, and the referee for some comments that 
improved this note. 
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Why Does the Wronskian Work? 


MARK KRUSEMEYER 
Mathematics Department, Carleton College, Northfield, MN 55057 


The approach to the question in the title that is presented here has become 
nonstandard, although it can be found in 19th century literature. This older 
approach is more easily motivated, and perhaps more convincing, than the “mod- 
ern” approach. 

Recall that if f,,...,f, are (n — 1) times differentiable functions of a real 
variable t, their Wronskian is the function W(/,,..., f,,) defined by 


f(t) h(t) an f(t) 
W(hi,..., f, )(t) = Wo) Ao) an wt) 
f"-D(1) f"-D(2) see fe-D(0) 


The question in the title refers to the following standard result. 


THEOREM 1. If f;,..., f,, are linearly dependent on some interval (with nonempty 
interior), then W(f,,...,f,,) is identically zero on that interval. Conversely, if 
W(f.,---, f,) = 0 on an interval on which f,,..., f, are solutions to the homogeneous 


linear differential equation 


YO + Dy (t) yO + ++ +pi(t)y’ + pol(t)y = 0 


with Po,--+, Py—1 defined and continuous, then f,,..., f, are linearly dependent on 
that interval. 


Along with this result, it is customary to give a warning, as follows: If W(f,,..., f,) 
= (Q but nothing else is known about f),..., f,, it does not follow that f,,..., f, are 
linearly dependent. 

This warning was first given by Peano [8], [9]; he actually had to give it twice, 
because an editor added a mistaken footnote that contradicted the main point of 
Peano’s first warning. One of Peano’s examples, which is still often used, is 
f(t) = t?, fp(t) = t|t|, for which W(f,, f,) = 0 but f,, f, are linearly independent. 
(Note, however, that f,, f, are linearly dependent if they are restricted either to the 
interval (— 00, 0] or to [0, co).) 

On the other hand, it seems unsatisfactory to have the second part of the theorem 
include a hypothesis involving the nth derivatives of f,,..., f, (since these don’t 
occur in the Wronskian itself). Other hypotheses ensuring the linear dependence of 
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fi,.--.f, when W(f),...,f,) =0 were studied by various authors after Peano, 
including Bécher [1]. Bécher gave several such criteria, including: 


THEOREM 2. Assume W(f,,..., f,) = on an interval on which the Wronskian of 
the first n — 1 functions does not vanish, i.e.. W(f,,.--, f,-1)(t) # 0 for all t. Then 
fi,.--» f, are linearly dependent on that interval. 


By induction and using continuity, one quickly derives the following from this, as 
in Hurewicz [6, p. 48]. 


THEOREM 3. If W(f,,..., f,) = 0 on an interval, then there exists a subinterval on 
which f,,..., f, are linearly dependent. 


Note that Theorem 3 implies the second part (which is the harder) of Theorem 1. 
For suppose f;,..., f, are linearly dependent on some subinterval, so we have 
Cif, + eof, +--+: +ce,f, = 0 there with not all the numbers c,,...,c, zero. Then in 
the situation of Theorem 1, both c,f, + c,f, + --: +c,f, and 0 are solutions of the 
initial value problem 


y™ + p,_s(t) yO) + +++ +py(t)y’ + po(t)y = 0, 
y(to) = 0, 
y(t) = 0,..., y"" (t)) = 0 (for any fy in the subinterval). 


Therefore, by a well-known uniqueness theorem (for a proof see, e.g., Coddington 
[3, p. 259]), c.f, + --- +c, f, must be identically 0 on the whole interval. 

Although Hurewicz’s approach clears up the situation considerably, it requires 
that Theorem 2 be proved first. There is, however, a direct proof of Theorem 3 using 
a recursive formula for the Wronskian, which formula was already known to 
Christoffel [2, p. 298] and Hesse [5, p. 249, 250]. The proof itself was apparently first 
published by Frobenius [4], who characterized it as “auf rein rechnendem Wege”’ 
(“purely computational’). Frobenius derived the recursion anew, without citing 
Christoffel or Hesse. He did not actually refer to the Wronskian (this term was 
coined by Muir in 1882; see [7]), but to the “determinant” of f,,..., f,. Apparently 
he did not realize that his conclusion was only true for a subinterval, and some time 
after Peano’s warning his method must have fallen into disuse. 

The recursive formula for the Wronskian is as follows. 


PROPOSITION. W(fj,..., f,) = fTW(h/f's (f/f - «+> Spf’) - wherever fy, is 


not zero. 
This is proved using the 


LEMMA. For any (n — 1) times differentiable function g, 


W(hi8s f285---5fn8) = "Wf. fa): 
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Proof of the lemma. 


fig fog _ fg 
W(f.2. fe... f-8) = ig) (fs) _ (M8) 
(f.e)" (fe) (fg) 

fig fg Le fig 


fig + fig’ fyg + fog’ 
ig + 2fig’ + fig” 


We can factor out g from the first row of this determinant, then subtract g’ times 
the new first row from the second to obtain 


fi thot oN 
fi ig vee ffl 
Whig, fo8s---5 f,8) =§ fi'g + hie + fig" fag f 6 . 


Now we can factor g from the second row and subtract g” times the first row and 
2g’ times the new second row from the third. Continuing in this way, we obtain 


Whig, fo85-++5 S,8) = g"W( fis. tn)s 


as claimed. 


Proof of the proposition. By the lemma, wherever f, is not zero, 


Wfis--s tr) = f7w(, hoS hi, f/ fis +s If) 


1 ho/Sfi hh; me Sil fa 
0 (f/f,)’ (f/f) oS Ch/fi)’ 
=f? 0 : 


0 (f/f) 
= STW h/t’ (f/f,)’: ce9 (f,/f)’) 
(by expanding along the first column). 


Proof of Theorem 3. We use induction on n; the case n = 1 is clear. Assume 
W(f.,---> f,) = 0 on an interval. If f, = 0 on the interval, we are done; otherwise, 
we can find a subinterval on which f, does not vanish. By the proposition, 


W(( f/f)’: (f/f), sso (f./f.)’) = 0 
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on this subinterval, so by induction we can find a subinterval on which 


(fo/f,)’,--->(f,/f,)’ are linearly dependent. Thus there are numbers c,,...,c,, not 
all zero, for which 


(f/f)! + +++ +e, f/f)’ = 0, 


that is, 


((e)f, + °°: +e. fi)/fi)’ = 


Therefore, (c,f, + --- +c,f,)/f; must be a constant c, and it follows that f,,..., f, 
are linearly dependent. 
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E 3243. Proposed by William F. Trench, Trinity University, San Antonio, TX. 


Suppose that g is constant in sign and locally Riemann-integrable on [0,+ oo). 
Suppose f is nonincreasing on [0,+ oo) and / f(x) dx = + oo. Show that 
0 


[F(x )c0s x + g(x)sin x| dx = [f(eo)sin x + g(x)cos x| dx = +00. 
0 0 


E 3244. Proposed by R. S. Rodriguez and H. Sherwood, University of Central 
Florida, Orlando. 


Suppose dy, a),..., a, are integers such that 0 < ay <a, < -:: <a,. Put 


x x? xk 


The function f is easily seen to be strictly decreasing on (0,+ 00) when each 
a,=i+1.Is f strictly decreasing for every permissible choice of ag, a;,..., @,? 
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E 3245. Proposed by Bruce A. Reznick and Lee A. Rubel, University of Illinois at 
Urbana-Champaign. 


(a) For what positive integers n does there exist an n by n matrix A over C 
having the following three properties: 
(i) n? — n of the entries are zero, 


(11) there are n distinct nonzero entries r,, ’,,..., 7,, none of which lies on the 
main diagonal, 
(i1) the eigenvalues of A are 7,, 7,,..., 7, 


(b) Same as (a) except that all entries are required to be real. 


E 3246. Proposed by Solomon W. Golomb, University of Southern California. 


Define a function y on the positive integers by putting ¥(n) = n(n), where ¢ 
is Euler’s function. (a) Is wy an injective function? (b) For how many positive 
integers n is W(n) a perfect square? (c) Given any positive integer m, prove that 
there is an integer n divisible by m such that W(n) is a perfect cube. 


E 3247. Proposed by Nick M. Kinnon, Winchester College, Winchester, England. 


Given a positive integer k greater than 2, the k-gonal numbers are the terms of 
the sequence 


1,k,3k — 3,6k — 8,..., 


which has constant second difference k — 2. The Fermat numbers are the terms of 
the sequence 


322741, 5=27 41, 17=27 41, 257=27 +1,.... 


Under what circumstances can a k-gonal number be a Fermat number? 


E 3248. Proposed by Paul Erdos, Hungarian Academy of Sciences, and John 
Selfridge, Northern Illinois University. 


(a) Let S be the set of positive integers with no prime factors bigger than 3. 
Prove that every positive integer is expressible as a sum of distinct elements of S 
such that no summand is a multiple of any-other. For example, 19 = 9 + 6 + 4. 

(b) Let T be the set of positive integers with no prime factors other than 2, 5, or 
7. Prove that every sufficiently large positive integer 1s expressible as a sum of 
distinct elements of T such that no summand is a multiple of any other. For 
example, 62 = 49 + 8 + 5. 
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SOLUTIONS OF ELEMENTARY PROBLEMS 
Different Difference Bounds 


E 3095 [1985, 428]. Proposed by Alexandru Lupas, Facultatea de Mecanica, Sibiu, 
Romania. 


Let p be a fixed natural number, (a,,)?°_, a sequence of real numbers, and 


P k{P 
A?a,, = » (-1)?~ ( Janes ~~ 
k=0 k 


If there exist constants M p> M, such that mM, < A’a, < M,, n = 0, 1,2,..., prove 
that 
: A?A : M 0,1 
< < 9 = 9 9° ee 6 
p+l me p+1 ? " 


Solution by N. Sivakumar (student), University of Alberta, Edmonton, Canada. 
Let k‘”) denote k(k + 1)---(k + p — 1). Since 


(K+ 1)P*) — kD = (p + 1)(k +1), 


it follows that 
YY (k+ 1) = (n $+ 1)?*?/( p +1). 
k=0 


The desired bounds on the pth forward difference of the A’s follow from the 
given uniform bounds on the pth forward differences of the a’s and the following 
identity between these differences: 


M4, = ren De (e+ ara (+) 


The identity (*) may be proved by induction on p as follows. 
For p = 0, (*) holds by definition. If (*) holds for a given value of p, then 


AP+l4 = APA, — APA, 


n+1 
y (k +1) A?a, — 


= ——___ k +1) ara 
(n + 2)Pr k=0 a ( ) * 
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so that 


n+l n 


(n + 1)?*?APt14 = (n4+1) ¥ (kK +1) APa, — (n+ p +2) X (kK +:1) AP a, 
k=0 k=0 


= (n+ 1)P*Paray., — (p +1) ¥ (ke + 1) Pa, 
k=0 
(n +1)?" ara, — Yo {(k+ YO? = KO Vara, 
k=0 


(k+1)?*P% ara, -— ¥ (Kk 4+ 177? AP, 


0 


n 


k 


ll 
> 
= IMs 


y (k 4 LPP Arta, 
k=0 


and (*) holds with p replaced by p + 1. 


Also solved by A. Bege (Romania), I. E. Leonard (Canada), S. Marivani, R. K. Meany, J. E. Pe€ari¢ 
(Yugoslavia), University of South Alabama Problem Group, and the proposer. 


Integral Quotients of Factorial Products 


E 3107 [1985, 591]. Proposed by Ira Gessel, Brandeis University. 


—- m\(2m+2n)! 
Show that if m and n are nonnegative integers, then ———_—-————_— 18 an 
integer (2m)!n'\(m +n)! 


Solution I by A. A. Jagers, Universiteit Twente, The Netherlands. 


m!(2m + 2n)! 2” 
Gm)inm +n nm + an YOm + 2n— 3) ++ Qm + 1) 


1 h 1 
= (-1)"2?"} —7 7] = (-1)"2"7 YP | 75 (,} 
n k=0 k 
_ nh “k\(m tn 1) 
2 (3 n—k 


a sum of integers. 


Solution II by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Denote the given quotient by A(m, n)(m > 0, n > 0). Then we immediately see that 


A(0, n) = Ga is an integer for all n > 0 


A(m,0) = 1. isan integer for all m > 0 
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Cube Inscribed in a Tetrahedron 


E 3114 [1985, 666]. Proposed by M. J. Pelling, London, England. Find the 
largest cube that can be inscribed in some tetrahedron of volume 1. 


Solution by J. G. Mauldon, Amherst College. A volume of 2/9 can be achieved by 
considering the cube 0 < x, y, z < (2/9)? in R° and the tetrahedron formed by 
the three coordinate planes and the plane x + y + z = 6'/°; this is the greatest 
possible volume. More generally we have the following: 


THEOREM. If V(T) is the volume of a tetrahedron T containing a given parallelo- 
piped II of volume V(II), then V(T) > 2V(II). 


Similarly we have its two-dimensional analogue: 


LEMMA. If A(A) is the area of a triangle A containing a parallelogram R of area 
A(R), then A(A) > 2A(R). 


The proof of the Theorem involves the Lemma, whose own independent proof 1s 
a simpler version of that for the Theorem. 


Proof of the Lemma. By an affine transformation we may assume that R is a 
square of area a*. Also we may assume that A lies in a compact disc, concentric 
with R, of radius 4a, since otherwise A has a vertex D outside this disc, and the 
convex hull of {D} UR, a subset of A, already has area greater than 2a”. By 
compactness the continuous function A(-), defined on the set of such triangles A 
containing R, attains its infimum. Thus we may assume that A(A) is minimal. 

If any one side f of A does not contain a side of R, then f meets R in only a 
single point P which must be a vertex of R and must also be the midpoint of f, 
since Otherwise A(A) could be reduced by a small rotation around P of the side f 
alone. 

Hence if no side of A contains a side of R then a side (in fact, each of two sides) 
of R is the join of the midpoints of two sides of A. Consequently, in this case, the 
opposite side of R lies in the third side of A, which contradicts the hypothesis. 

The above contradiction shows that at least one side (the base) of A, of length b 
say, contains a side of R. If the corresponding altitude of A is of length a+k 
(k > 0), then by similar triangles we have k/a=(a+k)/b and so A(A) = 
h(a + k) = a(a + k)*/(2k) = 2a? + a(a — k)*/(2k) > 2a? = 2A(R), as 
required. 


Proof of the Theorem. This proof closely follows the proof of the Lemma. Similar 
argumentation shows that we may assume that II is a cube C of volume a’, that T 
is contained in a concentric compact ball of radius 27a and, by compactness and 
continuity, that V(7’) is minimal, so that in particular V(T) < 9a?/2. 

We denote by 7* the tetrahedron whose four vertices are the centroids of the 
four faces of T, observing that the faces of 7* are parallel to the faces of 7 and that 
T* has one-third the linear dimensions of T, so that V(T*) = V(T)/27 < a3/6 by 
hypothesis and consequently, in particular, V(T*) # a?/3. 
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If any one face F of T does not contain an edge of C then F meets C in only a 
single point P which must be a vertex of C and must also be the centroid of F, 
since otherwise (cf. Pappus’ Theorem) the (minimal) volume V(T) could be reduced 
by some small rotation around P of the face F alone. 

Suppose that no face of T contains an edge of C. Then all four vertices of the 
tetrahedron T* are vertices of C, and, since V(T*) # a?/3, it follows that a face of 
T* lies in a face of the cube C. (We have here used the fact that a tetrahedron whose 
vertices are four of the vertices of C has volume either a>/3 or a*/6, the latter case 
occurring only when a face of the tetrahedron lies in a face of C.) Hence the 
opposite face of the cube C lies in a face of T, which contradicts the hypothesis. 

The above contradiction shows that at least one face F (the base) of T, of area 
A(F’), contains an edge of C and makes an angle 6 (0 < 6 < 7/4) with a face of C. 
A plane parallel to F at a distance acos @ from F meets C in a rectangle R of area 
a*sec 0 and (by the Lemma) meets T in a triangle A of area A(A) > 2a’sec 0. 

If the altitude (to F) of T is of length (k + a)cos 8, where k > 0, we have (by 
similar tetrahedra) A(F) = ((k + a)/k)?A(A) and 


V(T) = <A(F)(k + a)cos 6 = =A(A)((k + a)/k)(k + a)cos 6 
> = (2a%sec 9)(k + a)’cos 0/k? 


9a aa*(4k +a) 


= oy (RE ~ 20) 
2 oer 6K ~ 24) 
9a? > M1) 
>—_— =. 
7 7 WM 


as required, with equality only if k = 2a and A(A) is minimized in relation to the 
rectangle R contained in A. 


Also solved by P. L. Hon (Hong Kong), L. Kuipers (Switzerland), J. Dou (Spain), and M. Orlowski & 
M. Pachter (South Africa). 


Regular Simplexes in n Dimensions 


E 3133 [1986, 132]. Proposed by Peter D. Zvengrowski, The University of Calgary. 

Let A” C R” be a regular n-simplex with center at 0. Let a; = (a,,, j,..., Gn) 
be its vertices, i = 0,1,..., 2. Show that for all j and k, L7_9a@;,a;, = c6,, for some 
constant c, where Bi is the Kronecker delta. 


Solution by Bjorn Poonen, Winchester, MA. Without loss of generality, assume 
the length of each vector a; is (n/(n + 1). By symmetry, a;'a;=a;:a, for 
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J; k F l, and Li -04; = 0. Thus 


O=a;- Diaj;=n/(n+1)+n(a;-a,) and a;-a,= —1/(n+1) 
j=0 

for all i # k. Now let b; © R”** be the vector formed by placing /1/(n + 1) after 
the n components of a;. Then b;- b, = a,;:a,+1/(n + 1) = 6, for all i# k, so 
the (n + 1) X (n+ 1) matrix whose ith column is 5, is orthogonal. But the 
transpose of an orthogonal matrix is orthogonal, so 794; ;4;, = 5. 

Similarly we can prove the following partial converse: If L7_oa;;a;, = cd, for 
some c > 0, and 1?_,)a; = 0, then the endpoints of the vectors a; form the vertices 
of a regular n-simplex with center at zero. 


Proof. Without loss of generality assume c = 1. Define b,; as before. Then the 
(n + 1) X (n + 1) matrix whose ith column is b; must be orthogonal, since the 
given conditions imply that its transpose is. Hence, 6,,= b;-b;=a;-a,+ 
1/(n + 1), so for all i # 7 we have 


7 
(a;—a,;)° =a,-a;— 2a;-a;+a,;-4,; 
=n/(n+1)-—2-1/(n+1)+n/(n4+ 1). 


But if the lengths of the sides of the simplex formed by the endpoints of the vectors 
a, are equal then the simplex is regular, and its center 1s 


1/(n + 1) Y a,= 0. 


i=0 


Also solved by O. P. Lossers (The Netherlands), D. B. Shapiro, A. Tissier (France), W. P. Wardlaw, 
J. B. Wilker, K. Zacharias (W. Germany), and the proposer. 


E 3140 [1986, 216]. Proposed by Khristo Boyadzhiev, University of Sofia, Bulgaria. 
Let K = 4n?(y2 + 1). 
(a) Show that 


oo In(x + vx? — 1) 


———————- dx = K. 
J x(1 + x7) * 
(b) Also show that 
o arctan x W 
(i) —=—— dx = — + K 
J xvVx*-1 
i) f arctan x d wT K 
fy = 
(ii b Won? 3 


Editor’s note. The statement of this problem gave a value of K equal to one half 
of what appears here. All the solvers noted the misprint. 
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Solution to part (a) by Hans Kappus, Rodersdorf, Switzerland. Denote the given 
integral by J. Substituting x = 4(¢'/7 + 17/7), we obtain 


r= f° _ Intd 
= | ———_—_Inratt. 
I Gees ea " 


By partial fractions, 


where a = 3 + 272 and b = 1/a = 3 — 2y2. The substitution u = bt gives 


1 Int » Inu 
/ dt = Inain(b +1) + [°—— du, 
otta outl 


and an analogous equation holds with a and b interchanged. Hence, 


1 Int » Int a Int 
= —— dt-—lnaln(b+1)- | ——at—-Inbln(a+1) - 
4] 2[ 44 naln(b + 1) [m4 nblin(a+ 1) Ia 

a+1 a Int 1 Int 

= ~ | ——dt+ | ——dt 
in ain Lm Im 
a Int a Int 
— fia fa 
1 t+1 1 t(t +1) 


1 
substituting 7 for t in the second integral 


dt 


alnt 
=In’ta- | —adt 
na i] 


1 
= FIn’a = 2In’'Va = 2In(V2 +1). 


Dividing by 4 completes the argument. 
Solution to part (b) by the proposer. Consider the function 
arctan ax 
Fla) = [Vine ee ik, Oa <1 
1 xvx*-1 
By Leibniz’s formula for differentiation, 


F(a) = ——— arctan -——— + per a” 


v1 v1 (1 + area — 
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| 
Put ¢? = 1 — (1/x”) and note that arcsina = arctana/ y1 — a’ to obtain 


arcsin a a dt 
F’(a) = = + | rR 
arcsina 1 lta? t+t . 
= —————_ |) |= —_—_—_—_—_——_ 
V1 — a’ 2V1 + a? Vlt+a*—t 


arcsin a 1 1 tn lt+art+a 
V1 — a’ 2V1 + a? Vl+a’*—a 
arcsin a In(V1 +a*7 +a 


V1 — a? V1 + a 
Integrating, we obtain 
1 1 2 
F(a) = 5 (arcsin a)" a 5 (In(a +V1+ a? )) + C. 
Now F(0) = 0 implies that C = 0. Letting a approach 1 gives us the result in part 
(i). 
We may derive the result in part (ii) from that in part (i) by noting that 
arctan x = 7/2 — arctan(1/x). Hence 
ii arctan x heat f dx f arctan(1/x) 
ee Jy = — | ee — Of 
0 V1l—x? 2/0 v1 — x? 0 vyl—x? 
qr? co arctan t 
aod 


—— dt 
tvi7 —1 
(setting t = 1/x) 


Pia 


= —-K. 
8 


Editorial comments. O. P. Lossers and the University of South Alabama Problem 
Group evaluated the integral in part (a) by residues. N. Ortner expressed the 
integrals as double integrals and evaluated them by interchanging the order of 
integration. The proposer solved part (a) by differentiating the function 


o In(ax + vax? — 1) 
F (a) = i] 2 

L/e x(1 + x*) 
Most of the other solvers converted the integrals to infinite series. These series were 
summed most often by reference to handbooks, tables, or other sources. 


Also solved by P. Bracken (Canada), K. W. Lau (Hong Kong), O. P. Lossers (The Netherlands), 
W. A. Newcomb, The University of South Alabama Problem Group, J. E. Wilkins, Jr., and J, W. 
Wrench. Part (a) was also solved by the Chico Problem Group, N. Ortner (Austria), K. Zackarias (West 
Germany) and the proposer. Part (b) was also solved by L. M. Christophe and A. J. Krishna (student). 


ax, a> 0. 
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ADVANCED PROBLEMS 


6563. Proposed by C. A. Spiro, SUNY at Buffalo. 


Prove that, if N is sufficiently large, at least one of the integers N, N+ 1,..., 
N + 9 has more than two distinct prime factors. 


6564. Proposed by Gerald Weinstein, C.U.N.Y. 


Define S(@) = lim, _., , ,,(sin ny”. 

(i) Show that S(2) does not exist. 

(ii) If C is a constant such that | p/q — 7| > q~ © for all pairs p, q of sufficiently 
large positive integers, prove that S(@) exists and is zero for all 6 > 2C — 2. 

Mahler [ Indagationes Math, 15 (1953) 30-42] showed that we can take C = 41.09. 
For the best known value of C see D. V. Chudnovsky and G. V. Chudnovsky, 
Springer Lecture Notes 1052, (1984) 37-84. 


6565. Proposed by L. A. Rubel, University of Illinois at Urbana-Champaign. 


Suppose A = {a,, a5, a3,...} 18 a Sequence of distinct positive integers and 
B = {b,, b,, b;, ...} is another. Suppose A contains arbitrarily long arithmetic 
progressions. 

(a) If |b, — a,| < K for all n, where K is a fixed positive number, prove that B 
must contain arbitrarily long arithmetic progressions. 

(b) If °° ,1/{1 + |b, — a,|} diverges, does the same conclusion hold? 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Enigmas of Chance: An Autobiography. By Mark Kac. Harper & Row, Publishers, 
New York, 1985, xxvii + 163 pp. 


WOLFGANG FUCHS 
Department of Mathematics, Cornell University, Ithaca, NY 14853 


The Sloan Foundation had the splendid idea of asking eminent scientists to write 
their autobiographies, concentrating on their life in science; how it shaped their 
existence, what it means to them and what they see as its aim. Enigmas of Chance is 
the autobiography that Mark Kac wrote at the Foundation’s request. It is a fitting 
memorial to a great mathematician and a fine person. Unfortunately, he did not live 
long enough to see its publication. 

The manner of writing is delightful, witty and lucid, very much like Mark’s* 
conversational style. Those who had the good luck of knowing him personally will 
hear his voice behind the printed page. 

The book does many different things and can be enjoyed at many different levels. 

It recalls the life of a brilliant man in a series of charming vignettes and 
delightful anecdotes. It is a (deservedly) happy story of success in his work and 
happiness with his family and friends. Mark’s personality emerges with wonderful 
clarity, excepting only that his modesty did not let him dwell on some sides of it, 
like his extraordinary kindness and helpfulness to those in trouble, be they flounder- 
ing graduate students or victims of totalitarian oppression. 

Although Mark’s life deserves to be called happy, it lay under the shadow of the 
Holocaust in which all his Polish family was wiped out. This makes his book a 
moving document of what it meant to be a Jewish scientist born in Poland, living 
through the dark times of our 20th century. 

Mark tells vividly of his incurable infection by the mathematical virus while still 
a school boy, and gives an excellent account of the continuing affliction: The total 
absorption in a problem, the frustration when things do not work out, the elation 
when they do. 

In keeping with the Sloan Foundation’s charge, there is a very interesting 
description of Mark’s mathematical evolution, a discussion about the nature of 
mathematics and even some actual mathematics (one chapter is ““The Search for the 
Meaning of Independence”). Judging from a very small sample of nonmathematical 


*T hope that the reader does not mind if I refer to my friend for the remainder of this review as Mark. 
(I do this despite my strong disapproval when hearing a well-known historian refer to Churchill as 
Winston during World War II. But that’s different, isn’t it?) 
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family and friends who have read the book, this chapter conveys the excitement and 
enthusiasm of discovery, even if the material itself remains inaccessible. 

Since the general public (and expert opinion, too) is not even clear about what 
constitutes the subject matter of mathematics, the discussion of the nature of 
mathematics required great skill and careful selection. Mark steered clear of the 
philosophy of the subject and of foundational questions, except for a quick sneer at 
Bertrand Russell’s well-known dictum that mathematics can be defined as the 
subject in which we never know what we are talking about, nor whether what we are 
saying is true. On the foundations Mark has this to say: “If a subject is robust, it 
should be insensitive to its foundations, and therefore too great an emphasis on the 
latter tends to produce a misleading slant. Concern for the foundations should 
come, if at all, after one has a firm intuitive grasp of the subject.” 

Mark stresses the fact that mathematics is a cultural activity as old as civilization 
itself. And he underlines the esthetic component of its appeal. He points out the two 
mainsprings of mathematical activity: The search for logical perfection and simplic- 
ity (“Pure Mathematics”) and the quest for solutions of concrete problems, usually 
arising in other fields (“Applied Mathematics”). He disagrees sharply with those 
who only recognise one of these two parts as ‘True Mathematics’ and emphasizes 
the great strength that mathematics derives from its dual inspiration. 

His own research is a perfect justification of this view. Even when working on 
abstract theories his starting point was always a concrete problem, simple enough to 
allow detailed calculations, complex enough to have the essential features of the 
general case. The ability to devise such problems is the hallmark of his genius. It 
was his firm conviction that all good work was founded on one simple, good, key 
idea, “the joker,” as he called it. Once the joker was found, the rest followed, 
though it might still involve a lot of hard work. In this sense Mark was an ‘applied 
mathematician,’ no matter how abstract his subject matter might be. It is also true 
that in later life his interest shifted more and more towards physics. Yet it is my 
impression that his work in physics had an unmistakeable mathematical flavor, both 
in the choice of problems and in the standard of rigor. 

Since it is not usual to write a book review with nothing but praise, let me finish 
by expressing regret at an omission in the book. In Mark’s lifetime his special field, 
probability theory and its applications, grew explosively. To a large extent this is 
due to his own efforts. But there was also much work done by others. It would have 
been most interesting to have his comments on the progress achieved and on its 
history. He refrained from this completely; not even his famous colleague at 
Cornell, Will Feller, appears in the index. But, then, this is the life story of Mark 
Kac and it is quite proper that the wealth of his own ideas has pushed other topics 
aside. 

I am sadly aware that my enumeration of the topics of Enigmas of Chance in no 
way recaptures the charm, the ebullience, and also the well-disguised seriousness of 
the book. 

I urge the reader to rush out to get hold of a copy. 

We all owe a great debt to the Sloan Foundation for godfathering this lovely 
work. What a pity that they were not around at the time of Georg Cantor or 
Dirichlet or Euclid. 


1988] REVIEWS 63 


Tilings and Patterns. By Branko Griinbaum and G. C. Shephard. W. H. Freeman & 
Co., New York, 1986. 700 + ix pages, 1406 illustrations, cloth, $59.95. 


SOLOMON W. GOLOMB 
Department of Mathematics, University of Southern California, Los Angeles, CA 90089 


This is a marvelous book. I am tempted to say “I wish I had written it,” but the 
comprehensiveness and attention to detail exceed anything I would have ever 
contemplated, let alone executed. The book is profusely illustrated, with an average 
of two figures per page, and each picture is meticulously drawn. It must have been a 
difficult decision for the publisher to confine all illustrations to black-and-white, 
thereby foregoing the art gift book market, though the production cost would no 
doubt have been exorbitant. However, for the mathematician, all the necessary 
distinctions are amply illustrated with shadings, letters, and numbers, and an 
excellent aesthetic effect is achieved. 

One major achievement is the organization of the vast literature on planar tilings 
into a systematic structure, arranged into twelve chapters. The first eight chapters 
develop a “classical” theory of tilings, including classification of tilings with 
transitivity properties (Chapter 6), with respect to symmetries (Chapter 7), and 
when color is added (Chapter 8). Interesting recreational problems are liberally 
interspersed: rectangles of unequal squares, including Duijvestijn’s decomposition 
of a square into 21 unequal squares (in section 2.4); superposition of two polygonal 
tilings to obtain a dissection of one polygon where the pieces can be reassembled to 
form the other polygon, in the fashion of H. Lindgren’s 1972 Dover book on 
Geometric Dissections (in section 2.6); and many others. 

The last four chapters contain some advanced material, mostly rather recent, and 
(to this reviewer, at least) especially interesting. Chapter 9, “Tilings by Polygons,” 
includes a wealth of tiling material involving polyominoes and their cousins, 
polyiamonds and polyhexes, as well as a discussion of spiral tilings such as 
Voderberg’s. Chapter 10, “Aperiodic Tilings,’ begins with “similarity tilings,” 
including certain rep-tiles, which can tile aperiodically, and continues with the 
six-piece sets of Robinson, of Ammann, and of Penrose, which can only tile 
aperiodically. This topic is developed extensively, leading to the two-piece sets of 
Penrose (the kite-and-dart, and the Penrose rhombs) and Ammann’s set (of two 
similar concave hexagons), which can tile the plane, but only aperiodically. The 
central unsolved problem is suitably highlighted: Is there a single tile which can tile 
the plane aperiodically, but not periodically? Chapter 11 (“Wang Tiles”) develops 
Hao Wang’s theory of the close relationship between Turing machines, undecidabil- 
ity, and the question of whether a given set of colored tiles (actually, “MacMahon 
squares”) can tile the plane. The final chapter (“Tilings with Unusual Kinds of 
Tiles”) discusses tiles with unconventional connectivity (e.g., with reentrant 
boundaries, and disconnected tiles), tilings by irregularly shaped figures, and tilings 
with special adjacency conditions on abutting tiles. 

Sets of exercises are distributed liberally throughout the book, and it would be 
possible to use it as a text in a somewhat unconventional geometry course. The book 
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concludes with 42 pages of references, citing articles in sixteen languages, and a 
six-page index. 

The authors have made a commendable effort to cite relevant references for each 
topic they discuss. My only complaint is that sometimes the originator of an idea is 
not directly credited, if the concept appears in the work of another author who in 
turn cites the inventor. Thus, Ammann is not listed as an author, which makes it 
difficult to attribute proper credit to his discoveries. Several of my own contribu- 
tions (coining the term rep-tile and observing the aperiodic tilings they generate; the 
problem of tiling the plane using exactly one square of each integer side; etc.) are 
not directly attributed, as the reference given is to Scientific American columns by 
Martin Gardner, or to anthologies of such columns. (This may be mere petulance on 
my part. I am credited with inventing the term polyominoes in 1953, and with 
writing a principal reference on this subject in 1965.) 

As extensive as this book is, it obviously cannot present every result ever 
obtained about tiling. However, the authors have excellent taste in selecting their 
material, and make an admirable effort to cite references to the topics which they do 
not cover. In a first edition of a book of this length there are inevitably some 
typographical errors, but they are surprisingly few and far between. I recommend 
this book enthusiastically to anyone interested in problems of tiling the plane. 
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General, S, P*. Writing Mathematics Well: A 
Manual for Authors. Leonard Gillman. MAA, 1987, 
ix + 49 pp, (P). [ISBN: 0-88385-443-0] A brief, pithy, 
blunt manual for writing mathematics; uses examples 
marked by checks (,/ ) and bold crosses (x) to dis- 
tinguish right and wrong alternatives. Covers organi- 
zation, mechanics (including copy-editing) as well as 
usage (mathematical English and standard English) 
from a traditional perspective (e.g., no mention of 
the tricky problems posed by non-sexist language or 
of the new editing, layout, and procedural choices 
posed by desk-top publishing). Too short to be com- 
prehensive, but not too long to be read quickly and 
repeatedly. LAS 


General, S, P, L. No Way: The Nature of the Im- 
possible. Ed: Philip J. Davis, David Park. WH Free- 
man, 1987, xvi + 325 pp, $17.98. [ISBN: 0-7167- 
1813-8] A fascinating collection of essays by scien- 
tists, humanists, and artists on the importance and 
role of impossibility in various fields. An unusual ap- 
proach to defining the limits of human achievement; 
provides a unique cross-disciplinary application of a 
fundamental concept in epistemology and mathemat- 
ics. LAS 


General, P, L. Quiddities: An Intermittently Philo- 
sophical Dictionary. W.V. Quine. Harvard U Pr, 
1987, 249 pp, $20. [ISBN: 0-674-74351-2] Inspired 
by Voltaire’s Philosophical Dictionary, the eminent 
philosopher Quine reflects on topics in and peripheral 
to philosophy, language and mathematics, includ- 
ing belief, communication, formalism, free will, Latin 
pronunciation, longitude and latitude, mathematosis 
(a disease), ‘usage and absuage.’ Whimsically hu- 
morous intellectual free play by a great thinker, for 
browsing. RB 


Elementary, T(13: 1). Algebra Programmed, Part 
3, Third Edition. Robert H. Alwin, Robert D. Hack- 
worth, Joseph W. Howland. Prentice-Hall, 1987, 
xviii + 680 pp. [ISBN: 0-13-021940-1] Changes from 
the Second Edition (TR, March 1972) include less 
material on the structure of the real numbers and 
more emphasis on “skills and concepts involving ex- 
ponents, radical expressions, and the graphing of lin- 
ear and quadratic equations.” JS 


Blementary, T. Intermediate Algebra, Fifth Edi- 
tion. Mervin L. Keedy, Marvin L. Bittinger. 
Addison-Wesley, 1987, xxvi + 646 pp, $28.95 (P) 
[ISBN: 0-201-15379-3]; A Problem-Solving Approach 
to Intermediate Algebra, Second Edition, 1986, x + 
518 pp, $30.95 [ISBN: 0-201-12179-4]; Intermediate 
Algebra with Problem Solving, 1986, xii + 449 pp, 
(P). [ISBN: 0-201-15650-4] Three “new” entries in a 
plethora of new names and editions by the same au- 
thors. Intermediate Algebra features an “interactive 
worktext pedagogy;” changes in this “significant re- 
vision” of the Fourth Edition include a new focus on 
skill-maintenance; more applications, exercises, and 
examples. A Problem-Solving Approach covers linear 
equations earlier than usual as part of the emphasis 
on problem solving; no indication of changes from 
the First Edition. Intermediate Algebra with Problem 
Solving is a paperbound version of the previous text. 
Supplementary materials available for the series in- 
clude instructional software, computerized testbank, 
videotape cassettes, and a placement test. JNC 


Elementary, T(13: 1, 2). Business Algebra & 
Mathematics. Thomas F. George. Prentice-Hall, 
1987, xiv + 634 pp. [ISBN: 0-13-091562-9] Alge- 
braic topics up to quadratics and system of equa- 
tions, the standard deviation and measures of central 
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tendancy, and eleven chapters on topics in business 
mathematics. Appendices on accounting and income 
taxes. FLW 


Mathematics Appreciation, S*, L**. Time 
Travel and Other Mathematical Bewilderments. 
Martin Gardner. WH Freeman, 1987, ix + 295 pp, 
$12.95 (P); $17.95. [ISBN: 0-7167-1925-8; 0-7167- 
1924-X] Twelfth collection of reprints from Gard- 
ner’s Scientific American column, this one devoted 
to paradoxes (nontransitivity, time travel), puzzles 
(card problems, rubber rope), pasttimes (tangrams, 
tilings, magic squares), and much more. Includes the 
famous 1975 April Fool’s column that fooled thou- 
sands of readers with a counterexample to the four- 
color conjecture. LAS 


Education, P. No More Teacher Traps—Neither in 
the Name of Holmes Nor Carnegie. Julius G. Gold- 
berg. Vantage Pr, 1987, 410 pp, $18.95. [ISBN: 
0-533-06550-X] Goldberg, a Russian-born and edu- 
cated mathematics professor at Ohio State at Mar- 
ion, addresses problems of declining class perfor- 
mance, mutually hostile student-teacher relations, 
grade inflation and other failings of the American ed- 
ucational system in six essays: aspects of educational 
reform; how to create a teacher, lesson, student; 
shortcomings of currently proposed reforms. RB 


Education, S(16-17), P. Stochastik in der Kol- 
legstufe. Harald Scheid. Lehrbrucher und Mono. zur 
Didaktik der Math., B. 6. Bibliographisches Insti- 
tut, 1986, 250 pp, 32 DM (P). [ISBN: 3-411-03129-8] 
On probability and statistics in the secondary-school 
curriculum—what should be taught and how. JD-B 


Education, L. History in Mathematics Education. 
Ed: I. Grattan-Guinness. Belin (8, rue Ferou, 75006 
Paris, FRANCE), 1986, 208 pp, 60F (P). [ISBN: 2- 
905-746-05-X] Proceedings of a workshop held at the 
University of Toronto during the summer of 1983: 
a dozen papers by authors from around the world 
on historical themes (e.g., the origins of “proof”) 
and on teaching practice (e.g., the use of transparen- 
cies). LAS 


Education, $(15-16). Teaching with Student Math 
Notes. NCTM, 1987, iv + 123 pp, $12.50 (P). [ISBN: 
0-87353-244-9] Collection of all the four-page Stu- 
dent Math Notes from the NCTM News Bulletin 
from September 1982 through May 1986. Additional 
teacher notes include teaching ideas, solutions, sug- 
gested extensions. Perforated and reproducible, suit- 
able for teaching-training activities. MW 


Education, S(11-13), L. Enterprising Mathemat- 
ics. The Spode Group. Oxford U Pr, 1986, ix + 
181 pp, $24.95 (P). [ISBN: 0-19-853652-6] An inno- 
vative collection of rich, realistic problems from prac- 
tical life designed to motivate British high school age 
students who have been turned off by mathematics. 
Typical “problems” are projects with lengthy (page- 
long) descriptions requiring concrete mathematical 
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exploration and analysis. Grouped by theme: sport, 
travel, parties, money, home, house design, buying, 
and “odds and ends.” LAS 


Education, $(15-18). How To Evaluate Progress 
in Problem Solving. Randall Charles, Frank Lester, 
Phares O’Daffer. NCTM, 1987, v + 85 pp, $5.60 (P). 
[ISBN: 0-87353-241-4] Practical advice for teachers 
based on authors’ research. Discusses goals of teach- 
ing problem solving and describes evaluation tech- 
niques of observation and questioning, student self- 
assessment, holistic scoring as well as the familiar 
multiple choice and completion tests. Tips on or- 
ganizing evaluation and using results. Bibliography 
of problem-solving resources. Excellent for in-service 
programs. MW ~ 


Edueation, S(15-16). How to Teach Mathematics 
Using a Calculator: Activities for Elementary and 
Middle School. Terrence G. Coburn. NCTM, 1987, 
v + 58 pp, $6.90 (P). [ISBN: 0-87353-245-7] Repro- 
ducible activity sheets covering topics from numer- 
ation to prealgebra. A few straightforward “check 
your answer on the calculator” types; most involve 
pattern finding and problem solving. Useful in ele- 
mentary education. MW " 


Education, S(13-16). Problem-solving Techniques 
Helpful in Mathematics and Science. Charles A. 
Reeves. NCTM, 1987, iii + 35 pp, $4 (P). [ISBN: 
0-87353-246-5] Designed to familiarize elementary 
teachers with common problem-solving strategies, 
such as working backwards, making a chart or dia- 
gram, search for patterns on guess-check-revise. Em- 
phasis on mathematics problems. Useful as overview, 
but neophytes would need more problems. MW 


History, S, P, L*. Women of Mathematics: A Bio- 
bibliographic Sourcebook. Ed: Louise G. Grinstein, 
Paul J. Campbell. Greenwood Pr, 1987, xx + 292 
pp, $45. (ISBN: 0-313-24849-4] 43 profiles of women 
(either deceased or born before 1925) who overcame 
familial and societal pressures to achieve distinction 
as mathematicians. Each profile contains a biogra- 
phy, a summary of professional work (in nontechnical 
terms), a bibliography of writings by and about the 
subject. Profiles are written by a wide variety of 
mathematicians, historians, and scholars. A distinc- 
tive and valuable resource for anyone interested in 
equality of opportunity in mathematics. LAS 


History, P, L. Riemann, Topology, and Physics. 
Michael Monastyrsky. Transl: James King, et al. 
Birkhauser Boston, 1987, xiii + 158 pp, $39.50. 
[ISBN: 0-8176-3262-X] Translations of two works by 
a Russian physicist: a biography of Riemann, and a 
monograph on topological methods in physics. The 
biography “describes in few words and fewer equa- 
tions the revolutionary ideas which Riemann brought 
in to mathematics and physics ....” In the mono- 
graph, homotopy theory provides a unified descrip- 
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tion of physical phenomena, including defects in liq- 
uid crystal, monopoles in gauge fields. RB 


History, P, L. Kepler’s Physical Astronomy. Stud. 
in Hist. of Math. & Phys. Sci., V. 13. Springer- 
Verlag, 1987, vii + 216 pp, $58. [ISBN: 0-387-96541- 
6] Critical examination of Kepler’s Astronomia nova, 
a record of his ten-year labors on pre-Newtonian 
astronomy written to convince trained astronomers 
that the then-current planetary theory was wrong. 
Dispels notion that Kepler was merely “a diligent 
condenser of data, or a numerically inclined mystic.” 
JK 


History, L. The Mechanization of the World Pic- 
ture: Pythagoras to Newton. E.J. Dijksterhuis. 
Transl: C. Dikshoorn. Princeton U Pr, 1986, x + 
537 pp, $14.50 (P). [ISBN: 0-691-02396-4] Reprint- 
ing of the 1961 Oxford University Press edition (a 
translation of the Dutch work De Mechanisering van 
het Wereldbeeld) with a new foreword by D.J. Struik. 
Detailed, scholarly account of the development of 
a mechanistic world view. Written for the general 
reader, “who is assumed to have no special training 
in mathematics or physics.” BH 


History, P, L*. The Newton Handbook. Derek 
Gjertsen. Routledge & Kegan Paul (US Distr: 
Methuen), 1986, xiv + 665 pp, $59.95. [ISBN: 0- 
7102-0279-2] An alphabetical mini-encyclopedia of 
Newton: his theories, friends, relatives, publications, 
intellectual descendents, and cultural influences, all 
arranged alphabetically (whence ... Bible, Bibliogra- 
phies, Binomial Theorem, ...). Includes much of 
interest to anyone wanting to comprehend the in- 
credible depth and diversity of Newton’s accomplish- 
ments. Contains thorough summaries and complete 
bibliographic details on Newton’s voluminous pub- 
lished and unpublished manuscripts. LAS 


History, T*(16), S. The Origins of the Infinitesi- 
mal Calculus. Margaret E. Baron. Dover, 1987, viii 
+ 304 pp, $7.95 (P). [ISBN: 0-486-65371-4] Thor- 
ough study of the rise of geometric techniques from 
the Greeks to early 17th century. Excellent detail 
and documentation allowing reader to work through 
many problems and to find original sources. Would 
serve well as a text for undergraduate course in his- 
tory of calculus. Dover reprint of 1969 Pergamon 
Press edition (TR, October 1969). MR 


History, P, L. The Historical Development of 
Quantum Theory, Volume 5: Erwin Schrodinger and 
the Rise of Wave Mechanics, Part 1: Schrodinger 
in Vienna and Zurich, 1887-1925. Springer-Verlag, 
1987, xviii + 366 pp, $54. [ISBN: 0-387-96284-0] 
Covers in two parts Schrédinger’s path to (but not 
including) his fundamental papers of 1926 on wave 
mechanics. First part is devoted to his early life 
and work in Vienna. Second part concentrates on 
Schrodinger’s post-World War I work, mainly in sta- 
tistical mechanics, and the work of others on the 
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development of quantum mechanics and theory of 
atomic structure. MR 


Logic, P. Siz Papers in Logic. S.N. Artemov, et al. 
AMS Transl. Ser. 2, V. 135. AMS, 1987, ix + 79 pp, 
$38. [ISBN: 0-8218-3111-9] Topics are: finite model 
for finitely axiomatizable theory that is categorical 
in all infinite powers, spectrum function of countable 
theory, Gentzen systems of postulates for set theory, 
modal logic and extensions of arithmetic, complexity 
of disjunctive normal forms. KS 


Logic, T(18: 1), P. Multiple Forcing. T. Jech. 
Tracts in Math., V. 88. Cambridge U Pr, 1986, viii + 
136 pp, $34.50. [ISBN: 0-521-26659-9] Set theorists 
use forcing to construct models of set theory. This 
book presents unified treatment of product, iterated 
and proper forcing. Includes applications to algebra 
and topology. Assumes first graduate course in set 
theory. KS 


Foundations, P, L*. Reflections on Kurt Gédel. 
Hao Wang. MIT Pr, 1987, xxvi + 336 pp, $25. 
(ISBN: 0-262-23127-1] A detailed commentary on 
“a life of fundamental theoretical work,” covering 
Godel’s wide ranging contributions to mathemat- 
ics, philosophy of science, and metaphysics, as well 
as details of events and interests that marked his 
life. Written in diary style, mixing abbreviated notes 
(e.g., “G,” rarely “Godel”) with reflective commen- 
tary on the biographer’s own views and relations 
with Godel. A fascinating contribution to intellec- 
tual history, revealing unknown depth and richness 
to Godel’s thought. LAS 


Combinatorics, P. A Source Book in Matroid The- 
ory. Joseph P.S. Kung. Birkhauser Boston, 1986, 
413 pp, $59.50. [ISBN: 0-8176-3173-9] An anthology 
of papers, both classical and recent, on matroid the- 
ory. The papers are preceded by commentary provid- 
ing background, notation, and historical notes. Suit- 
able as an introduction to matroid theory. LC 


Discrete Mathematics, T(13-14: 1), L. A 
Primer of Discrete Mathematics. Daniel T. Fink- 
beiner II, Wendell D. Lindstrom. Books in the 
Math. Sci. WH Freeman, 1986, xvi + 363 pp. 
[ISBN: 0-7167-1815-4] True to its name, the book is 
a carefully-written introduction to several basic con- 
cepts presented in interesting and challenging ways. 
Includes elementary set theory, proofs, functions, re- 
lations, combinatorial methods (including generating 
functions), and graph theory with applications. Ex- 
ercises, answers. JS 


Number Theory, P. Elliptic Functions and Rings 
of Integers. Ph. Cassou-Nogués, M.J. Taylor. Prog. 
in Math., V. 66. Birkhauser Boston, 1987, xvii + 
198 pp, $29.50. [ISBN: 0-8176-3350-2] Covers re- 
cent work on the structure of the rings of integers of 
Abelian extensions of quadratic imaginary number 
fields. The approach is based on complex function 
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theory and elliptic functions, instead of algebraic ge- 
ometry and elliptic curves. RH 


Number Theory, T*(16: 1), S, L. Algebraic 
Number Theory, Second Edition. Ian Stewart, David 
Tall. Math. Ser. Chapman & Hall, 1987, xix + 
262 pp, $19.95 (P). [ISBN: 0-412-29690-X] Changes 
since First Edition (TR, May 1980) include correc- 
tions, historical updates on Fermat’s Last Theorem 
and the class number problem, minor deletions, and 
two new, short sections on computing rings of inte- 
gers and on factorization in cyclotomic fields (neither 
referred to in Index). Remains a good book. GG 


Number Theory, P. Lecture Notes in Mathemat- 
ics-1252: Représentations de Weil et GL2 Algébres 
de division et GL,,. Tetsuo Kaise. Springer-Verlag, 
1987, vii + 203 pp, $20 (P). [ISBN: 0-387-17827-9] A 
report on the author’s effort, carried out over several 
years, to develop non-abelian class field theory. SG 


Linear Algebra, T(17: 1), S. Matriz Theory: A 
Second Course. James M. Ortega. Univ. Ser. 
in Math. Plenum Pr, 1987, xii + 262 pp, $34.50. 
[ISBN: 0-306-42433-9] Developed from lecture notes 
for in-coming applied mathematics graduate stu- 
dents; review of matrix theory, linear spaces and 
operators, and canonical forms followed by chapters 
on quadratic forms and optimization, differential and 
difference equations, and miscellaneous topics. JNC 


Group Theory, P. Combinatorial Group Theory 
and Topology. Ed: S.M. Gersten, John R. Stallings. 
Annals of Math. Stud., No. 111. Princeton U Pr, 
1987, viii + 551 pp, $65; $22.50 (P). [ISBN: 0-691- 
08409-2; 0-691-08410-6] Proceedings of a Alta, Utah 
conference (July 1984) which encompasses aspects of 
group theory arising from low-dimensional topology. 
Twenty-five papers, including numerous open prob- 
lems and reports on current research. RB 


Group Theory, P. Subnormal Subgroups of Groups. 
John C. Lennox, Stewart E. Stonehewer. Math. 
Mono. Clarendon Pr, 1987, xiii + 253 pp, $69. 
{ISBN: 0-19-853552-X] An up-to-date survey of the 
subject beginning with Wielandt’s 1939 paper. Con- 
tains many results on the joining of subnormal sub- 
groups, criteria for subnormality, and characteriza- 
tions of groups satisfying various subnormality con- 
ditions. SG 


Group Theory, T(16), S. Introduction to Group 
Characters, Second Edition. Walter Ledermann. 
Cambridge U Pr, 1987, x + 227 pp, $49.50; $14.95 
(P). (ISBN: 0-521-33246-X; 0-521-33781-X] Changes 
from the First Edition (TR, June-July 1978) include 
a treatment of arithmetic properties of characters 
and the Frobenius-Schur condition for real represen- 
tations. Other notable topics include induced char- 
acters, permutation groups, and the Burnside pgq- 
theorem. SG 


Group Theory, P. Free Group Rings. Ed: Narain 
Gupta. Contemp. Math., V. 66. AMS, 1987, xi + 
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129 pp, $22 (P). [ISBN: 0-8218-5072-5] Free group 
rings are the group rings over Z of free groups. This 
research monograph systematically presents some 
developments in the subject since Magnus’ work 
(1930's); intended for workers in combinatorial group 
theory. Introduction; the Magnus embedding; the 
Fox problem and its recent solution; the Dimension 
Subgroup problem; topics and open problems. RB 


Group Theory, S(18), P. Proceedings of Groups— 
St. Andrews 1985. Ed: E.F. Robertson, C.M. Camp- 
bell. London Math. Soc. Lect. Note Ser., V. 
121. Cambridge U Pr, 1986, ix + 358 pp, $39.50 
(P). [ISBN: 0-521-33854-9] A selection of 37 papers 
chosen from the presentations given at the interna- 
tional conference on groups held at St. Andrews, 
Scotland in the summer of 1985. Includes survey 
articles by Bachmuth, Baumslag, Neumann, Rose- 
blade, and Tits. JS 


Algebra, P. Semigroups and Their Applications. 
Ed: Simon M. Goberstein, Peter M. Higgins. D Rei- 
del (US Distr: Kluwer Academic), 1987, xiv + 211 
pp, $69. [ISBN: 90-277-2463-6] A collection of over 
20 papers on a variety of topics in semigroup the- 
ory presented at a conference held at Chico State in 
1986. SG 


Algebra, P. Antimorphic Action: Categories of Al- 
gebraic Structures with Involutions or Anti-Endo- 
morphisms. W.H. Cornish. Res. & Expos. in Math., 
V. 12. Heldermann Verlag, 1986, xix + 189 pp, $30 
(P). [ISBN: 3-88538-212-1] A categorical approach to 
certain involutions of a Lie algebra, a quadratic Jor- 
dan algebra, or a free lattice in a self-dual variety of 
lattices. Explores properties of the variety formed 
by those algebras in a variety of finitary algebras 
expanded by antimorphisms which are acted upon 
by members of a fixed monoid in a homomorphic or 
antihomomorphic manner. The deepest results are 
relevant to algebras obtained from propositional log- 
ics. LW 


Algebra, S(18), P. Dimensions of Ring The- 
ory. Constantin Nastasescu, Freddy van Oystaeyen. 
Math. & Its Applic. Kluwer Academic, 1987, xi 
+ 360 pp, $74. [ISBN: 90-277-2461-X] Treatment 
of several of the concepts of dimension in ring the- 
ory, including Krull and Gabriel dimension, homo- 
logical dimension, and Gelfand-Kirillov dimension. 
Exercises, bibliographical comments by chapter; ref- 
erences; index. JS 


Algebra, S(18), P. The Algebraic Structure of 
Crossed Products. Gregory Karpilovsky. Math. 
Stud., V. 142. Elsevier Stience, 1987, x + 348 pp, $71 
(P). {ISBN: 0-444-70239-3] Assumes basic algebra 
background leading to a discussion of various ques- 
tions for cross products. Chapters on group-graded 
algebras, classical theory (the Brauer group), Clif- 
ford theory for graded algebras, primitive and prime 
ideais. Bibliography, index. JS 
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Algebra, P. Near-Rings and Near-Fields. Ed: Ger- 
hard Betsch. Math. Stud., V. 137. Elsevier Science, 
1987, xiv + 298 pp, $62.25 (P). {ISBN: 0-444-70191- 
5] Proceedings of an August 1985 conference held 
at the Mathematical Institute of the University of 
Tubingen. Consists of two invited lectures and thirty 
additional papers. Focuses on general structure of 
near-rings, which may have a non-commutative addi- 
tion and only a one-sided distributive law, but satisfy 
the other ring axioms, and of near-fields, near-rings 
whose non-zero elements form a group under multi- 
plication. GG 


Calculus, T(14: 1). Vector Calculus. Peter Baxan- 
dall, Hans Liebeck. Appl. Math. & Comput. 
Sci. Ser. Clarendon Pr, 1986, ix + 550 pp, $49.95 
(P). [ISBN: 0-19-859652-9] A first-year text covering 
the usual topics—functions of several variables, dif- 
ferentiation, integration, including the theorems of 
Green, Stokes, and Gauss. A comfortable familiarity 
with basic analysis and linear algebra is assumed, al- 
though reviewed in the first chapter. A straightfor- 
ward, concise introduction to vector calculus done 
with more rigor than usual. Many good examples 
illustrating important topics. Nice use of counterex- 
amples in exercises to show significance of theorems’ 
hypotheses. PS 


Calculus, T(13-14: 2). Hdhere Mathematik I: 
Eine Einfihrung. Gunter Hellwig. Bibliographis- 
ches Institut, 1986, 411 pp, 29,80 DM (P). [ISBN: 3- 
411-05553-7] Second Edition; first published in 1971. 
Covers largely traditional topics in calculus of func- 
tions of one variable. However, it begins with fairly 
detailed treatment of real numbers, defines sine and 
cosine as solutions of differential equations, and ends 
with sketch of development of Riemann-Stieltjes in- 
tegral. JD-B 


Calculus, $*(13), L. Motivating A-Level Mathe- 
matics: A Source Book. The Spode Group. Ox- 
ford U Pr, 1986, ix + 130 pp, $24.95 (P). (ISBN: 0- 
19-853653-4] A series of case studies in applications 
of A-level mathematics (functions, calculus, mechan- 
ics, probability, and statistics) intended to motivate 
British vocational students between 16 and 19 years 
old. Well-known examples (radioactive decay) mix 
with unconventional applications (screen slopes, ten- 
nis service, blood donors). Each case study concludes 
with a few exercises, whose solutions occupy the final 
quarter of the book. LAS 


Complex Analysis, T*(17-18: 1, 2), S*, P. 
Lectures on Complez Approzimation. Dieter Gaier. 
Birkhauser Boston, 1987, xv + 196 pp, $42. [ISBN: 
0-8176-3147-X] Translation by Renate McLaughlin of 
the 1980 Vorlesungen uber Approzimation in Kom- 
plezen, with some updates. Part I is constructive, 
a classical treatment of series expansions and inter- 
polation. Part II covers the more modern existence 
theory. Both parts are largely self-contained, yet give 
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statements (with references) of more recent develop- 
ments. Extensive bibliography. GG 


Complex Analysis, P. Lecture Notes in Mathemat- 
ics-1268: Compler Analysis. Ed: S.G. Krantz. 
Springer-Verlag, 1987, 195 pp, $20 (P). [ISBN: 0- 
387-18094-X] Proceedings of a conference on complex 
analysis held at Penn State in March, 1987. The lead 
article is an account of some of the progress in several 
complex variables over the past fifteen years. RH 


Differential Equations, T(15-16: 1), S, L. Non- 
linear Ordinary Differential Equations, Second Edi- 
tion. D.W. Jordan, P. Smith. Appl. Math. & 
Comput. Sci. Ser. Clarendon Pr, 1987, ix + 
381 pp, $24.95 (P); $42.50. [ISBN: 0-19-859656- 
1; 0-19-859657-X] Introduction to dynamical sys- 
tems on intuitive level. Emphasis on qualitative 
approach. Flexible. Many chapters self-contained. 
Emphasis on applications with examples in mechan- 
ics, electrical systems, and biology. Changes include 
many chapters revised or rewritten, added up-to-date 
material on nonlinear systems and new elementary 
treatment of bifurcation, structural stability, and 
chaotic behavior of oscillators. Well-written. Good 
exercises, Many with answers. (First Edition, TR, 
June-July 1978.) JK 


Differential Equations, P. Lecture Notes in 
Mathematics-1258: Spectral Theory of Ordinary Dif- 
ferential Operators. Joachim Weidmann. Springer- 
Verlag 1987, vi + 303 pp, $28.60 (P). [ISBN: 0- 
387-17902-X] General theory of ordinary differen- 
tial equations covering operators of arbitrary order 
n operating on C™-valued functions for arbitrary 
m. Applications to Sturm-Liouville operators and 
Dirac systems. Some new results “connected with 
existence of self-adjoint realizations with separate 
boundary conditions, multiplicity of the spectrum 
and the absolute continuity of the spectrum.” Fairly 
self-contained. For mathematicians interested in ap- 
plications to physics and to physicists with mathe- 
matical background. JK 


Differential Equations, P*. Linear Differen- 
tial Equations and Group Theory from Riemann to 
Poincaré. Jeremy Gray. Birkhauser Boston, 1986, 
xxv + 460 pp, $51.50. [ISBN: 0-8176-3318-9] Histor- 
ical origins of the geometric view of complex function 
theory culminating in Klein’s and Poincaré’s theory 
of automorphic functions. Tracing of two converging 
approaches both with roots in the theory of ellip- 
tic functions, one via the hypergeometric differential 
equation and the-other via Galois theory of polyno- 
mial equations. Extensive notes and references, often 
to original sources. JK 


Differential Equations, T(14-15: 2). Analysis 
ITI: Gewohnliche Differentialgleichungen und Ausbau 
der Infinitestmalrechnung. Erich Martensen. Bibli- 
ographisches Institut, 1986, x + 242 pp, 26,80 DM 
(P). [ISBN: 3-411-06834-5] 
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Differential Equations, P. Asymptotic Methods 
for Relazation Oscillations and Applications. Johan 
Grasman. Appl. Math. Sci., V. 63. Springer-Verlag, 
1987, xiii + 221 pp, $28 (P). [ISBN: 0-387-96513-0] 
Relaxation oscillation refers to a nonlinear type of 
oscillation often encountered in physics and biology 
in which certain systems behave as if periodically 
reset in an initial state. Grasman thoroughly cov- 
ers the method of matched asymptotic expansions to 
approximate orbits of relaxation oscillators. Begins 
with Van der Pol’s equation and gives special atten- 
tion to biological and chemical relaxation oscillators. 
Sections on free oscillation, forced oscillation and mu- 
tual entrainment, and the Van der Pol oscillator with 
a sinusoidal forcing term. With exercises. PS 


Differential Equations, T(14-15: 1). <A First 
Course in Differential Equations with Applications, 
Third Edition. William R. Derrick, Stanley I. Gross- 
man. West, 1987, xv + 545 pp, $29.76. [ISBN: 
0-314-26895-2] First-order equations, second- and 
higher-order linear equations, power series solutions, 
Laplace transforms, linear systems, numerical meth- 
ods, introduction to Fourier series and partial differ- 
ential equations. Many sections on applications; “ex- 
cursions” into non-traditional topics. Almost three 
hundred examples and 2000 exercises, many of an 
applied nature. DFA 


Differential Equations, P, L. Inverse Spectral 
Theory. Jurgen Péschel, Eugene Trubowitz. Pure 
& Appl. Math., V. 130. Academic Pr, 1987, x + 192 
pp, $29.95. [ISBN: 0-12-563040-9] Can you hear the 
density of a string? In other words, how many ways 
can mass be distributed along a vibrating string to 
produce a given set of frequencies? This is a physical 
example that leads to a question in inverse spectral 
theory. The book is based on lectures given at the 
Courant Institute in 1980 and is intended for a gen- 
eral mathematical audience. It discusses the problem 
of characterizing the Sturm-Liouville equations that 
produce a given set of eigenvalues under Dirichlet 
boundary conditions. AM 


Differential Equations, T(14-15: 1). Introduc- 
tion to Differential Equations with Boundary Value 
Problems, Third Edition. William R. Derrick, Stan- 
ley I. Grossman. West, 1987, xv + 630 pp, $32.16. 
[ISBN: 0-314-26897-9] The authors’ A First Course 
in Differential Equations with Applications, Third 
Edition, with additional material on Fourier se- 
ries and partial differential equations, a chapter on 
nonlinear equations and stability (concerning two- 
dimensional systems), and examples and exercises to 
go with these extra topics. DFA 


Partial Differential Equations, T(18), L. Par- 
tial Differential Equations: An Introduction to a 
General Theory of Linear Boundary Value Problems. 
Aleksei A. Dezin. Transl: Ralph P. Boas. Springer- 
Verlag, 1987, 163 pp, $82. [ISBN: 0-387-16699-8] The 
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point of view in this text is how the boundary condi- 
tions influence the solvability of linear partial differ- 
ential equations; the basic approach is through linear 
operators in Hilbert space. LC 


Partial Differential Equations, P. The Equations 
of Navier-Stokes and Abstract Parabolic Equations. 
Wolf von Wahl. Aspects of Math., V. E8. Friedr 
Vieweg & Sohn, 1985, xxiv + 264 pp, $25 (P). [ISBN: 
3-528-08915-6] Functional analytic consideration of 
instationary Navier-Stokes equations. Studies ad- 
missible initial data and global continuation of local 
solutions. Applications to global solvability of semi- 
linear parabolic systems. DFA 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1270: Nonlinear Hyperbolic Prob- 
lems. Ed: C. Carasso, P.-A. Raviart, D. Serre. 
Springer-Verlag, 1987, xv + 351 pp, $32.90 (P). 
[ISBN: 0-387-18200-4] Twenty-five papers from an in- 
ternational conference held in Saint-Etienne on Jan- 
uary 13-17, 1986, addressing open questions, exis- 
tence and qualitative theory for hyperbolic partial 
differential equations; and general theory and main 
applications in numerical analysis of the same. DFA 


Numerical Analysis, T(15-16: 1), L. Applied 
Numerical Methods for Digital Computation, Third 
Edition. M.L. James, G.M. Smith, J.C. Wolford. 
Harper & Row, 1985, xiii + 753 pp. [ISBN: 0-06- 
043281-0] Roots of equations, systems of equations, 
curve fitting, integration and differentiation, initial- 
and boundary-value problems for ordinary differ- 
ential equations, introduction to partial differential 
equations. First chapter presents Fortran, last chap- 
ter introduces IBM’s simulation language CSMP. 
Flowcharts and programs throughout. Exercises are 
for solution using computer programs or (in some 
cases) pocket calculators. DFA 


Numerical Analysis, T(15-16: 1, 2). Introduc- 
tion to Numerical Analysis, Second Edition. F.B. 
Hildebrand. Dover, 1987, xiii + 669 pp, $13.95 (P). 
[ISBN: 0-486-65363-3] Unabridged, slightly corrected 
republication of the 1974 McGraw-Hill edition (TR, 
November 1974). DFA 


Numerical Analysis, P. Tables for Converting 
Polynomials and Power Series into Chebyshev Se- 
ries. Herbert E. Salzer, Norman Levine. Applied 
Science, 1984, 65 pp, $12 (P). [ISBN: 0-915061-01- 
5] Tables of coefficients, computed to 35 significant 
figures, for converting polynomials in a of degree n, 
which includes truncated power series, into Cheby- 
shev series, through n = 50 for z in the interval [0,1] 
and through n = 100 for z in the interval [-1,1]. In- 
cludes a brief discussion of Chebyshev series, infor- 
mation about incorporating the tables in computer 
programs, and additional references. RH 


Numerical Analysis, T(17-18: 1). Nonlinear 


Methods in Numerical Analysis. Annie Cuyt, Luc 
Wuytack. Math. Stud., V. 136. Elsevier Science, 
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1987, 278 pp, $53.25 (P). (ISBN: 0-444-70189-3] An 
introduction to several nonlinear techniques for the 
solution of various numerical problems. The tech- 
niques result mainly from the use of Padé approxi- 
mants and rational interpolants. The text is fairly 
self-contained. Each chapter includes a section of 
remarks suggesting extensions of the theory and a 
problem section. RH 


Numerical Analysis, T(16-18: 2). Numerical Al- 
gorithms. Ed: J.L. Mohamed, J.E. Walsh. Claren- 
don Pr, 1986, xii + 356 pp, $60. (ISBN: 0-19-853364- 
0] Presents a wide spectrum of computational prob- 
lems, for example eigenvalue problems, solution of 
differential equations, and curve fitting, and the algo- 
rithms used to solve them. Includes both the theoret- 
ical and practical ideas needed to design and imple- 
ment the algorithms. Gives descriptions and sources 
of the current numerical software for the methods 
described. A good reference text for both specialists 
and non-specialists, or possibly a text for advanced 
undergraduate or graduate courses in computational 
mathematics. RH 


Numerical Analysis, P. Numerical Integration: 
Recent Developments, Software and Applications. 
Ed: Patrick Keast, Graeme Fairweather. NATO ASI 
Ser C, V. 203. D Reidel (US Distr: Kluwer Aca- 
demic), 1987, xiii + 394 pp, $89. [ISBN: 90-277- 
2514-4] A collection of twenty-five papers and ten 
extended abstracts of papers presented at the NATO 
Advanced Research Workshop which was held at 
Dalhousie University, Halifax, Canada, August 11- 
15, 1986. RH 


Numerical Analysis, S(16-17), P. Numerische 
Mathemattk fir Ingenieure. Gisela Engeln-Millges, 
Fritz Reutter. Bibliographisches Institut, 1987, xvi 
+ 524 pp, 48 DM (P). [ISBN: 3-411-03151-4] 


Functional Analysis, P. Lecture Notes in Mathe- 
matics-1242: Functional Analysis IT. Ed: S. Kurepa, 
H. Kraljevi¢é, D. Butkovié. Springer-Verlag, 1987, 
vii + 432 pp, $37.10 (P). [ISBN: 0-387-17833-3] Lec- 
ture notes from a conference at the Inter-University 
Center of Postgraduate Studies, Dubrovnik, Yu- 
goslavia, November 1985. Centers mainly on func- 
tional-analytic aspects of probability and operator 
theory on Hilbert and Banach spaces. LC 


Functional Analysis, P. Saks Spaces and Appli- 
cations to Functional Analysis, Second, Revised Edi- 
tion. J.B. Cooper. Math. Stud., V. 139. Elsevier 
Science, 1987, ix + 372 pp, $75.50 (P). [ISBN: 0- 
444-70219-9] Closely related to the theory of locally 
convex spaces and generalized Banach spaces, mixed 
spaces—or Saks spaces—are vector spaces with a 
norm and a locally convex topology. Theory of Saks 
spaces is related to special topologies used on spaces 
of functions where natural norm topologies are not 
suitable for certain applications. This Second Edi- 
tion takes recent developments in functional analy- 
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sis into account. (1978 North-Holland edition, TR, 
November 1978.) AWR 


Functional Analysis, P. Characterizations of In- 
ner Product Spaces. Dan Amir. Oper. Theory, V. 20. 
Birkhauser Boston, 1986, 200 pp, $31.50. (ISBN: 0- 
8176-1774-4] A catalog of characterizations of inner- 
product spaces by normed-space geometry and ap- 
proximation theoretic properties. Arguments pre- 
sented are mostly elementary and often geometric. 
A reference for the expert which is also accessible to 
advanced undergraduates. LW 


Functional Analysis, P. On Banach Algebras, Re- 
newal Measures and Regenerative Processes. J.B.G. 
Frenk. CWI Tract, V.38. Math Centrum, ‘1987, 201 
pp, Dfl. 31.60 (P). [ISBN: 90-6196-321-4] Formally, 
tersely, yet clearly written. Presents a characteri- 
zation of the space of homomorphisms on the Ba- 
nach algebra of complex measures concentrated on 
the positive half-line. Results on Banach algebras 
are then used to derive asymptotic results for the re- 
newal sequence and the renewal measure in the case 
that the expectation of the waiting time distribution 
is finite. LW 


Analysis, T(15-16: 1, 2). Elements of Real Anal- 
ysis. David A. Sprecher. Dover, 1987, viii + 343 
pp, $8.95 (P). [ISBN: 0-486-65385-4] A republica- 
tion of the 1970 Academic Press edition (TR, April 
1972). LC 


Analysis, P. Transactions of the Fourth Army Con- 
ference on Applied Mathematics and Computing. US 
Army Research Office (PO Box 12211, Research Tri- 
angle Park, NC 27709), 1986, xxx + 1345 pp, (P). 
One hundred technical papers from a conference held 
May 27-30, 1987 at Cornell University as part of the 
opening of Cornell’s new Mathematical Sciences In- 
stitute (MSI). Includes papers on combustion, fluid 
dynamics, stochastic analysis, parallel computation, 
numerical analysis. LAS 


Algebraic Geometry, P. Current Trends in Arith- 
metical Algebraic Geometry. Ed: Kenneth A. Ribet. 
Contemp. Math., V. 67. AMS, 1987, x + 293 pp, 
$30 (P). [ISBN: 0-8218-5070-1] Six selected talks plus 
four additional papers from the 1985 Joint Summer 
Research Conference in Algebraic Geometry held in 
Arcata, California. Main topics involve cohomology, 
hyperbolic geometry, and Arakelov theory. GG 


Algebraic Geometry, T**(17: 4), S, P. Plane 
Algebraic Curves. Egbert Brieskorn, Horst Knorrer. 
Transl: John Stillwell. Birkhauser Boston, 1986, vi + 
721 pp, $49. [ISBN: 0-8176-3326-X] Wonderful intro- 
duction to the investigation of singularities of alge- 
braic curves. Topics covered range from elementary 
methods (e.g., decomposition into irreducible com- 
ponents and Bézout’s theorem) to resolution of sin- 
gularities (e.g., localisation and topology of singular- 
ities). Along the way the author includes hundreds of 
drawings and also historical, heuristic, and method- 
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ological considerations, topics often neglected in the 
modern rush towards proving more theorems. MR 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1239: Diophantine Approzimations and Value 
Distribution Theory. Paul Vojta. Springer-Verlag, 
1987, x + 132 pp, $12.80 (P). [ISBN: 0-387-17551-2] 
A summary of the author’s research on the question 
of when algebraic varieties of dimension two or more 
have a finite number of rational points over every 
number field. SG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1269: Nash Manifolds. Masahiro Shiota. 
Springer-Verlag, 1987, vi + 223 pp, $20 (P). [ISBN: 
0-387-18102-4] A study of C’ Nash manifolds which 
are finite semialgebraic open sets glued together by 
certain diffeomorphisms. SG 


Differential Geometry, P. Lectures on Fibre 
Bundles and Differential Geometry. J.L Koszul. 
Springer-Verlag, 1986, ii + 127 pp, $12.60 (P). 
[ISBN: 0-387-12876-X] These are the well-known 
notes of Koszul’s 1960 lectures at the Tata Institute 
for Fundamental Research, Bombay, re-issued (the 
original typewriting, but improved paper). Algebraic 
(non-manifold) formulation of covariant derivation 
and related notions; connection forms on principal 
bundles; relations between covariant derivations and 
connection forms; connections in almost complex and 
holomorphic bundles. RB 


Differential Geometry, T(17: 1), L. Applicable 
Differential Geometry. M. Crampin, F.A.E. Pirani. 
London Math. Soc. Lect. Note Ser., V. 59. Cam- 
bridge U Pr, 1986, 394 pp, $34.50 (P). [ISBN: 0- 
521-23190-6] Essentially an introduction to the stan- 
dard topics of differential geometry, including discus- 
sions of manifolds, metrics, connections, Lie groups, 
spinors and bundles along with over 650 exercises. 
The presentation is reasonably clear and it would 
make an adequate text for a first graduate course in 
differential geometry. Despite the title of the book, 
however, this is not a text in which the ideas of dif- 
ferential geometry are developed within the context 
of applications. AM 


Differential Geometry, T(17-18: 1), S. Lec- 
tures on Complez Analytic Manifolds. L. Schwartz. 
Springer-Verlag, 1986, iv + 182 pp, $12.60 (P). 
[ISBN: 0-387-12877-8] Reissue of Volume 4 of Tata 
lecture series, first published in 1955. Twenty-five 
lectures, based on notes by M.S. Narasimhan, in in- 
formal style. Basics of complex variables, differential 
geometry comprise necessary background. Topics in- 
clude differential forms and operators, de Rham’s 
theorem, Kahlerian manifolds. No frills typesetting, 
but a good buy. GG 

Differential Geometry, P. Lecture Notes in 
Mathematics-1244: Manifolds with Cusps of Rank 


One. Werner Muller. Springer-Verlag, 1987, xi + 
158 pp, $18 (P). [ISBN: 0-387-17696-9] Given an el- 
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liptic differential operator on an arbitrary globally 
symmetric space with a homogeneous vector bundle, 
how does one find an explicit formula for the L?- 
index? Read this and find out. MR 

Topology, T(17: 1), S, P. Topological Graph The- 
ory. Jonathan L. Gross, Thomas W. Tucker. Ser. 
in Discrete Math. & Optimiz. Wiley, 1987, xv 
+ 351 pp, $59.95. (ISBN: 0-471-04926-3] Compre- 
hensive, “historically sensitive” introduction to the 
study of embeddings of graphs in surfaces such that 
no two edges cross. Discussion of the solution to the 
Heawood conjecture (determination of the chromatic 
numbers of non-spherical surfaces), voltage graphs, 
covering spaces, genus formulas, connections to other 
areas of mathematics. RM 


Topology, P. The Clone of a Topological Space. W. 
Taylor. Res. & Expos. in Math., V. 13. Heldermann 
Verlag, 1986, 91 pp, $25 (P). [ISBN: 3-88538-213-X] 
An intriguing paper which explores how properties 
of a topological space A are reflected by its clone 
C(A), the set of all continuous maps from the prod- 
uct of n copies of A to A where n ranges over all 
integers. Basic properties of A (connectedness, fixed 
point properties, dimension) correspond to algebraic 
or first order properties of C(A); the most interesting 
results are obtained for homotopy properties. The 
author also shows that the clone of A can be inter- 
preted as the set of continuous functions from A to 
R. LW 


Operations Research, S(17-18), P. Vehicle Rout- 
ing with Time-Window Constraints: Algorithmic So- 
lutions. Ed: Bruce L. Golden, Arjang A. Assad. 
American Sciences Pr, 1986, 177 pp, $49.75 (P). 
[ISBN: 0-935950-15-X] Papers on vehicle routing; 
also appeared as the American Journal of Mathe- 
matics and Management Sciences, V. 6, No. 3-4, 
1986. RM 


Operations Research, P. Production-inventory 
Control Models: Approzimations and Algorithms. 
A.G. de Kok. CWI Tract, V. 30. Math Centrum, 
1987, iii + 214 pp, DA. 33 (P). [ISBN: 90-6196-310- 
9] From the author’s introduction: “This monograph 
concerns the probabilistic analysis of a variety of one- 
product production-inventory models in which the 
control problem is to coordinate the production rate 
with the inventory level in order to cope with random 
fluctuations in demand.” AO 


Operations Research, P. Stochastic Games with 
Finite State and Action Spaces. O.J. Vrieze. CWI 
Tract, V. 33. Math Centrum, 1987, 221 pp, Dfl. 
34.20 (P). [ISBN: 90-6196-313-3] A slightly revised 
version of the author’s thesis. Analyzes discounted 
and undiscounted two-person zero sum stochastic 


games. Presents both structural results and algo- 
rithms. AO 
Optimization, P. Lecture Notes in Computer 


Science-268: Constrained Global Optimization: Algo- 
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rithms and Applications. P.M. Pardalos, J.B. Rosen. 
Springer-Verlag, 1987, vii + 143 pp, $15.40 (P). 
(ISBN: 0-387-18095-8] A summary of recent work 
on deterministic methods (enumerative techniques, 
branch and bound, cutting planes, etc.) for find- 
ing global solutions of constrained optimization prob- 
lems. AO 


Optimization, P. Contributions to Modern Calcu- 
lus of Variations. Ed: Lamberto Cesari. Wiley, 
1986, 231 pp, $39.95 (P). [ISBN: 0-470-20378-1] Pa- 
pers of fourteen authors contributed to the sympo- 
sium honoring Leonida Tonelli held in Bologna on 
May 13-14, 1985. Includes several survey articles 
(historical and topical) as well as some current re- 
sults. LW 


Optimization, P. Optimization Models Using Fuzzy 
Sets and Possibility Theory. Ed: J. Kacprzyk, S.A. 
Orlovski. Theory & Decision Lib., Ser. B. Kluwer 
Academic, 1987, xii + 462 pp, $99. [ISBN: 90-277- 
2492-X] A collection of papers surveying the use of 
fuzzy sets and possibility theory in optimization and 
decision theory problems. Includes an introductory 
section for the non-expert. AO 


Optimization, P. Lecture Notes in Control and 
Information Sciences-82: Analysis and Algorithms 
of Optimization Problems. Ed: K. Malanowski, 
K. Mizukami. Springer-Verlag, 1986, vili + 236 
pp, $18.60 (P). [ISBN: 0-387-16660-2] Ten indepen- 
dent articles summarizing research resulting from a 
Japanese-Polish cooperative program entitled “Nu- 
merical Methods of Optimization and Game The- 
ory.” Most of the papers are concerned with the 
modeling and analysis of distributed parameter sys- 
tems. AO 


Dynamical Systems, P. Cell-to-Cell Mapping: A 
Method of Global Analysts for Nonlinear Systems. 
C.S. Hsu. Appl. Math. Sci., V. 64. Springer- 
Verlag, 1987, xii + 352 pp, $49.80. (ISBN: 0-387- 
96520-3] Introduction to the field, with many exam- 
ples and 125 illustrations. Point mapping; simple 
cell mapping and applications; generalized cell map- 
ping; an iterative method of global analysis. Many 
references. DFA 


Dynamical Systems, P. Oscillation, Bifurcation 
and Chaos. Ed: F.V. Atkinson, W.F. Langford, A.B. 
Mingarelli. Canadian Math. Soc. Conf. Proc., V. 
8. AMS, 1986, xv + 711 pp, $75 (P). [ISBN: 0-8218- 
6013-5] A collection of 92 lectures delivered at the 
1986 Annual Seminar of the Canadian Mathematical 
Society. The lectures from the first week of the Con- 
ference focus on Sturmian theory, oscillation theory 
for second order equations, and modern generaliza- 
tions. Those of the second week present results of 
current research in bifurcation theory and chaos. LW 


Dynamical Systems, P. Instabilities and Nonequi- 
librium Structures. Ed: Enrique Tirapegui, Danilo 
Villarroel. Math. & Its Applic. Kluwer Academic, 
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1987, x + 337 pp, $69.50. [ISBN: 90-277-2420-2] A 
series of papers from a workshop in Valparaiso, Chile 
in December 1985 studying the behavior of physi- 
cal systems close to and far away from equilibrium 
where instabilities appear. Papers cover both deter- 
ministic and stochastic behavior under the influence 
of noise. MR 


Control Theory, T(18: 2), P. Structured Heredi- 
tary Systems. James A. Reneke, Robert E. Fennell, 
Roland B. Minton. Pure & Appl. Math., V. 107. 
Marcel Dekker, 1987, viii + 217 pp. [ISBN: 0-8247- 
7772-7| Title describes complex systems with alge- 
braic relationships between operators (on appropri- 
ate function spaces) describing system components. 
Problems such as optimal control, parameter estima- 
tion, and decentralized control in four main classes 
of systems are studied. Many examples, some prob- 
lems, and sparse index. MR 


Control Theory, S(18), P. Lecture Notes in Con- 
trol and Information Sciences-99: Robust Stabiliza- 
tion Against Structured Perturbations. S.P. Bhat- 
tacharyya. Springer-Verlag, 1987, ix + 172 pp, 
$25.10 (P). [ISBN: 0-387-18056-7] Addresses prob- 
lems connected with stability of transfer functions of 
feedback control for engineering plants (e.g., aircraft, 
robots). Two topics serve as themes: first, how to 
determine largest hypersphere of stability; and sec- 
ondly how to design transfer function stable under a 
given class of perturbations. MR 


Control Theory, P. Systems Analysts by Graphs 
and Matroids: Structural Solvability and Controlla- 
bility. Kazuo Murota. Algorithms & Combin., V. 
3. Springer-Verlag, 1987, ix + 281 pp, $48 (P). 
[ISBN: 0-387-17659-4] An eloquent and thorough 
monograph which presents systematic procedures for 
structural analysis of large-scale systems. The the- 
ory is developed and then applied to existing as well 
as new models of dynamical systems. Emphasizes 
the importance of relevant physical observations for 
successful modeling. LW 


Probability, T(15-16: 2), L. Probability Theory 
for Engineers. V.P. Chistyakov, B.A. Sevast’yanov, 
V.K. Zakharov. Transl. Ser. in Math. & Eng. Opti- 
mization Software, 1987, xiii + 161 pp, $30. [ISBN: 
0-911575-13-8] Introduction to basic probability the- 
ory at the undergraduate level requiring only calculus 
as background. Uses random variables throughout. 
Clear explanations along with plenty of examples and 
problems. MR 


Probability, P. Lectures on Topics in Probability 
Inequalities. M.L. Eaton. CWI Tract, V. 35. Math 
Centrum, 1987, v + 197 pp, Dfl. 30.40 (P). [ISBN: 
90-6196-316-8] Based on lectures given by the author 
at the University of Amsterdam. Topics include ma- 
jorization results and their extension to other group- 
induced orderings, log concavity, association, and the 
FKG inequality. Also includes a review of back- 
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ground material and an appendix on topics in con- 
vexity. RH 


Stochastic Processes, T(18), P. Stochastic Dif- 
ferential Systems: Analysis and Filtering. V.S. Pu- 
gachev, I.N. Sinitsyn. Wiley, 1987, xx + 549 pp, 
$95. [ISBN: 0-471-91243-3] Translated from the 1985 
Russian edition. For specialists in applied math- 
ematics. Linear and nonlinear systems. Optimal, 
suboptimal, conditionally optimal filtering and ex- 
trapolation. Exercises. Over 100 references to the 
literature. DFA 


Stochastic Processes, P. Lecture Notes in 
Mathematics-1233: Stability Problems for Stochas- 
tic Models. Ed: V.V. Kalashnikov, B. Penkov, 
V.M. Zolotarev. Springer-Verlag, 1987, vi + 223 pp, 
$19.40 (P). [ISBN: 0-387-17204-1] Proceedings of the 
ninth international seminar in Varna, Bulgaria, May 
1985. Fifteen papers on approximation problems 
considered as stability problems and use of proba- 
bility metrics on spaces of random variables. MR 


Elementary Statistics, T(14-16: 1), C. Introduc- 
tion to Probability and Statistics for Engineers and 
Scientists. Sheldon M. Ross. Ser. in Prob. & Math. 
Stat. Wiley, 1987, xv + 492 pp, $32.95. (ISBN: 0- 
471-81752-X] A well-written, thorough introduction 
to probability and statistics. Assumes a knowledge 
of elementary calculus. The examples and exercises 
are plentiful and especially interesting. Includes a 
diskette of 35 programs for the use of students who 
have access to a personal computer. For students 
without such access, tables are included to enable 
them to solve the problems in the text. RH 


Elementary Statistics, S$(13-14). Elementary 
Statistics for IBM PCs. Donald W. Moffat. 
Prentice-Hall, 1988, xv + 279 pp, $23.95 (P). [ISBN: 
0-13-260050-1] A collection of BASIC programs to- 
gether with examples of their use. Could be used to 
supplement an introductory textbook, but could not 
substitute for one. Unfortunately, the programs are 
not available on diskette and must be entered from 
listings in the book. AO 


Elementary Statistics, T(13-14: 1, 2). Intro- 
duction to Statistics, Fourth Edition. Sol Weintraub. 
University Statistical Tracts (75-19 171st St., Flush- 
ing, NY 11366), 1987, xii + 403 pp, $34.95. [ISBN: 
0-931316-02-2] An introduction to statistics which 
students should find very readable. Calculus is not 
assumed. Many examples are worked out and plenty 
of exercises are included. The tables have been newly 
computed. Proofs and special topics are presented in 
the appendices. (Third Edition, TR, February 1979.) 
RH 

Statistics, T*(13-14: 1, 2), S, L. Statistics To- 
day: A Comprehensive Introduction. Donald Byrkit. 
Benjamin/Cummings, 1987, xxv + 939 pp, $32.95. 
[ISBN: 0-8053-0740-0] Presupposes no college math- 
ematics. The usual topics plus some explanatory 
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data analysis, two-way analysis of variance, and mul- 
tiple regression. MINITAB or SAS is used through- 
out. FLW 


Statistics, T(16-18), S, P. New Perspectives in 
Theoretical and Applied Statistics. Ed: Madan Lal 
Puri, José Pérez Vilaplana, Wolfgang Wertz. Ser. 
in Prob. & Math. Stat. Wiley, 1987, xxiii + 
544 pp, $59.95. [ISBN: 0-471-84800-X] Papers from 
the Third International Meeting of Statistics held in 
Spain in 1986. Covers new developments in model- 
ing, design, time series, density estimation, and many 
other areas. FLW 

Statistics, S(15-18). Modern Statistical Selection: 
Part II of Proceedings of the Conference on Statisti- 
cal Ranking and Selection— Three Decades of Devel- 
opment. Ed. M. Haseeb Rizvi. Amer. J. of Math. 
& Manag. Sci., V. 6, Nos. 1 & 2. American Sciences 
Pr, 1986, 249 pp, $49.75 (P). [ISBN: 0-935950-14-1] 
Statistics, T(15-16: 1, 2), S. Statistics for Eco- 
nomics, Business Administration, and the Social Sci- 
ences. Erling B. Andersen, Niels-Erik Jensen, Nils 
Kousgaard. Springer-Verlag, 1987, xi + 439 pp, $28 
(P). [ISBN: 0-387-17720-5] Translation of a Danish 
text. Presupposes calculus and some matrix algebra. 
Includes chapters on multiple regression, autocorre- 
lation, and contingency tables. FLW 


Statistics, S(18), P. Normal Approzimation and 
Asymptotic Ezpansions. R.N. Bhattacharya, R. 
Ranga Rao. Robert E Krieger, 1986, xiv + 291 
pp, $46.95. [ISBN: 0-89874-690-6] Corrects misprints 
and adds seventeen-page chapter of updates to orig- 
inal 1976 Wiley edition (TR, October 1976). GG 


Statistics, T(13-14: 1), S. Modern Elementary 
Statistics, with Theoretical Supplement and BASIC 
Programming. Donald W. Zimmerman, Richard H. 
Williams. Ser. in Math. & Manag. Sci., V. 16. 
American Sciences Pr, 1986, viii + 206 pp, $27.50 
(P). [ISBN: 0-935950-16-8] Attempts to reveal some 
of the theory behind elementary statistics without 
bogging down in mathematical details or presup- 
posing calculus. Includes an introduction to BA- 
SIC. FLW 


Elementary Computer Science, T(13: 1). Pas- 
cal and Its Applications: An Introduction to Modern 
Programming. Larry Joel Goldstein. Holt, Rinehart 
& Winston, 1987, xi + 619 pp, $23.50 (P). [ISBN: 
0-03-009928-5] Problem-solving approach. Empha- 
sis on structured programming and program design. 
Many examples and exercises. Introduces procedures 
early. Effective warnings about common program- 
ming pitfalls. Attractive book with an easy-to-read, 
often entertaining style. DFA 


Elementary Computer Science, T*(13: 1), L. 
Intermediate Problem Solving and Data Structures: 
Walls and Mirrors. Paul Helman, Robert Veroff. 
Ser. in Struct. Prog. Benjamin/Cummings, 1986, 
xxi + 612 pp, $32.95. [ISBN: 0-8053-8940-7] Solid 
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CS2 text, with enough topics to cover a variety of 
courses. Review of Pascal, nice treatment of recur- 
sion (“mirrors”), data abstraction (“walls”), lists, 
trees, tables, sorting. Nice program development, 
code examples (available on disk), exercises. RM 


Programming, T(13: 1). Structured VAX BA- 
SIC. R. Hirschfelder, et al. Benjamin/Cummings, 
1986, xxili + 643 pp, $28.95 (P). [ISBN: 0-8053- 
3690-7] Standard, readable introduction to program- 
ming with VAX BASIC, with emphasis on algo- 
rithms, pseudo-code, structured design, modularity, 
good style, with some advanced topics. Integrated 
with VAX/VMS command syntax. Sample programs 
in the text follow the stated guidelines. RM 


Programming, P. Engineering and Scientific Com- 
putations in PASCAL. Lawrence P. Huelsman. Com- 
put. Sci. & Tech. Ser. Harper & Row, 1986, xiv 
+ 285 pp, (P). [ISBN: 0-06-042994-1] How to use 
Pascal for finding numerical solutions of engineer- 
ing problems. Develops Pascal subroutines for nu- 
merical integration, numerical solution of differential 
equations, computer determination of partial frac- 
tion expansions, inverse Laplace transforms, Fourier 
series analysis, and others. Examples from electri- 
cal engineering clearly illustrate the application of 
each technique. Emphasis placed on use of numeri- 
cal techniques, not on the techniques themselves. In- 
tended for microcomputers. Includes exercises. PS 


Programming, T(13: 1). Fundamental Program- 
ming with FORTRAN 77: A Science and Engineer- 
ing Approach. J. Denbigh Starkey, Rockford J. Ross. 
West, 1987, xviii + 876 pp, $32.75 [ISBN: 0-314- 
77805-5]; Instructor’s Guide, xviii + 519 pp, (P). 
[ISBN: 0-314-87243-4] Introduction to programming 
and to Fortran. Emphasizes program design, anal- 
ysis, verification. Case studies are central. De- 
signs programs in top-down, structured fashion us- 
ing pseudo-language, then implements them in For- 
tran. Instructor’s Guide discusses points to empha- 
size, provides answers to exercises, supplies test ques- 
tions. DFA 


Programming, T(13-15: 1), S, P, L*. Prolog: A 
Relational Language and Its Applications. John Mal- 
pas. Prentice-Hall, 1987, x + 465 pp, $28.67 (P). 
[ISBN: 0-13-730805-1] An introduction to the Pro- 
log programming language, logic programming, ap- 
plications. Prerequisite: at least one programming 
language. Features will appeal to wide audience: op- 
tional introduction to theoretical notions, answered 
exercises, artificial intelligence applications, manu- 
facturing case study, bibliography. Examples (in 
“core Prolog”) run on a dozen different language ver- 
sions. RB 


Programming, $(13-14). Advanced Turbo C. Her- 
bert Schildt. Osborne/McGraw-Hill, 1987, xii + 397 
pp, $22.95 (P). [ISBN: 0-07-881280-1] Turbo C is the 
C compiler produced by Borland International for 
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the IBM PC and PC-compatible systems. C is a sys- 
tems implementation language that is widely used by 
computer scientists for building software packages. 
This text is an introduction to the Turbo C compiler 
which has a number of extensions and capabilities 
not found in other C compilers. MS 


Programming, T(13: 1). Pascal and Algo- 
rithms: An Introduction to Problem Solving. Gre- 
gory F. Wetzel, William G. Bulgren. Comput. Sci. 
Ser. SRA, 1987, xviii + 570 pp, (P). [ISBN: 0- 
574-18630-1] Problem-solving and algorithm devel- 
opment have center stage; treats programming as a 
useful tool for implementing solutions. Contains all 
the material for the first course, but syntax and se- 
mantics have secondary importance. Procedures and 
functions come after control structures but before 
structured data types. Historical anecdotes. Worth 
a look. DFA 


Programming, S(14-15). FORTH: The Neat 
Step. Ron Geere. Addison-Wesley, 1986, ix + 89 pp, 
$16.95 (P). [ISBN: 0-201-18050-2] FORTH is a new 
programming language that is widely used in the con- 
trol of laboratory equipment such as machine tools, 
telescopes, and robots. This text is an introduction 
to the language for people with some programming 
experience, but no prior knowledge of FORTH. It 
uses a wide range of examples to illustrate the power 
and capabilities of the language. MS 


Programming, S(15-16). A Guide to the SQL 
Standard. C.J. Date. Addison-Wesley, 1987, xiv + 
205 pp, $25.95 (P). [ISBN: 0-201-05777-8] SQL is the 
official standard language for use with all relational 
database systems. Most relational products in the 
marketplace use SQL as their query language. This 
text is a complete introduction to the language. It 
shows how users can form queries in SQL to retrieve 
arbitrary data from the database. The emphasis is 
on the language standard and the principles involved, 
rather than on any specific product or software sys- 
tem. MS 

Programming, T??(13), S, L. Using Turbo Pro- 
log. Phillip R. Robinson. Osborne McGraw-Hill, 
1987, xxv + 340 pp, $19.95 (P). [ISBN: 0-07-881253- 
4] Fairly complete introduction to the syntax and 
use of Turbo Prolog for PC’s, with some discussion 
of advanced programming techniques and applica- 
tions. RM 


Programming, S. Using Turbo C. Herbert Schildt. 
Osborne/McGraw-Hill, 1988, xv + 431 pp, $19.95 
(P). [ISBN: 0-07-881279-8] A primer on the C lan- 
guage and the programming environment provided 
by Turbo C. For programmers without prior experi- 
ence with C. AO 


Languages, P. Languages for Sensor-Based Con- 
trol in Robotics. Ed: Ulrich Rembold, Klaus 
Hormann. NATO ASI Ser. F, V. 29. Springer- 
Verlag, 1987, ix + 625 pp, $100. [ISBN: 0-387-17665- 
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9] Proceedings of the NATO Advanced Research 
Workshop on Languages for Sensor-Based Control in 
Robotics held September 1-5, 1986 in Castelvecchio 
Pascoli, Italy. The purpose of the workshop was to 
define the state-of-the-art of languages for robots and 
make recommendations for future research. AO 


Languages, P. Lecture Notes in Computer Science- 
262: A Review of Ada Tasking. Alan Burns, Andrew 
M. Lister, Andrew J. Wellings. Springer-Verlag, 
1987, viii + 141 pp, $15.40 (P). [ISBN: 0-387-18008- 
7] The Ada programming language’s task model for 
parallel processing has been extensively discussed 
and analyzed. “This book draws on the available lit- 
erature to present a comprehensive review of the Ada 
tasking model,” evaluating this approach to concur- 
rency programming in the context of real-time and 
distributed systems, with reference to formal seman- 
tics and implementation. RB 


Languages, P. Lecture Notes in Computer Science- 
260: ANNA: A Language for Annotating Ada Pro- 
grams, Reference Manual. David C. Luckham, et 
al. Springer-Verlag, 1987, v + 143 pp, $15.40 (P). 
(ISBN: 0-387-17980-1] ANNA is an extension of Ada 
that allows the formal specification of the intended 
behavior of Ada programs. These annotations are 
machine processable so that they can be used in con- 
junction with other tools. AO 


Algorithms, P. Lecture Notes on Bucket Algo- 
rithms. Luc Devroye. Progress in Comput. Sci., No. 
6. Birkhauser Boston, 1986, 146 pp, $26. [ISBN: 
0-8176-3328-6] Bucket algorithms are techniques for 
managing the storage and retrieval of data into spe- 
cial data structures called hash tables. This is a set 
of lecture notes for a course in theoretical computer 
science which treats the subject of bucket algorithms 
and its relationship to the statistical distribution of 
data. MS 


Computer Systems, S(16-17), P. Testing in Soft- 
ware Development. Ed: Martyn A. Ould, Charles 
Unwin. British Comput. Soc. Mono. in Informat- 
ics. Cambridge U Pr, 1986, x + 124 pp, $16.95 
(P). [ISBN: 0-521-33786-0] Guide to the management 
of testing of software throughout the life cycle (re- 
quirements specification, design, implemented sys- 
tem) from the viewpoints of the user, development 
manager, designers, programmers. Key ideas are 
that testing must be considered broadly (as informal 
design reviews, test case analyses, formal proofs), 
and must be managed throughout the process so 
that test results become part of the project through- 
out. RM 


Computer Systems, P. An Introduction to Tezt 
Processing Systems: Current Problems and Solu- 
tions, Ed: J.J.H. Miller. Boole Pr, 1985, vii + 120 
pp. [ISBN: 0-906783-51-8] Notes from 11 lectures 
given at an October 1985 workshop held in conjunc- 
tion with the Protext II Conference in Dublin. In- 
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cludes introductions to TX, Metafont, and the Stan- 
dard Generalized Markup Language (SGML). LAS 
Computer Systems, P, L. UNIX System Read- 
ings and Applications. AT&T. Prentice-Hall, 1987, 
$18 each (P). Volume I: UNIX Time-Sharing Sys- 
tem, xiv + 397 pp [ISBN: 0-13-938532-0]; Volume 
II: The UNIX System, xii + 324 pp. [ISBN: 0-13- 
939845-7] Reprints of two special issues of the Bell 
System Technical Journal (July-August 1978, Octo- 
ber 1984) devoted to the UNIX operating system 
and extensions. UNIX was developed at Bell Lab- 
oratories beginning about 1970, and offers elegant, 
powerful design features. It is heavily used in edu- 
cational institutions and increasingly in commercial 
settings; a considerable body of related software has 
evolved. These 38 articles survey the system and 
basic applications (Volume J) as well as later devel- 
opments ( Volume JI), and are largely written by the 
authors of the system and applications themselves. 
The readings are generally less terse, more exposi- 
tory than comparable entries in the UNIX Program- 
mer’s Manual. A collection of important, often-cited 
background papers. RB 

Computer Systems, P. Lecture Notes in Com- 
puter Sctence-257: Database Machine Performance: 
Modeling Methodologies and Evaluation Strategies. 
Ed: Francesca Cesarini, Silvio Salza. Springer- 
Verlag, 1987, x + 250 pp, $23.10 (P). [ISBN: 0- 
387-17942-9] Thorough coverage of methods for ana- 
lyzing performance of database machines—machines 
designed specifically to deal with databases. Sug- 
gestion of performance indexes, measuring methods, 
and strategy for machine analysis. Includes perfor- 
mance analysis of existing database machines, DB- 
MAC and VERSO. Stresses the importance of ana- 
lyzing and modeling during the development stages 
of a database machine. PS 

Computer Systems, P. Lecture Notes in Com- 
puter Science-258 & 259: PARLE: Parallel Architec- 
tures and Languages Europe. Ed: J.W. de Bakker, 
A.J. Nijman, P.C. Treleaven. Springer-Verlag, 1987, 
$34.60 each (P). Volume I: Parallel Architectures, xii 
+ 479 pp (ISBN: 0-387-17943-7]; Volume II: Parallel 
Languages, xii + 463 pp. {[ISBN: 0-387-17945-3] Pro- 
ceedings of a conference on theory, application and 
design of parallel computer systems in Eindhoven, 
The Netherlands, June 1987, initiated by Project 
415 of the European organization ESPRIT. 54 re- 
search papers by European authors on topics includ- 
ing: concurrent, object-oriented, logic and functional 
programming; MIMD and reduction parallel comput- 
ers; process theory, design and verification of paral- 
lel systems, performance evaluation; interconnection 
networks, systolic arrays; VLSI, RISC architectures; 
applications and special purpose architectures. In- 
cludes overviews of the field (invited lectures and 
ESPRIT 415 reports). RB 


Computer Systems, P. Parallel Computer Vision. 
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Ed: Leonard Uhr. Academic Pr, 1987, xiv + 303 
pp, $29.95. [ISBN: 0-12-706958-5] A pyramid is a 
massively parallel multi-computer network consist- 
ing of a tree-like hierarchy of computers intercon- 
nected as arrays at each layer, such that successively 
higher layers contain fewer computers. These eleven 
research papers concern hardware architecture, algo- 
rithms and software for pyramids and variations, and 
applications of such systems to computer vision. RB 


Computer Systems, P. Evaluation of Multicom- 
puters for Image Processing. Ed: L. Uhr, et al. 
Academic Pr, 1986, xiv + 346 pp, $35. [ISBN: 0- 
12-706962-3] A benchmark is a computer program 
whose execution time serves as a measurement stan- 
dard for performance comparisons between computer 
systems. This book, proceedings of the 1984 Tanque 
Verde workshop on multicomputer systems in image 
processing (Tucson, Arizona), focuses on benchmark- 
ing and other evaluation techniques appropriate for 
such systems. Seventeen research papers. RB 


Computer Systems, P. Smalltalk-80: Bits of His- 
tory, Words of Advice. Ed: Glenn Krasner. Addi- 
son-Wesley, 1983, viii + 344 pp, $25.95 (P). [ISBN: 
0-201-11669-3] Smalltalk-80 is a software system rep- 
resenting on-going efforts at Xerox Corporation’s 
Palo Alto Research Center (PARC) to make large 
amounts of computing power accessible to users. Xe- 
rox invited outside groups to implement Smalltalk-80 
on a variety of hardware. This book describes the im- 
plementors’ experiences. Primarily for Smalltalk-80 
implementors, software engineers. RB 


Computer Systems, P. Design of VLSI Circuits 
Based on VENUS. E. Horbst, C. Miuller-Schloer, 
H. Schwartzel. Springer-Verlag, 1987, xii + 318 
pp, $59.50. [ISBN: 0-387-17663-2] A textbook on 
designing VLSI circuits using the VENUS product 
(Siemens AG, Munich). This CAD system facil- 
itates representation, design and verification, with 
accommodations for manufacture and testing. Sum- 
mary of integrated circuit design methods, microelec- 
tronic technologies, layout design methods, and test- 
ing principles; practical VENUS instructions and cell 
libraries; outlook. RB 


Computer Systems, P. Mechanisms for Reliable 
Distributed Real-Time Operating Systems: The Al- 


pha Kernel. J. Duane Northcutt. Perspect. in 
Comput., V. 16. Academic Pr, 1987, xiv + 245 
pp, $25. [ISBN: 0-12-521690-4] A research mono- 


graph describing a largely implemented operating 
system kernel (Alpha kernel) which integrates up- 
dated techniques (including decentralized manage- 
ment of global resources, object orientation, exclu- 
sion of policy from kernel, deadlines) in a modular 
distributed real-time system; created as a vehicle for 
the study of issues in decentralized operating sys- 
tems. RB 


Computer Systems, P. Lecture Notes in Com- 
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puter Science-272: Future Parallel Computers. Ed: 
P. Treleaven, M. Vanneschi. Springer-Verlag, 1987, v 
+ 492 pp, $34.60 (P). [ISBN: 0-387-18203-9] A collec- 
tion of topics following the structure of a June 1986 
Advanced Course held in Pisa. Covers main classes 
of parallel computers, illustrates these classes by ex- 
amining important parallel systems being developed, 
and presents topics that influence all classes of par- 
allel computers. PS 


Computer Systems, T(14-16), S, P*. A Guide 
to INGRES. C.J. Date. Addison-Wesley, 1987, xiii 
+ 385 pp, $32.95. [ISBN: 0-201-06006-X] A detailed 
description of the INGRES relational database man- 
agement system, intended for end-users and applica- 
tion programmers in data processing, database pro- 
fessions, including students and teachers of data pro- 
cessing. Thorough coverage of user-oriented mate- 
rial, e.g., SQL query language; sketchy on details of 
interest only to system programmers or machine op- 
erators. Highly readable, highly informative. RB 


Computer Systems, P. Computer Aided Tezt 
Processing—An Introduction. Ed: J.J.H. Miller. 
Boole Pr, 1986, vii + 69 pp. [ISBN: 0-906783-57- 
7| Lecture notes from a short course on TRX, La- 
TeX, Postscript, and SGML (Standard Generalized 
Markup Language) held in conjunction with the Oc- 
tober 1986 Protext III Conference in Dublin. LAS 


Computer Systems, P. Lecture Notes in Com- 
puter Science-253: WOPPLOT 86: Parallel Process- 
ing: Logic, Organization, and Technology. Ed: J.D. 
Becker, I. Eisele. Springer-Verlag, 1987, vi + 226 
pp, $20.60 (P). [ISBN: 0-387-18022-2] Proceedings of 
a workshop at Neubiberg, FRG (July 1986) featur- 
ing reports on current work and future prospects of 
parallel processing. Silicon vs. molecular electronics 
technology; organizational structure of parallel pro- 
cessors, including pyramid architectures and MIMD 
(multiple instruction/multiple data) machines; ap- 
plications to physics and image processing. RB 


Computer Graphics, T(13), P*. Device-Inde- 
pendent Graphics with Examples from IBM Personal 
Computers. Robert F. Sproull, W.R. Sutherland, 
Michael K. Ullner. McGraw-Hill, 1985, xii + 546 
pp, $35.95 (P). [ISBN: 0-07-060504-1] The Graph- 
ical Kernel System (GKS) is a device-independent 
graphics subroutine package that is an approved in- 
ternational standard. This book is an introduction 
to the design and programming of interactive graph- 
ics applications. It includes an introduction to GKS 
and examples of programs using GKS. Also includes 
a discussion of device-independent graphics software 
and standards. JAS 


Computer Graphics, P, L. Introduction to the 
Graphical Kernel System (GKS), Second Edition. 
F.R.A. Hopgood, et al. APIC Stud. in Data Pro- 
cessing, No. 28. Academic Pr, 1986, xii + 250 
pp, $23.95 (P). [ISBN: 0-12-355571-X] A descriptive 
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introduction with examples to GKS, the first (pro- 
gramming language-independent) graphics system to 
be endorsed by the International Standards Organi- 
zation (ISO, 1985), by the authors of the ISO doc- 
ument. Essential concepts; features for the special- 
ist. Intended for the application programmer with 
at least rudimentary knowledge of computer graph- 
ics. RB 


Computer Graphics, P, L., Computer Graphics 
Programming: GkKS—The Graphics Standard, Sec- 
ond, Revised and Enlarged Edition. G. Enderle, K. 
Kansy, G. Pfaff. Symb. Computat. Ser. Springer- 
Verlag, 1987, xxiii + 651 pp, $59. [ISBN: 0-387- 
16317-4] A complete reference on the GKS standard. 
This edition reflects changes made to the standard 
after 1983 and adds a new section on the three- 
dimensional extension of GKS. (First Edition, TR, 
November 1984.) AO 


Theory of Computation, P. Lecture Notes in 
Computer Science-256: Rewriting Techniques and 
Applications. Ed: Pierre Lescanne. Springer-Verlag, 
1987, vi + 285 pp, $23.10 (P). (ISBN: 0-387-17220- 
3] Proceedings of second conference on Rewriting 
Techniques and Applications, Bordeaux, May 1987. 
Twenty-two research papers and one invited lecture 
on rewriting, a part of formal language theory with 
computer science applications involving substitution, 
pattern matching, formal logic, combinatorial group 
theory, etc. Implementation; Knuth-Bendix theo- 
rem; theoretical aspects; unification; efficiency. RB 


Theory of Computation, T, P, L. Computability. 
Klaus Weihrauch. EATCS Mono. on Theor. Com- 
put. Sci., V. 9. Springer-Verlag, 1987, x + 517 pp, 
$59.50. (ISBN: 0-387-13721-1] A textbook on recur- 
sion theory, presenting a framework of foundations 
rather than a comprehensive survey of all branches 
of the field. Part 1: Introduction, computability on 
natural numbers, words; written at undergraduate 
level. Part 2: Recursion theory of natural numbers, 
related topics. Part 3: Computability on uncount- 
able sets (Type 2 computability), applications. Each 
part largely self-contained; hundreds of exercises. RB 


Artificial Intelligence, T??, S??, P? Artificial 
Intelligence Using C. Herbert Schildt. Osborne 
McGraw-Hill, 1987, xi + 412 pp, $21.95 (P). [ISBN: 
0-07-881255-0] A survey of some programming is- 
sues related to artificial intelligence, geared toward a 
popular audience. Considers expert systems, natural 
language processing, computer vision, pattern recog- 
nition, robotics, machine learning, etc. Presented as 
“how-to” manual; necessarily superficial; very debat- 
able opinions expressed as fact, e.g., “[t]here are only 
vague historical reasons why special artificial intelli- 
gence languages [LISP, Prolog] were invented in the 
first place.” 63-page review of C. RB 


Computer Science, S(17), P. Lecture Notes in 
Computer Sctence-245: Lectures on the Complezity 
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of Bilinear Problems. H.F. de Groote. Springer- 
Verlag, 1987, 135 pp, $15 (P). [ISBN: 0-387-17205- 
X] Based on the author’s 1982 lectures given at the 
University of Zurich. The objective is to present the 
complexity theory of bilinear problems in a unified 
and coordinate-free form. The emphasis is on those 
problems which are defined by multiplication in as- 
sociative algebras. CEC 


Computer Science, P. Lecture Notes in Com- 
puter Science-261: Translating Relational Queries 
into Iterative Programs. Johann Christoph Frey- 
tag. Springer-Verlag, 1987, xi + 131 pp, $15.40 
(P). [ISBN: 0-387-18000-1] This dissertation inves- 
tigates the automatic translation of set-oriented re- 
lational database query specifications into itera- 
tive programs, using functional programming and 
program transformation techniques. The author 
presents two algorithms for generating the iterative 
programs, considers efficient programs for evaluating 
aggregate functions (e.g., average), and reports on 
Lisp (T) as a database system implementation lan- 
guage. RB 


Computer Science, P, L. Advances in Comput- 
ers, Volume 26. Ed: Marshall C. Yovits. Academic 
Pr, 1987, xi + 476 pp, $65. [ISBN: 0-12-012126-3] 
This annual publication consists of survey articles 
on selected topics by various authors, written from 
a relatively leisurely perspective. Computer support 
of “fuzzy” human reasoning, unary (vs. binary) rep- 
resentation in computers, parallel algorithms, multi- 
stage interconnection networks, fault-tolerant com- 
puting, VLSI validation and testing, software testing 
concepts, development of reliable distributed soft- 
ware systems. RB 


Computer Science, P. Concurrency and Nets: Ad- 
vances in Petri Nets. Ed: K. Voss, H.J. Genrich, 
G. Rozenberg. Springer-Verlag, 1987, xiii + 622 pp, 
$49. (ISBN: 0-387-18057-5] Addresses and papers in 
honor of C.A. Petri’s 60th birthday, plus many con- 
tributed papers on current issues and applications of 
nets. RM 


Applications, P. Proceedings of the First Interna- 
tional Conference on Industrial and Applied Math- 
ematics (ICIAM 87). Ed: A.H.P. van der Burgh, 
R.M.M. Mattheij. CWI Tract, V. 36. Math Cen- 
trum, 1987, 433 pp, Df. 56.80 (P). [ISBN: 90-6196- 
318-4] Contains the papers presented at ICIAM 87 
by researchers in the Netherlands. The 29 papers in 
this volume cover a wide range of topics in applied 
and industrial mathematics. AO 


Applications, P. Protezt II. Ed: J.J.H. Miller. 
Boole Pr, 1987, vii + 169 pp. [ISBN: 0-906783-55-0] 
Proceedings of the October 1986 third international 
conference on text processing held in Dublin. Focus 
on font design and user interface in computer type- 
setting systems. LAS 


1988] 


Applications (Biological Science), P. Lecture 
Notes in Biomathematics-71: Mathematical Topics 
in Population Biology, Morphogenesis and Neuro- 
sciences. Ed: E. Teramoto, M. Yamaguti. Springer- 
Verlag, 1987, ix + 348 pp, $37.40 (P). [ISBN: 0-387- 
17875-9] Proceedings of a 1985 international sym- 
posium in Kyoto, Japan. Papers on mathematical 
ecology, population biology (structure, dispersal, and 
evolution), theory of patterns and morphogenesis, 
neuroscience, and physiology. RM 


Applications (Biological Science), P. Lecture 
Notes in Biomathematics-70: Stochastic Methods in 
Biology. Ed: M. Kimura, G. Kallianpur, T. Hida. 
Springer-Verlag, 1987, vi + 229 pp, $22.80 (P). 
[ISBN: 0-387-17648-9] Proceedings of a 1985 work- 
shop in Nagoya, Japan for biologists and mathemati- 
cians. Topics of papers include population genetics, 
neurophysiology, fluctuations in living cells, mathe- 
matical treatments of epidemiology, population dy- 
namics, stochastic differential equations. RM 


Applications (Communication Theory), T(16: 
1), P. Digital Transmission Theory. Sergio 
Benedetto, Ezio Biglieri, Valentino Castellani. Pren- 
tice-Hall, 1987, xvi + 639 pp, $53.33. [ISBN: 0-13- 
214313-5] Treatment of digital communication the- 
ory (introduction to mathematical techniques, infor- 
mation theory, modulation schemes, Gaussian chan- 
nels, adaptive receivers, encoding and reliability, 
non-linear channels) with emphasis on engineering 
applications, bandwidth, power, and complexity of 
systems as major design trade-offs. RM 


Applications (Economics), P. Mathematical Eco- 
nomics. Kelvin Lancaster. Dover, 1987, xiii + 411 
pp, $9.95 (P). [ISBN: 0-486-65391-9] Republication 
of the work originally published by Macmillan in 
1968 (TR, November 1968). PS 


Applications (Electrical Engineering), T(16- 
17: 1). Mathematical Modeling and Digital Simu- 
lation for Engineers and Scientists, Second Edition. 
Jon M. Smith. Wiley, 1987, xv + 430 pp, $36.95. 
[ISBN: 0-471-08599-5] Major additions in this edi- 
tion are new sections on numerical methods for sim- 
ulating random processes and simulator verification. 
New material has also been added throughout to bet- 
ter explain the use of numerical techniques. (First 
Edition, TR, October 1978.) AO 


Applications (Electrical Engineering), P. Oper- 
ator Theory, Analytic Functions, Matrices, and Elec- 
trical Engineering. J. William Helton, et al. CBMS 
Reg. Conf. Ser. in Math., No. 68. AMS, 1987, xiv + 
134 pp, $19 (P). [ISBN: 0-8218-0718-8] Expansion of 
ten lectures given at a regional conference in Lincoln, 
Nebraska. Primarily useful as a list of theorems and 
references. GG 


Applications (Engineering), P. Numerical Meth- 
ods in Transient and Coupled Problems. Ed: R.W. 
Lewis, et al. Wiley, 1987, xvi + 350 pp, $95. [ISBN: 
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0-471-91200-X] Of interest to reseasrchers in civil en- 
gineering. LC 

Applications (Engineeritig), T(16-17: 1), P, 
L. Solving Polynomial Systems Using Continuation 
for Engineering and Scientific Problems. Alexander 
Morgan. Prentice-Hall, 1987, xiii + 546 pp, $49.95. 
[ISBN: 0-13-822313-0] A continuation method (em- 
bedding or homotopy method) for solving small sys- 
tems of polynomial equations which requires no ini- 
tial guess and finds all solutions. Provides the neces- 
sary differential topology. Includes exercises; many 
suggestions for experiments and projects; case stud- 
ies from real applications. Pleasant, conversational 
style. Contains Fortran source code (available on 
diskettes) for implementation. DFA 


Applications (Fluid Mechanics), S(18), P. Tur- 
bulence and Random Processes in Fluid Mechanics. 
M.T. Landahl, E. Mollo-Christensen. Cambridge 
U Pr, 1986, vi + 154 pp, $17.95 (P). [ISBN: 0- 
521-34687-8] Masters-level monograph on subject in- 
tended to provide background necessary for reading 
the literature and beginning research. Assumes read- 
ers from a variety of backgrounds. Covers stability, 
wave motion, and coherent structure. MR 


Applications (Information Theory), P, L. Anal- 
ysis of Fuzzy Information, Volume I-III. Ed: James 
C. Bezdek. CRC Pr, 1987, $450 set. V. I: Math- 
ematics and Logic, 272 pp [ISBN: 0-8493-6296-2]; 
V. I: Artificial Intelligence and Decision Systems, 
251 pp [ISBN: 0-8493-6297-0]; V. III: Applications 
in Engineering and Science, 296 pp. (ISBN: 0-8493- 
6298-9] Fifty chapters based on papers presented at 
a July 1984 conference, including ten lengthy survey 
papers. A comprehensive survey of Lofti Zadeh’s in- 
novative concept of a fuzzy set based on imprecise 
information, revealing extensive applications to anal- 
ysis, logic, expert systems, database systems, pattern 
recognition, and operations research. LAS 


Applications (Management), P. Decision Sup- 
port Systems: Theory and Application. Ed: Clyde 
W. Holsapple, Andrew B. Whinston. NATO ASI 
Ser. F, V. 31. Springer-Verlag, 1987, x + 500 pp, 
$90. [ISBN: 0-387-17774-4] Proceedings of NATO 
Advanced Study Institute (1985) on current DSS 
views and future trends. Recurring emphasis of 
artificial intelligence explored throughout. Papers 
cover theory (paradoxes, relational theory of model 
management, conceptual modelling, artificial intel- 
ligence and knowledge-based DSS), and applications 
(office information systems, accounting and auditing, 
strategic planning, and management). RM 


Applications (Physics), T(18: 1), S, P. Hamil- 
tonian Methods in the Theory of Solitons. L.D. Fad- 
deev, L.A. Takhtajan. Springer-Verlag, 1987, ix + 
592 pp, $110. [ISBN: 0-387-15579-1] The first of a 
two-volume series, this book is devoted to the clas- 
sical rather than the quantum analysis of the non- 
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linear Schrodinger equation, with emphasis on the 
Hamiltonian structure. The authors have attempted 
to make this text sufficiently self-contained to serve 
as an independent introduction. MU 

Applications (Physics), T(18: 2), P. Gauge 
Field Theories. Stefan Pokorski. Mono. on Math. 
Phys. Cambridge U Pr, 1987, xiii + 394 pp, $89.50. 
[ISBN: 0-521-26537-1] Gauge theory in electrody- 
namics studies the invariance of the Lagrangian den- 
sity of fields with respect to local phase transforma- 
tions. This book provides basic tools in theory of fun- 
damental interactions requiring only a background 
in quantum field theory. Topics covered include ba- 
sics of perturbation theory and areas of research in- 
terest (e.g., chiral symmetry). Includes many exer- 
cises. MR 

Applications (Physics), T, S, P, L. Manifolds 
and Mechanics. Arthur Jones, Alistair Gray, Robert 
Hutton. Australian Math. Soc. Lect. Ser., V. 
2. Cambridge U Pr, 1987, 166 pp, $39.50; $13.95 
(P). (ISBN: 0-521-33375-X; 0-521-33650-3] A text- 
book aiming to make basic ideas about differentiable 
manifolds readily available to applied mathemati- 
cians and theoretical physicists while exhibiting ap- 
plications (to Lagrangian mechanics) to the mathe- 
matician. Prerequisite: advanced calculus, e.g., Spi- 
vak’s Calculus on Manifolds. Fundamental notions; a 
geometric derivation of Lagrange’s equations; flows, 
spherical pendulum, rigid bodies. Numerous exer- 
cises. RB 


Applications (Physics), P. Percolation Theory 
and Ergodic Theory of Infinite Particle Systems. Ed: 
Harry Kesten. IMA, V. 8. Springer-Verlag, 1987, xi 
+ 323 pp, $34. [ISBN: 0-387-96537-8] Proceedings 
of a workshop at the Institute for Mathematics and 
its Applications, mostly concerned with random pro- 
cesses (such as self-avoiding walks) on lattices. Per- 
colation theory and interacting particle systems are 
subfields of probability, related to statistical mechan- 
ics. BC 


Applications (Physics), P, L. Einstein’s Dream: 
The Search for a Unified Theory of the Universe. 
Barry Parker. Plenum Pr, 1986, ix + 287 pp, $18.95. 
[ISBN: 0-306-42343-X] A presentation for a popular 
audience of background and concepts in search of a 
“srand unified theory (GUT)” in physics. General 
relativity, phenomena including white dwarf stars 
and black holes, origins of the universe, particle 
physics and field interactions, current GUTs are sur- 
veyed in a historical context; “mathematics has been 
completely avoided in this book ....” RB 


Applications (Physics), P. Theory and Applica- 
tions of Liquid Crystals. Ed: J.L. Ericksen, D. 
Kinderlehrer. IMA, V. 5. Springer-Verlag, 1987, xii 
+ 353 pp, $37. (ISBN: 0-387-96546-7] Thirteen pa- 
pers, largely the proceedings of a workshop at the 
Institute for Mathematics and Its Applications, Min- 
nesota, January 1985. Almost no mathematicians 


TELEGRAPHIC REVIEWS 


were engaged in liquid crystal research at that time. 
The workshop succeeded in stimulating subsequent 
work: three of the papers report on investigations 
undertaken after the workshop took place. RB 


Applications (Physics), T(15-17: 1), L. One- 
dimensional Stefan Problems: An Introduction. 
James M. Hill. Mono. & Surv. in Pure & Appl. 
Math., V. 31. Longman Scientific & Tech (US Distr: 
Wiley), 1987, xviii + 204 pp, $106. [ISBN: 0- 
470-20388-9] Mainly studies semi-analytical solution 
methods for those heat-diffusion moving-boundary 
problems, pseudo-steady-slate approximations, large 
Stefan number expansion, integral formulation, up- 
per and lower bounds for the motion, formal series 
solutions, integral iteration techniques; Goodman’s 
and Megerlin’s methods and an iterative series so- 
lution procedure. For practitioner as well as stu- 
dent. DFA 


Applications (Physics), S*(13), L. Realistic Ap- 
plications in Mechanics. The Spode Group. Ox- 
ford U Pr, 1986, viii + 76 pp, $19.95 (P). [ISBN: 
0-19-853654-2] A series of explicit case studies in 
(calculus-based) mechanics applied particularly to 
sports (e.g., the “miracle” high jump in the 1968 
Mexico City Olympics) and driving (designing auto- 
mobile brakes). Intended to help motivate 16-19 year 
old vocational students in Great Britain. An excel- 
lent source of rich, supplementary problems. LAS 


Applications (Physics), P. Proceedings Seminar 
1983-1985: Mathematical Structures in Field Theo- 
ries, V. 2, M.J. Bergvelt, G.M. Tuynman, A.P.E. 
ten Kroode. CWI Syllabus, V. 13. Math Cen- 
trum, 1987, iii + 206 pp, Dfl. 13.60 (P). [ISBN: 
90-6196-317-6] Consists of three papers: “An intro- 
duction to classical mechanics and symplectic geome- 
try,” by G.M. Tuynman; “The Hamiltonian structure 
of Yang-Mills theories,” by M.J. Bergvelt; and “Ge- 
ometrical description of the Toda lattice,” by A.P.E. 
ten Kroode. MU 
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Bittinger’s brief calculus texts, we chal- nO ES 
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results. You'll see how Calculus: ~ go 

A Modeling Approach, Fourth 

Edition, by Marvin L. Bittinger, and 
Applied Calculus, Second Edition, 

by Bittinger and Bernard B. Morrel, were 
designed with your students in mind. You'll 
also see how well these two new editions 
hold up under the close scrutiny of the 
toughest judge you know. You. 
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All Topologies Come From Generalized Metrics 


RALPH KOPPERMAN, City College of New York 


RALPH KOPPERMAN: I received my Ph.D. from M.I.T. in 1965, and came to 
The City College in 1967. SEEK is a special program for students from poor 
areas of the city, and I have coordinated its mathematics efforts since 1969. 

My research interest has always been in limits, which I first tried to study 
through infinitary languages (logic). I was able to publish in the field, but 
was not happy with the results of that research, and went into point-set 
topology in 1980. I was a founding member of the CCNY Seminar on 
General Topology and Topological Algebra in 1981, and have been involved 
with it ever since. 


Most of us learned our topology in a course about metrics. We were shown that 
the wonderful notions of continuity and uniform continuity, limit and Cauchy 
sequence, naturally arose from their proper consideration. We were then shown or 
told that metrics were too restrictive. There were limits which could not arise from 
metrics. 

Topological spaces were given as a partial solution to this problem. More general 
views of limit and continuity arose from a study of the topological concepts of open 
and closed, but uniform continuity and the ideas of Cauchy and complete were 
gone. Depending on our teacher, we may or may not eventually have learned 
another way of recovering these uniform ideas. 

Here we instead generalize the definition of metric space in a way that makes all 
topologies generalized metric topologies. This approach allows the use of suggestive 
“metric” notations at all times and reunites the treatment of topological and 
uniform ideas in key cases. 

In the first section, we look at metric and topological spaces from an almost 
standard viewpoint; in the second we develop our generalization. All topological 
and metric definitions introduced without comment are as found in Kelley ({1)). 
Similar definitions are found in most basic texts, but there may be some differences. 


1. Metric and topological spaces. For our purposes, it is convenient to regard a 
metric as amap d: X X X — [0, co] for some nonempty X, which satisfies: 

(ml) d(x, x) =0 

(m2) d(x, z) < d(x, y) + d(y, z) (triangularity) 

(m3) d(x, y) = d(y, x) (symmetry) 

(m4) d(x, y) =0 => x = y (separation). 

A pseudometric is a map d: X X X — [0, 00] for X # 9, satisfying (m1)-(m3), a 
quasimetric is one satisfying (m1)-—(m2). 
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A topology is a set T of subsets of some nonempty X such that: 

(tl) ¢, X & T, 

(t2) if C C T then UC € T, 

(t3)if AP, 2eTthn APNZQET. 

Given a topology T, a set is open if it’s in T, closed if its complement is in T. 

Just how do we get a metric (or quasimetric) topology? A set F is open in such a 
topology if every point in Y is surrounded by an open ball with positive radius r, 
S.(x) = {y: d(x, y) <r} contained in #. It would be equivalent to require that 
for each x © F there is some positive , such that the closed ball, N(x) = 
{y: d(x, y) <r} CF: N,,(x) C Sx) C N,(x), so if one sort of ball of positive 
radius is contained in #, so is the other. This latter is easier to generalize, and will 
be our approach. 

We now show that if 2= (X, d) is a metric space, then its metric topology, 
which we shall denote by To(2% ), is a topology: 

(tl) @ is open since no X € ¢ can contradict our condition. Since there is a 
positive r, choose one; if x € X then N(x) CX, showing X € To( 2). 

(t2) If C C To(%) we show UC to be open: if x € UC then for some 2 € C, 
x € J; but then for some positive r, N(x) C 2 Cc UC. 

(t3) To show that the intersection of two open sets is open, let FY, 2 € To(%), 
x € FQ. Then for some positive r,t, N(x) CFA and N(x) c 2. But the 
minimum of two positives is positive and Miine,,(%) = N(x) ON N(x) CAN 2, 
sO we're done. 

The two concepts of topology and metric give us two notions of limit: Let CX, d) 
and (Y, d’) be metric spaces. L is a metric limit at a € X for a function f: X > Y 
if for each positive r there is a positive s such that if d(a, x) < s then d’(L, f(x)) 
<r. 

For (X,T), (Y,T’) topological spaces, L is a topological limit at a © X for a 
function f: X — Y if for each open 2, if L © 2 there is an open FY such that 
a © # and for each x € FY, f(x) = 2. 

We leave to the reader the important verification that if T is the topology arising 
from d and T’ arises from d’, then metric limits and topological limits are the 
same. 

What must we change? Nonmathematicians would accept the cartoon (p. 91) as 
a demonstration of nonsymmetry. 

But mathematicians require more, so we look at two topologies indirectly 
discussed in some calculus courses. 


The topology of upper semicontinuity. Recall that f: R — R (R the reals) is upper 
semicontinuous if whenever f(x) < a then for some positive r, if |x — y| <r then 
f(y) <a. It can be shown that the collection U = {(—00,a): —-wo <a< o} 
forms a topology on R, called the topology of upper semicontinuity, and that f 1s 
upper semicontinuous iff f is continuous from the usual real topology to U (for 
discussion see Kelley [1], p. 101). In U if x < y and A= (—o, y) then x € F and 
y &€ Ff but there is no open 2 such that y © 2 and x € 2 (forif y © 2 = (— ow, z) 
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The cynical cyclist and symmetry. 


then x < y<zsox €2Q). But: 


Fact. In the topology arising from a quasimetric d on X there is an open F with 
xeEfand yEA iff 0 < d(x, y). 


Proof. If d(x, y) =0 and x € Y, F open, let N(x) C FY; then d(x, y) < r so 
y € F. But if d(x, y) > O then let r > 0, d(x, y) € r; clearly x € S.(x), y E S.(x), 
an open set. (The usual proof shows that S$ (x) is open in this situation; a 
generalization of that result is shown in the next section.) 


Combining this fact with the previous discussion of U, we see that if U arises 
from a quasimetric d and x < y, then d(y, x) = 0 contradicting (m4), and d(x, y) 
> 0 = d(y, x), contradicting (m3) as well. (The reader may wish to verify a fact not 
central to our discussion: U arises from the quasimetric defined by d(x, y) = 
max{0, y — x}.) 


The topology of pointwise convergence and neighborhood bases. Given a topology 
T, and an x © X, a neighborhood base about x is a collection B, of sets which have 
open subsets containing x and for which if x € Y, FY open, then for some N € B,, 
N C #. Note that for any metric topology, if x © X, B, = {S),,(x): a positive 
integer} is a countable base about x. Further, this collection is totally ordered by set 
inclusion since if r < s then S.(x) C S.(x). 

Let X = R14 = { f: [0,1] > R}, under the topology of pointwise convergence. 
(Recall that a sequence {g,} of functions from Y into R converges pointwise to a 
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function f if {g,(y)} converges to f(y) for each y € Y.) Kelley ({1], 92-93 
particularly Theorem 6, also 88-91) discusses the topology of pointwise convergence 
and shows that points in this X don’t have countable bases about them. 

For us it’s useful to note further that points don’t even have bases about them 
totally ordered by set inclusion. Let f € R™®" and define f, € R&" by f(x) = 
f(x) + 1/n; then { f,} 1s a sequence approaching f pointwise. By way of contradic- 
tion let B be a base about f totally ordered by set inclusion, and for each n let 
P&B, f, € F,. lf there were a 2 € B such that for each n 2c #, then for each 
n, f, € 2, and since 2 is a neighborhood of f, this contradicts { f,} > f. Thus for 
some n, 2 ¢ #,, and since B is totally ordered by set inclusion, A, C 2. But then if 
fe, F open, there isa 2 € B such that 2 C Y and an n such that F, C Y, thus 
P,CHF,s8s0{F,. n =1,2,...} is a countable base about /, our contradiction. Thus 
this space cannot arise from a metric valued in any totally ordered set like the 
nonnegative reals. We’ve now seen the key reasons why not all topologies arise from 
metrics, and shown how an idea of limit comes from topology. 

We close this section with an example showing that a topology doesn’t determine 
its Cauchy sequences: Let X be the set of positive integers, d(m,n) = |m — nl, 
e(m,n) = |1/m— 1/n| (= d(m, n)/mn). Both d and e are metrics on X, and 
further To((.X, d)) = To((X, e)): since Sf(n) = Sfacnaiy(n) = {1}, {n} is open in 
each topology, so since arbitrary unions of open sets are open and each set is the 
union of its points, each set is open in either topology. 

Thus if the notion of Cauchy were determined by topology, the two metrics should 
yield the same Cauchy sequences. Since d(m,n) = 1 form #n, x, = n isn’t Cauchy 
with respect to d. But it is with respect to e. 


2. Continuity spaces. In the first section we showed that (m3) and (m4) (symme- 
try and separation) couldn’t be expected to hold in a generalization of metric space 
from which all topologies arose. Neither could we expect our generalized metric to 
be valued in the positive reals. We begin this section with a definition of structures 
which may be used to replace the nonnegative extended reals. 

Before we give our definition, notice that the order on [0, oo] is unusual in that 
it’s completely determined by addition: a < b iff for some c (€ [0, o0]), b= ate. 
This in turn says about + on [0,00] thatifa+x=b and b+y=a thena<b 
and b < aso a= b. We adopt this as our axiom (vl) below. Notice that if (A, +) is 
a semigroup (i.e., +: 4 X A — A is associative) and A has identity 0 then < can 
be defined on A by: a <b iff for some x € A, b=a+ x. This < is always 
reflexive (since a = a+ 0, a < a) and transitive (if a < b and b < c then for some 
x,yEA, b=at+x andc=b+y=(at+x)t+ty=at+(xt+y), soasxc). If 
(v1) holds, < becomes a partial order (since if a < b and b <a then for some 
x,y €A,b=a+xanda=b+ysoa=b). Later we give examples which show 
that this < need not be total. 

In giving up our total order, we must give up the existence of min{r, s} for 
r,s > 0, a number often used in metric arguments; we shall find that inf will work 


1988] ALL TOPOLOGIES COME FROM GENERALIZED METRICS 93 


in its place. With this preface, we now generalize [0, oo]: 


1. Definition. A value semigroup is an additive abelian semigroup A with identity 
0 and absorbing element oo # 0, satisfying: 

(vl) ifa+x=bandb+y=a then a=b, 

(thus a < b iff b= a+ x for some x, defines a partial order), 

(v2) for each a there is a unique b (called (1/2)a or a/2) such that b+ b= a, 

(v3) for each a, b there is an inf, a A BD, 

(v4) aN b+c=(atc)A(bt+ Cc). 


Fact. If for each i € I, A; is a value semigroup (together with +,, 0,, 00;), then 
so is their product (with +, 0, defined coordinatewise; 1/2 and inf are also taken 
coordinatewise). 


Proof. (v1) Suppose a + x = b, b+ y = a. Then for each i, a, +, x; = (a+ x), 
=b,, a,;=b, +; y,,80 a; =b, so a= b. 
(v2)—(v4) are shown similarly and their proofs are left to the reader. 


The nonnegative extended reals with the usual addition form an example of a 
value semigroup which we call #. The examples we use later will be powers of &. 

The partial order < defined using (v1) is the usual one on [0, 00], and is the only 
one we refer to below. For each a,b € A,O0 <a/2 <a<a+b< oo. Thus think- 
ing of < as < and #, each nonzero element is positive. 

In #7, (0,1) € (1,0) and (1,0) < (0,1); in fact (0,1) and (1,0) are two positive 
elements whose inf is 0. Many metric space proofs use the fact that the min of two 
positive numbers is positive, in fact this property of the positive reals, rather than 
their size relative to 0 is key to their usefulness. Thus we must next define a “set of 
positives” by its useful properties: 


2. Definition. A set of positives in a value semigroup A is a P C A satisfying: 
(pl) if r,s © P then rAsé€éP, 

(p2) if re P, r<a,thenae P, 

(p3) if r © P then r/2 € P, 

(p4) if a< b+,r foreachr € P then a < b. 


(p4) is often used in its contrapositive form: if a < b then a £ b +r for some 
r € P. Also P # 6 by (p4), for otherwise 00 < 0 + r for each r € P so co < 0, and 
since < is a partial order and 0 < oo by the paragraph preceding definition 2, 
0 = oo; but this contradicts definition 1. In addition to the usual example, P = 
(0, co] in @, we see that each value semigroup A is a set of positives in itself. We can 
now define our generalization of metric space: 


3. Definition. A continuity space is a quadruple 2 = (X, d, A, P) such that X is 
nonempty, A is a value semigroup, P a set of positives on A, and d: X X X >A 
satisfies (m1) and (m2) of our first paragraph. This d is called a continuity function. 
& (and d) is symmetric iff (m3) is satisfied, and separated iff (m4) is satisfied. 

We now formalize the way topological spaces arise from continuity spaces. 
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4. Definition. If x € X, bE A, F=(X,d, A, P) a continuity space, then 
N,(x) = {y: d(x, y) < b} 1s called the closed ball of radius b about x, and 
To(%) = {2c X:if x € 2 then N(x) C 2 for some r € P} is the topology arising 
from &. : 


Suggestive “metric” notation is valid for continuity space topologies. For exam- 
ple, if f: X ~ Y, 4=(X,d, A, P), Y=(Y,e, A’, P’), then f is continuous at 
x € X with respect to To( 2%), To(Y ) iff for each s € P’ there is an r € P such that 
if d(x, y) <r then e( f(x), f(y)) < s. Uniform continuity can be defined in the 
above situation as well: f is uniformly continuous iff for each s € P’ there is an 
r & P such that if d(x, y) < r then e( f(x), f(y)) < s (here r is independent of x). 
In order to discuss certain limits in this general setting, we need to generalize 
sequences (mappings from the natural numbers into X) to nets (maps from a 
directed set (D, <) into X; Kelley discusses these beginning on p. 65). Then y is 
the limit of the net x: D — X iff for each r € P there is an n € D such that if 
n<m then d(y, x,,) < r. Wecan also define the net x: D > X to be Cauchy if for 
each r © P there is an » € D such that if n<m, p then d(x,,,x,) <r. Its a 
useful exercise to generalize the usual textbook results about topological concepts 
(limit, continuous, open, closed) and uniform notions (uniformly continuous, 
Cauchy, complete) to this setting. Most topological results hold without additional 
restrictions; some of them, as well as most uniform results, require symmetry, and 
separation is sometimes a simplifying assumption. (Note: there is also a clear way to 
define a uniformity from a continuity space in the symmetric case. If this is done, 
our definitions of uniform concepts coincide with those which come from the 
associated uniformity, but we prefer not to discuss uniform spaces here. The place 
of symmetry in our scheme is given in Theorem 11.) The Baire category theorem 
and related results are the few which require that we essentially have a metric 
(valued in [0, co)). 


5. PROPOSITION. To(% ) is always a topology. 


Proof. Our proof in the first section that every metric yields a topology applies 
here word for word, except for its last sentence. Note that since d(x, y) <rA t iff 
d(x, y) <r and d(x, y) <t, N,,,(x) = N(x) 9 N(x). Induction extends this to 
finite infs and intersections. That last sentence may now be changed to read: But 
since the inf of two positives is positive (by (pl)), N.,,(x) = N(x) AN N(x) CAN 
2, so we’ve shown To( 2% ) to be a topology. 


Our main goal is to show the converse of 5: every topology is a To( 2 ). It’s useful 
first to look at what needs to be done. For (X, T) a topological space, we must find 
A, P, d so that if 2= (X,d, A, P) then T = To(2). But how do we know when 
T = To(%)? Clearly by definition 4, if for each x € © there’s an r € P such that 
N(x) Cc 0, then O€ To(Z), so if this holds for each O€ T we must have 
T Cc To(%). 

It isn’t difficult to get for AE To( 4) a continuity function dy: X kX X ~& 
such that YE To(%,) C To( 2), where X= (X, d, &, (0, 00]). Simply let g > 0 
and define d(x, y) =O if (xe SY>yVEF), dy(x, y) = q otherwise. (m1) is 
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clearly satisfied, and for (m2): If d,(x, y) = q or d,(y,z) =q then d,(x,z) < 
d(x, y) + d,(y, z), and if both are 0 then(x€ PS>yePFY)and(yEeSoze 
SF) so (xEY>zEF), showing d,(x,z)=0< d(x, y) + d,(y, z). Also if 
p<q thn M(x)={y dyl(x,y)spja{yi delx% y) =O= {yi xe P= 
y€ FF}; this setis Y if x € Y, X otherwise, and clearly if p > q then N,(x) = 
By 5, ¢, X € To(%,); ifx € Y,0<p <q then N(x) CY, thus SE To(4,). If 
PE To) and f+ ¢ then let x € F; there’s some p > 0 such that YC N,(x) 
C Ff, thus “C FY. If ## Y we can choose x © P— F and then X = N,(x) CF, 
so X = FY. Thus To(4,,) = {0, %, X} C To(@). 

The above suggests that we might well try to build up from an appropriate 
collection of 2.,’s, and the next lemma sets up equipment which will allow us to do 
that. 


6. LEMMA. If for each iG I, A, is a value semigroup and P,C A, is a set of 
positives, then the sum of the P,, P = {r: r(i) © P, for each i € I and for all but a 
finite number of i € I, r(i) = 0,} is a Set of positives in the product A of the A,’s. 


Proof. (pl) if r,s © P, then for each 1 E J, (r A s)\(i) =r(i) A sti) © P,, and 
= oo, for all but the finite number of places at which r(i) # 00, or s(i) # o0,, thus 
r A. s © P; (p2) and (p3) are clear. For (p4) assume b < a+ r for each r € P, and 
note thatif i € J, g © P, and r(i, q) € P 1s defined by r(i, g)(i’) = 0, LiF i’ ET, 
r(i,g\i) = q, then r(i,g) € P. Thus b(i) < (a+ r)(i) =a(i) + q for arbitrary 
positive g, so b(i) < a(i), but since 7 was also arbitrary, b < a. 


7. THEOREM. Every topology is a To(2 ). 


Proof. Let q € (0, 00] be fixed throughout this proof. For each YE T (our 
topology) define d, as preceding 6 and let 2=(X,d,A,P), where A is the 
product of T copies of @, P the sum of T copies of (0, 00], and for each x, y € X, 
SET, d(x, y(S) =d,(x, y). A was shown to be a value semigroup following 
definition 1; P was shown to be a set of positives on A in proposition 6. Further, 
we showed that (m1), (m2) were satisfied at each coordinate by d,, thus they are 
satisfied by 2. It remains only to be shown that for % defined as above, To( 2) = T. 

Let SET, p € (0, 00] and define r(.Y, p) by r(¥%, pS) = p, (FZ, DXF’) 
= if SES’. Ney p(X) = {y: d(x, y) SWF, p)} = (y: doe Y) < p}, thus 
by the discussion of d.,, this setis Y if x € Y and p < q, X otherwise. This shows 
that T C To(2), since if x € ¥, then for p < g, r(Y, p) © P and Nyy (Xx) CF. 

It further leads to To(#) C T: If x € A © To(%) then for some r, € P, N,(x) 
Cf but {S: 7,(%) < oo} is finite, thus let AZ) =p, forl<i<n,r(F’) = © 
for S’E{S,...,4,}. Then r, = r( 4%, py) A+++ ALCS. ft Py), thus N, (x) is the 
finite intersection of the Nig» )(). Since each of these is X or Y%, N. (x) € T. 
Thus PC U,.gN,(x) CY, so F is a union of elements of T, therefore by (t2), 
Pe T. 


3. Symmetry and complete regularity. The above raises obvious questions as to 
the place of symmetry and (m4) in our scheme. A surprising feature of the proof of 
7 is that our “closed balls” were open, but not closed, and we also discuss questions 
related to that. Recall that T is 7, iff for each x, {x} 1s closed (X — {x} is open), 
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completely regular iff for each open # and each x € F there is a continuous 
f: X — [0,1] such that f(x) = 1 and f(y) = 0 for each y € F, Tychonoff iff it is 
T, and completely regular. 


8. PROPOSITION. To(2) is T, iff 2 is separated. 


Proof. If & is separated and x € X — {y} then d(x, y) # 0so for some r € P, 
d(x, y) €r; for this r, N(x) C X — {y}, and since x was arbitrary, X — { y} is 
open. Conversely, if 2 is T, and x, ye X, x # y, then x € X — {y} so for some 
re P, N(x) c X — {y}, thus d(x, y) ¢ r and in particular, d(x, y) # 0. 


Next, open and closed balls. Note that the obvious generalization of S,(x) need 
not always be an open set: Let X be the plane, A = 2’, P =(0,co]’, and 
d((w, x),(y¥, Z)) = (jw — yl, |x — z)), @= (X, d, A, P). The reader can verify that 
# is a continuity space and To( 2’) is the usual topology, but Sy ,)((0, 0)) is a closed 
square with the corners removed, thus not open. Some other concepts are needed: 


9. Definition. The positive dyadics are DY = {x = i/2*: i, k positive integers}. 


Notice that there is a multiplication -: DY X A — A defined by repeated applica- 
tion of addition and 1/2 (of (v2)). 

For x © X,b€ A, B,(x) =U{N,,(x): 2 € DY, x < 1} = { y: for some dyadic 
wz <1, d(x, y) < xb}, and is called the open ball of radius b about x. 

The dual of &= (X, d, A, P)is #* = (X, d*, A, P), where d*(x, y) = d(y, x) 
(and X,A,P remain unchanged). Dual concepts are denoted by placing * in 
appropriate places, e.g., N.*(x) = {y: d*(x, y) <r}. 

It can be shown routinely that this dyadic multiplication is well defined, 
distributive in both variables, (%)a = m(za) and la = a. Thus if x < m then 
ar<ar+(m-—a”)r=(2+m-— x)r = mr. It’s also clear by (p3) and (p2) that if 
r © P and x € DY then xr € P. Further, d* is always a continuity function, thus 
&* a continuity space, and for metric spaces (A = &, P = (0, oo]), if r € P then 
S.(x) = B(x). 


10. Proposition. If b€& A, x © X then Ny, (x) C B(x) CN, (x). B(x) is 
open ifr © P. For each x © X, the sets B, = {B,(x): re P} and N, = {N,(x): 
r © P} are both bases about x. For each b © A, x © X, N,*(x) is closed in To(&). 


Proof. The first inclusion holds since b/2 = (1/2)b, and the second follows from 
the fact that if « < 1 then ~b < 1b = 5b, so N,,(x) C N,(x). We next show that if 
r<P then B(x) is open: if y © B(x) then for some x <1, d(x, y) < zr; if 
m=(1—2)/2 then »€ DY so mre P, and if zEN_(y) then d(x,z) < 
d(x, y)+ d(y,z)<(#+m)r, so zEN,4,(x), and 2+ m <1, showing 
N.(y) © B(x). That B. and N, are bases about x now follows, since if x € PE 
To(2) then for some r € P, x € B(x) C N(x) CF. 

Finally, to see that N.*(x) is closed, suppose that y © X — N*(x). Then 
d(y,x) €r so by (p4), for some t€ P, d(y,x)&r+t. If d(y,z) <t then 
d(z,x) <r, for otherwise d(y, x) < d(y,z) + d(z,x) <t+r, a contradiction. 
But this contradiction shows N,(y) C X — N.*(x), thus the latter is open. 
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11. THEOREM. If % is,symmetric then To(&) is a completely regular topology, 
which is Tychonoff iff 2 is separated. Every completely regular topology arises from a 
symmetric continuity space. 


Proof. Suppose first that 2 is symmetric and let x € A € To(2). Then for some 
re P, Nx) Cc &. Define g,: A — [0, 0] by g,(a) = inf{x € DY: a < zr}; note 
thatifc <a+bandas<ar, b< mr thenc<ar+mr=(u+ m)r, SO g,(c) < 
g,(a) + g,(b). We define h: X — [0,1] by A(y) = min{ g,(d(x, y)),1} and next 
show that A is continuous. 

Since d(x, z) < d(x, y) + d(y,z) we have h(z) < h(y) + g,(d(y, z)) so 
h(z) — h(y) < g,(d(y, z)); similarly, h(y) — h(z) < g,(d(z, y)), so by symmetry, 
|h(y) — h(z)| < g,(d(Qy, z)). If p € (0, oc] then let x € DY, ~ < pj if d(y, z) < xr 
then |h(y) — h(z)| < g,(d(x, y)) < ~ < p, so A is continuous at this arbitrary y, 
thus on X. 

Also, h(x) = inf DY = 0 and if h(y) < 1 then for some x < 1, d(x, y) < xr < 
ar+(l-—a#)r=(a+1-—-za)r=r, so yEN(x) CY. Thus f= max{0,1 —h}: 
X — [0,1] is our function. 

We now proceed to show the converse; note in passing similarities between this 
proof and that of Theorem 7. Suppose T is completely regular; then there is a set F 
of continuous functions valued in [0, 1] such that whenever x € AY € To(% ) there is 
an f © F for which f(x) = 1 and f is 0 off A. We define A and P and show them 
to have the right properties in a manner analogous to that used in the proof of 7: 
A=R*%, P={re(0,0]*: r(f) = 0 for all but a finite number of f}. The 
definition of d must again be coordinatewise: d(x, y)(f) =|f(x) — f(y)|, and 
there is an obvious coordinatewise proof that d satisfies (m1)—(m3). We now show 
that T = To(2). 

For re P let r(f)=p,;> 0 for 1 <is<n, r(g) = oo otherwise; then y € 
Bix) @ d(x, y) < mr for some ~ <1, #m€ DY @ for each i between 1 and 2, 
f(y) — f,(x)| < »p, for some » € DY, m <1 for each i between 1 and n, 


Fv) ~ fx) <p ey ENA") — p,, f(x) + p;)]. This shows that 
| 0) "fC L(x) — pfx) + p,)]: X € X, tis--o th EF, Pis+++s Pn > 0 


i=] 
n a positive integer} = {B,(x): x © X}, so for each r € P, x © X, Bx) € T. If 
x € PE TO(%) choose r(x) € P such that B,,.(x) C FY; then PC U, <wB,,,)(X) 
CP, 80 P=VU,<gB,.)(x) € T as a union of sets in T, so To(#) C T. But if 
x € PET choose f€F such that f(x) =1, f(y) = 0 for y € FY and r(f) € P 
by r(f)(f) = 5, r(f)(g) = for g #f, and note that Ny~(x) = {y: d(x, y) < 
r(f)} = {y: jl -— fQy)| < 5} CY, so PE To(%), so To(%) = T. 

The assertions about Tychonoff spaces are verified by combining the above with 
Proposition 8. 
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Although the number of primes is infinite, one can still find as many consecutive 
composite integers “‘as one pleases’; 100, 500, 1,000, 1,000,000, or whatever. That is, 
there are “regions” of consecutive integers in the set NM = {1,2,3,...} of positive 
integers where no prime is to be found and the number of integers in such a region 
can be as many as we please. 

For example, if we wish to point out a region of 1,000 consecutive composite 
integers in N we may consider the numbers 1001! + 2, 1001! + 3,...,1001! + 1001 
and observe that the first integer is divisible by 2, the second is divisible by 3, and so 
on, and that the last integer is divisible by 1001. In general, to find k consecutive 
positive integers in N, none of which is a prime, we need only consider the numbers 
(A + 1)! + 2, (kK + 1)! 4+ 3,...,(4 + 1)! + (k + 1). It is to be pointed out, of 
course, that these numbers may not form the first such region in N. For example, 
the numbers 22! + 2, 22! + 3,...,22! + 22 form a region of 21 consecutive integers 
where no prime is present, but the integers 1130,1131,...,1150 form the first such 
region. It is also to be pointed out that one or both of the numbers (k + 1)! + 1, 
(k + 1)! + (& + 2) may not be prime, meaning that the region can be extended in 
one direction or the other (perhaps both) to include additional consecutive com- 
posite integers. This is not the case for the region containing the integers from 1130 
to 1150, inclusive, since 1129 and 1151 are both prime numbers. 

When a region of k consecutive composite integers c),C,,...,¢, cannot be 
extended in either direction to include additional composites because the integers 
c, — 1 and c, + 1 are both primes, we shall call the region a prime desert of length 
k. Thus the subset {1130 + 1,1131,...,1149, 1150} of N is a prime desert of length 
21 (and it is the first prime desert of length 21). 

Several questions immediately arise: 


1. Is there a prime desert of any given length k? 
2. If there is a prime desert of length k, how can one be found? Can we 
“construct” one? 
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3. If there is a prime desert of length &, where is the first such desert? 
4. If there is a prime desert of length &, is the number of such deserts finite or 
infinite? 


With regard to question 1 we can see immediately that in order to have a prime 
desert of length k, k must be an odd integer. So question 1 becomes: Is there a 
prime desert of any given odd length k? 

With regard to question 2 we remark again that the numbers (k + 1)! + 2, 
(A + 1)! + 3,...,(k + 1)! + (Kk + 1) are k consecutive composite integers. But they 
do not form a prime desert of length k because one of the integers (k + 1)! + 1 or 
(k + 1)! + (& + 2) must be a composite, as we now show. We observe first that in 
order for (kK + 1)! + (k + 2) to be prime, k will have to be an odd integer such that 
k + 21s a prime. For suppose k is odd such that k + 2 is not a prime. Then k + 2 
has a prime factor g, where g < yk + 2. Since VK +2<k+1,q<k +1 andit 
follows that g\(k + 1)!. Thus (A + 1)! + (kK + 2) is not a prime. Similarly, if k + 4 
is not a prime, then (k + 1)! + (k + 4) is not a prime. In general, we have the 
following: 


THEOREM 1. Suppose k is odd and consider the set S = {k + 2,k + 4,...,k + 
2m)}\ where it is the case that each integer in S is not a prime but k + 2(m + 1) is a 
prime. If k* +k > 2m —1, then the consecutive composites (k + 1)! + 2,(k + 1)! 
+3,...,(k + 1)! + (k + 1) can be extended to include the composites (k + 1)! + (k 
+2),(k +1)! + (k + 3),...,(k + 1)! + (k + 2m), (kK + 1)! + (kK + 2m 4 21). 
(A + 1)! + (k + 2m + 2) may, or may not, be a prime, depending on the value of k. 


Proof. Each element k + 27, 1<j<m of S has a prime factor g,, where q, 


< yk +27. Since yk +27 <k +1 for k*+k>2j—-1, q(k + 1)!, implying 
that (k + 1)! + (k + 27) is a composite. 


When k is odd (k #1) such that k + 2 is a prime, we have, by Wilson’s 
theorem, that (kK + 2 — 1)! = —l (mod k + 2). That is, (kK + 1)! + 1 is divisible by 
A + 2 and thus is not a prime. Therefore, the consecutive composites (k + 1)! + 2, 
(A + 1)! + 3,...,(k + 1)! + (k + 1) can be extended (to the left) to include the 
composites (kK + 1)! and (Ak + 1)! + 1. (k + 1)! — 1 may, or may not, be a prime, 
depending on the value of k. 

With regard to question 4 we note that for k = 1 we are asking whether the 
number of prime deserts of length 1 is finite or infinite. This is equivalent to asking 
if the number of twin primes 1s finite or infinite, a long standing unsolved problem 
in number theory. 

Since the main purpose of this article is to consider consecutive prime deserts, in 
particular consecutive prime deserts each having the same length, we shall not 
pursue the above questions directly, but, rather, indirectly. (By n consecutive prime 
deserts we shall mean n deserts determined by v + 1 consecutive primes. Note that 
if the primes are also in arithmetic progression, the consecutive prime deserts all 
have the same length, and conversely. For a brief discussion of arithmetic progres- 
sions of primes see [1, pp. 10-—15].) We begin with the following definition. 
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DEFINITION. Two consecutive prime deserts, each having the same length k, are 
called prime desert twins of length k. Three consecutive prime deserts, each having 
the same length k, are called prime desert triplets of length k. And so on. We may 
refer to n consecutive prime deserts, each having the same length k, as prime desert 
n-tuplets of length k. 


Now suppose p is an odd prime, p # 3. To have desert twins of length k (where 
the deserts begin at p + 1) it must be the case that p, p+(k +1) =(p+1)+k, 
and p+ 2(k +1)=(p+t 2) + 2k are all prime integers. Since p # 3, one of 
p+1, p+2 must be divisible by 3. Thus k #0 (mod3) and we have k =1 
(mod 3) or k = 2 (mod 3). Suppose k = 1 + 3m. Then p+1+k =(p+ 2) + 3m, 
implying 3+ p+ 2, and p+2+2k=(p+1)+ 30 + 2m), implying 3+ p + 1. 
Therefore, it must be the case that k = 2 (mod 3). This, together with the fact that 
k = 1 (mod 2), tells us that k = 5 (mod6). We have proved the following: 


THEOREM 2. Suppose p is an odd prime # 3 and k is odd. The existence of prime 
desert twins of length k implies that k = 5 (mod 6). 


So it is necessary to have k = 5 (mod 6) in order for prime desert twins of length 
k to exist. But is it the case that given such a k, there are prime desert twins of 
length k? Let us try to construct a desert twin of length k = 5. We will need to find 
integers p,p+1,p+2,...,p +11, p+12 where p, p+ 6, and p+12 are 
primes and the remaining numbers are composites. Since p is a prime, each of 
p+1,pt 3,...,p +11 is divisible by two. As noted above, assuming p # 3, 
either 3|p + 1 or 3|p + 2. Suppose the former. Then also 3|p + 4, 3|p + 7, and 
3| p + 10. What about p + 2 and p + 8? They must also be composites or we will 
not have prime deserts of length 5. If we also assume p # 5, then one of the 
numbers p + 1, p + 2, p + 3, p + 4 must be divisible by 5. But since p + 6 and 
p +12 must be primes, 5 cannot divide p + 1 or p + 2. Suppose 5|p + 3. Then 
also 5|p + 8. Finally, supposing 7|p + 2 gives us the situation described below: 


n 0 1 2 3 4 5 6 7 8 9 10 11 12 
divisors 
ofpt+tn — 2,3 7 2,5 3 2 — 2,3 5 2,7 3 2 =- 


Question: Is there a prime p such that p + 6 and p + 12 are also prime and 
such that the above is satisfied for the numbers p+ 1,..., p +5 and p+/7,..., 
p +11? Answer: Yes. p+ 1 divisible by 3, p+ 3 divisible by 5, and p+ 2 
divisible by 7 suggests we need to find a prime p that satisfies the system of 
congruences 


p=2(mod3) p=2(mod5) ~~ p= 5(mod7) 


where it is a/so true that p +6 and p+ 12 are primes. The Chinese remainder 
theorem guarantees an infinite number of solutions to such a system of linear 
congruences (all congruent to one another modulo 3 - 5 - 7) but, of course, not all 
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solutions are primes. And when a particular solution p is a prime, p + 6 and/or 
p + 12 may not be prime. 

It is routine to solve the above system and arrive at p = 47 (mod105). All 
solutions are of the form p = 47 + 105m and we seek m such that p, p + 6, and 
p +12 are primes. m=0 leads to p= 47, p+6= 53, and p+ 12=59, all 
primes. Thus 48, 49, 50, 51, 52 and 54, 55, 56, 57, 58 are prime desert twins of length 
k = 5. It is interesting to note that this desert twin is the first desert twin where 
k = 5. 

Dirichlet’s theorem tells us that every sequence of positive integers {a + bm}, 
where a and 5b are relatively prime, contains an infinite number of primes. Thus 
there is an infinite number of values of m for which 47 + 105m is a prime. (Note 
that m must be even.) Dirichlet’s theorem also tells us that each of the sequences 
{53 + 105m} and {59 + 105m} contains an infinite number of primes. It would 
seem reasonable, then, to expect other values of m to produce three primes p, 
p+ 6, p+ 12, where p € {105m + 47}, p+6€ {105m + 53}, and p+126 
{105m + 59}. The reader can check that this is the case for m = 2, 10, 50, and 74. 
But is the number of such m finite or infinite? Is the number of prime desert twins 
of length 5 finite or infinite? 

One can certainly form a construction of the type used above for k = 5 to try 
and find a prime desert twin for any k = 5 (mod 6). Such a construction will lead to 
a system of congruences that will have a solution p = a (mod ¢) with (a, t) = 1 by 
the Chinese remainder theorem. This solution will lead to the three sequences 
{tm+a}, {tm+a+(k+1)}, {tm+ a+ 2(k + 1)} and each of these contains 
an infinite number of primes by Dirichlet’s theorem. Each value of m that yields a 
prime in all three sequences gives a prime desert twin. But, will there always be such 
an m, and if so, is the number of such m finite or infinite? 

Now let us turn our attention to prime desert triplets. Suppose p is a prime # 3. 
To have prime desert triplets beginning at p + 1 we must first have prime desert 
twins; thus k = 5 (mod6) with p + (k + 1) and p + 2(k + 1) primes. For a desert 
triplet it also will have to be the case that p + 3(k + 1) is a prime. Since p # 3 and 
p+(k +1) is a prime, p + 3(k + 1) is not divisible by 3 or p. Suppose g is a 
prime > 3, g #p. If gk + 1), then gt p+ i(k + 1)Wi. If q+ (k + 1), then for 
i = 1,2,3, we have i(k + 1) =a,(mod q) where 1 <a,;<q-—landi#j>a,# 
a,. Thus we may write 


pt({k+1)=(pta,)+ qb, 
p+2(k+1)=(pta,)+ qb, 
p+ 3(k +1) =(p+a,) + qbs. 
Since p, p + a,, p + a,, and p+ a, do not constitute the g consecutive integers 
from p to p+q-—1, p + 3(k + 1) need not be divisible by g. Thus there are no 


further restrictions on k due to the prime g and so it is necessary only for k to be 
congruent to 5 modulo 6 to have prime desert triplets of length k. 
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Do prime desert triplets exist for every k such that k = 5 (mod6)? Can prime 
desert triplets of length k be constructed? If yes, then for k = 5 we would need to 
find primes p, p + 6, p+ 12, and p+ 18 such that the integers p +1, p+ 2, 

.p+5,pt7pt+8,...,pt+1lipt 13,p+14,...,p+17 are all com- 
posites. Consider the following: | 


n 0 1 2 3 4 5 6 7 8 9 10 11 12 
divisors 
ofptn -—- 2,3 7 2 3,5 2 -— 2,3 11 2,57 3 2 - 
n 13 14 15 16 17 —= 18 
divisors 


ofptn 2,3 5 2 37 2 = 
This leads to the system of congruences 
p=2(mod3) p=1(mod5) ~~ p=5(mod7) p = 3 (mod 11) 


which in turn leads to p = 971 (mod 1155) or p = 971 + 1155m. Now we see that 
we need a value for m such that p, p + 6, p + 12, and p + 18 are all primes. The 
sequences {1155m + 971}, {1155m + 977}, {1155m + 983}, and {1155m + 989} 
each contain an infinite number of primes, but is there a common value of m that 
yields a prime in each sequence? Yes; the first such m is m = 210. Thus we have 
constructed the prime desert triplets of length 5 determined by the four primes 


243,521 243,527 243,533 243,539 


Are there other values of m? How many? 
The following construction leads to the first prime desert triplets of length k = 5. 
They are determined by the primes 251, 257, 263, and 269. 


n 0 1 2 3 4 5 6 7 8 9 10 11 12 
divisors 
of p+n — 2,37 11 2 35 2 - 23 7 25 3 2 =- 
n 13 14 15 16 17 18 
divisors 


ofp+n 2311 5 27 3 2 - 


So, at least in the case k = 5, it is actually possible to construct prime desert 
triplets. 

One should soon wonder if it is possible to “extend” prime desert triplets to 
prime desert quadruplets. Well, if k = 5, we would need p, p + 6, p+ 12, p + 18, 
and p + 24 all to be primes. If we assume p # 5, then either 5|p + 1, 5|p + 2, 
5|p + 3, or 5|p + 4. Since p+6=(p+t+1)+5,5+p+1 because otherwise no 
prime desert of length 5 would exist from p to p + 6. Since p + 12 = (p+ 2) + 10, 
5 + p + 2 because otherwise no prime desert twins would exist from p to p + 12. 
And since p + 18 = (p + 3) + 15,5 + p + 3 else we would not have prime desert 
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triplets from p to p + 18. Therefore, it must be the case that 5| p + 4, implying that 
prime desert quadruplets of length 5 are not possible. A similar analysis will show 
that prime desert quadruplets of length k = 11 are not possible. 

Since 29 = 5 (mod 6), it is possible to have prime desert triplets of length k = 29. 
Might such a triplet be extended to a quadruplet? Unlike the cases above for k = 5 
and k = 11, this time the answer is yes. If we have a prime desert triplet of length 
29 beginning at p+ 1 for some prime p, then we know that p, p + 30, p + 60, 
and p+ 90 are primes. If it is also the case that p+ 120 is a prime and 
pt 91, p+ 92,..., p + 119 are composites, then we have a prime desert quadrup- 
let. p #5 implies that 5 does not divide any of the integers p + 30, p + 60, 
p + 90, or p + 120. Thus the restriction due to division by 5 that prevents the 
existence of prime desert quadruplets of length k = 5 and k = 11 (and k = 17 and 
k = 23 as well) does not hold for k = 29. In fact, prime desert quadruplets of length 
29 do exist; the first such one is determined by the five primes 


9,843,019 9,843,049 9,843,079 9,843,109 9 843,139 


The above discussion regarding prime desert quadruplets suggests the following 
theorem: 


THEOREM 3. Suppose p is a prime, p > 5, and k is odd. The existence of prime 
desert quadruplets of length k implies that k = 29 (mod 30). 


Proof. To have prime desert quadruplets of length k beginning at p + 1, it must 
be the case that p,(p+1)+k, (p + 2) + 2k, (p + 3) + 3k, and (p + 4) + 4k 
are all prime integers. Since p > 5, one of p+ 1, p +2, p+ 3, p+ 4 must be 
divisible by 5. Thus k # 0 (mod 5) and we have either that k = 1 (mod5), k = 2 
(mod 5), k = 3 (mod5), or k = 4 (mod5). Suppose k = 1+ 5m. Then p+1+k 
= (p+ 2) + 5m, implying 5 t p+ 2; p+24+2k =(p+ 4) + 10m, implying 5 + 
p+4; p+3+4+ 3k =(pt+1)+ 501 + 3m), implying 5+ p+1; and p+4+ 4k 
= (p+ 3) + 5(1 + 4m), implying 5 + p + 3. Therefore, k # 1 (mod 5). Similarly, it 
follows that k # 2 (mod 5), and & ¥ 3 (mod 5). Thus it must be the case that k = 4 
(mod 5). This, together with the fact that kK = 5 (mod 6), implies k = 29 (mod 30). 


For k = 29 we can also note the following: p + 30 =(p+t 2) + 28, p+ 60 = 
(p+4)+ 56, p+90=(pt+ 6)+ 84, pt+120=(p+1)+119, p+150= 
(p + 3) + 147, and p + 180 =(p+5)+ 175. If p # 7, one of p+ 1, p+ 2,..., 
p + 6 must be divisible by 7. Thus if we have a prime desert quadruplet beginning 
at p +1, 7 cannot be a factor of p + 2, p+ 4, p+ 6, or p + 1. If it is also the 
case that 7 + p + 3, a prime desert quintuplet is possible. In this event 7 must divide 
p +5 and thus at most prime desert quintuplets are possible. 

A necessary condition on the length k of prime desert n-tuplets is given by the 
following theorem, the proof of which is readily established by induction on i: 


MAIN THEOREM. Let p, denote the ith odd prime and suppose k is odd. If p > p;, 
then a necessary condition for the existence of prime desert ( p; — 1)-tuplets of length k 
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beginning at p + 1 is 

k =(2-3-5--- p;— 1) (mod2-3-5--- p,). 
The same restriction on k applies to prime desert n-tuplets for n such that p; —-1<n«< 
Pi4, ~2. Thus if k=(2-3-5-++ p;—1VQmod2-3-5--- p,) but k#(2-3- 
5 +--+ p.,—1)(mod2-3-5--- p;,,), then a prime desert ( p,; — 1)-tuplet may be 
extended to at most a prime desert ( p,,, — 2)-tuplet. 
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Introduction. ‘Take a four digit number x. Form a new number 4 by rearranging 
the digits of x in descending order. Form a new number B by rearranging the digits 
of x in ascending order. Define K(x) = A — B. In [4], D. R. Kaprekar discovered 
that after at most 7 such operations every four-digit number in base 10 leads to the 
same result, 6174. 

In the late 60’s and early 70’s, the problem of proving and possibly generalizing 
the above discovery made the rounds at Berkeley, MIT, etc. ({1], [6]). D. D. 
McCracken said that the problem was called “the 6174 problem”, and he conjec- 
tured that every three-digit number in an even base leads to a similar phenomenon 
[5]. 

The authors were introduced to the problem as such. 

In this article, we study such an operation, called a Kaprekar transformation 
(transformation for short), on three-digit numbers in arbitrary bases. In these 
situations it is possible, as we shall see, to give quite a complete description of what 
happens: 


(a) For a three-digit number x in base r, the result of a single Kaprekar transforma- 
tion is determined by the difference between the largest and smallest digits, I, of x and 
the base r: the first digit of the result is I — 1, the second digit r — 1, and the third 
digit r — I. 
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If 728 is in base 15, for instance, then J is 6 and r is 15; hence, (6 — 1,15 — 1,15 
— 6), or 5149 is the result after one transformation. Since J of 5149 is 9, 8146 is the 
result after the second transformation on 728. (We denote digits larger than 9 by 
representing the digits in base ten and underscoring. An ordered triple and a 
three-digit number with its digits written in juxtaposition will be used interchange- 
ably, sometimes one preferred to the other with respect to clarity.) 


(b) There is a unique number, namely, ((r — 2)/2,r—1,r/2), to which every 
three-digit number in base r converges after at most (r+ 2)/2 transformations (1 
transformation for r = 2) if and only if r is even. (The unique number is called the 
Kaprekar constant.) 


All three-digit numbers in base 1986, for instance, converge to the number 992 
1985 993 after no more than 994 transformations. 


(c) There is a unique regenerative loop consisting of two numbers, ((r — 3)/2, r — 1, 
(r+ 1)/2) and ((r — 1)/2,r—1,(r — 1)/2), to which every three-digit number in 
base r leads after at most (r + 1)/2 transformations (1 for r = 3) if and only if r is 
odd. (The two numbers, each the image of the other under the transformation, are 
called loop elements.) 


In base 1987, for instance, every three-digit number becomes either 992 1986 994 
or 993 1986 993 after at most 994 transformations. 


(d) The smallest number of transformations necessary for a three-digit number in an 
even base r to become the Kaprekar constant is 2 + r/2 — lifI <r/2,orl+1—r/2 
if I > r/2, where I is as described in (a). 


Consider 867 in base 10, for instance. The difference J is 2 and r is 10. Hence, 
after exactly 2 + 10/2 — 2, or 5 transformations it becomes the Kaprekar constant 
495. 


(e) The smallest number of transformations necessary for a three-digit number in an 
odd base r to become a loop element is: (r+ 3)/2—TI if I<(r—-1)/2; 1 if 


=(r—1)/2; I-(r-)D 2 ifl>(r-1//2. 


If 867 is in base 15, for example, then J is 2 and r is 15. Hence, it requires at 
least (15 + 3)/2 — 2, or 7 transformations for 867 to become 7147. 

J. H. Jordan had proved a stronger version (even and only even bases) of 
McCracken’s conjecture in his 1964 paper and identified Kaprekar constants in his 
proof [2, Theorem 3]. 

C. W. Trigg, 1974 paper, stated without proof the results mentioned in (b)-(e) 
above [7]. 


1. Preliminaries. We denote by S(n, r) the set of all positive integers expressible 
in n-digit sequences in base r exclusive of those digit sequences with all digits equal. 
For example, $(3,10) is the set of all decimal integers between 001 and 998 


1988] THE DETERMINATION OF KAPREKAR CONVERGENCE 107 


excluding 111, 222,..., 888. A “three-digit number’’ means an element of S(3, r) 
for an appropriate base r. 

Let K: S(n, r) > S(n, r) be that transformation which permutes the n digits of 
a number in S(n, r) to obtain the largest and the smallest number expressible using 
the same n digits and then subtracts the smallest from the largest. K is called the 
Kaprekar transformation. Recursively, we define 


K! {x for /=0 
(x) = K(K'"'(x)) for! =1,2,3,.... 


Let x © S(n,r). If K(x) =x, then x is said to be a fixed point of K. If 
x € S(n,r) is a fixed point, and K[S(n, r)] = {x} for some positive integer m, 
then x, if it exists, is called the Kaprekar constant of S(n, r) and the smallest such 
integer m is called the degree of Kaprekar convergence of S(n,r). If y € S(n,r) 
and K‘(y) = x for some positive integer /, where x is a fixed point of K in S(n, r), 
then y is said to converge to x under K and the smallest such integer / is called the 
degree of convergence of y to x. (Note that fixed points may not be unique; for 
instance, both 1001 and 0111 are fixed points in S(4, 2).) If, in particular, x is the 
Kaprekar constant, then y is said to be Kaprekar convergent and /—the degree of 
convergence of y to x—is called the degree of Kaprekar convergence of y. 

If, for some integer m, K(x) = K™*"(x) and K™(x) # K™*/(x) for all 7 such 
that 1 <7 <i, then K(x) is said to generate a loop of length ¢ and each of 
K"™*i(x), 7 = 0,1, ..., i-1, is called a loop element. (Note that being a unique 
fixed point does not necessarily mean the fixed point is the Kaprekar constant, and 
that the existence of a loop does not preclude the existence of a unique fixed point; 
in $(4, 20), for example, 12 3 15 8 is a unique fixed point while 719 19 12— 12 6 12 


8 > 6315 145127118 >5216 15> 1499 6 +719 19 12 form a loop of 
length 6 with each element being the image of the preceding one under K.) 

If K(x) is a loop element for some integer m, then x is said to be loop 
convergent and the smallest such integer m is called the degree of loop convergence 
of x. If every member of S(n, r) is loop convergent, then S(n, r) is said to be loop 
convergent and the smallest number of applications of K necessary for all members 
of S(n, r) to be loop elements is called the degree of loop convergence of S(x,r). 

We study the Kaprekar transformation on three-digit numbers only in this paper. 
Throughout the paper all numbers are nonnegative integers and K stands for the 
Kaprekar transformation; r for the base (radix) of a number system; J for the 
(positive) difference between the largest and smallest digits of a three-digit number. 

For x € S(3p, r), we denote by d(n, i; x) the ith digit of K”"(x) (counting from 
left to right) and by m(n, i; x) the ith digit in the ascending order of the magnitude 
of digits of K"(x) (that is, m(n,1;x) < m(n, 2; x) < m(n,3; x) with at least one 
strict inequality). If there is no ambiguity, we simply write d(n,i) and m(n, i), 
respectively, for d(n, i; x) and m(n,i; x). For x = 792 in S(3,10), for example, 
d(0,1) is 7 while m(0,1) is 2, the smallest digit; d(1,1) is 6 while m(1, 1) is 3 since 
K(x) = 693; d(2,1) = 5 = m(2,2) and d(2,3) = 4 = m(2,1) since K(x) = 594, 
and so on. 
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With these notations we can write the image of x in $(3, r) after n applications 
of K as follows: 


K"(x) = (d(n,1; x), d(n,2; x), d(n, 3; x)) 
or equivalently as d(n,1; x)r* + d(n,2; x)r + d(n, 3; x). 


2. The mechanics of Kaprekar transformation. We begin with a fundamental 
lemma from which all our results will be derived. 


LEMMA 1. Let x be in S(3,r). Then, for every n = 1, we have 


d(n,1;x) =m(n-—1,3;x)—m(n-1,1;x)-1 (1) 
d(n,2;x)=r-1 (2) 
d(n,3;x) =r+m(n-1,1; x) —m(n-1,3; x). (3) 


Proof. Let x € S(3,r) and n= 1. By definition of the transformation K and 
due to the fact that the largest digit is greater than the smallest (hence, borrowing is 
necessary in the subtraction), we obtain 


K"(x) = |[m(n—1,3; x)r? + m(n—1,2;x)r + m(n—- 1,1; x)| 
—[m(n- 1,1; x)r? + m(n — 1,2; x)r + m(n- 1,3; x)| 
= |m(n-1,3;x) —m(n—-1,1;x)-1]r’ 
+[r+m(n-—1,2;x)-—m(n—1,2;x)-1]r 
+[r+m(n-—1,1;x) —m(n-—1,3;x)]. 
Hence, the first digit of K"(x) is m(n — 1,3) — m(n — 1,1) — 1, the second is 


r — 1, and the third is r + m(n — 1,1) — m(n — 1,3). Of 
From Lemma 1, one can readily see that, for every n > 1, 


d(n,1; x) + d(n,3;x) =d(n,2;x)=r-1 
= m(n,3;x) =m(n,1; x) + m(n,2; x). 
From (1) and (4), it follows that, for all n > 1, 
d(n+1,1;x) =m(n,2;x)-1; (5) 


(4) 


and from (3) and (4) 
d(n+ 1,3; x) =m(n,1;x) +1. (6) 
The lemma also shows that the image K(x) of a three-digit number x in a 
particular base r is determined by the (positive) difference between the largest and 


smallest digits of x; hence, one needs to know only this difference to evaluate K(x). 
We denote this difference by I(n; x): for x € S(3, r), 


I(n; x) = m(n, 3;x) — m(n,1;x) for every n = 0. 
Note we use J for J(0, x) and that: if x and y are in $(3,r), then for each n > 1, 
K"(x)=K"(y) ifandonlyif J(n-—1;x)=I(n-1; y). 
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Though K may be thought of as a function of J, after the initial application of K 
the middle digit, being r — 1, remains always the largest. So we investigate the 
behavior of the first and third digits under K, which are, in some order, the smallest 
and middle-sized digits. We denote by J(n; x) the (nonnegative) difference between 
the smallest and middle-sized digits; that is, for x € S(3, r), 


J(n; x) = m(n, 2;x) — m(n, I;x) for n> 0. 
Then, for n > 1, 
J(n; x) = m(n,2; x) — m(n,1; x) =|d(n,1; x) — d(n,3; x)}. 
If we write (1) and (3) using [(n — 1; x) for m(n — 1,3; x) — m(n — 1,1; x), 
then J(n; x) = |r — 21(n — 1; x) + 1|. Thus, for » > 1, 
r—21(n-1;x)+1 iff(n-—1;x)<(r+1)72 
J(n; x) = . 
2in-1;x)-r-1 iff(n-1;x)>(r+1)72 


(7) 


LEMMA 2. Let x be in S(3,r). Then, for alln =1, J(n, x)—the (nonnegative) 
difference between the first and third digits of K"(x)—is odd if and only if the base r is 
even. 


Proof. By (4), the base r and m(n,1) + m(n,2) have opposite parity. But, 
m(n,2) — m(n,1) and m(n,1) + m(n, 2) have the same parity. 
From (5) and (6), it follows that, for n > 1, 
d(n+1,1;x) —d(n+1,3;x) =J(n; x) - 2. 
Hence, for every n > 1 and every x in S(3, r), 
d(n+1,3;x)<d(n+1,1;x) ifandonlyif J(n; x) > 2. (8) 


Consequently, if J(; x) > 2 for any n > 1, then by the definition of J and 
equations (5) and (6), 


J(n+1;x) =J(n; x) - 2. 


If J(1; x) = 2, therefore, then each application of K decreases J(1; x) by 2 as long 
as it is not already less than 2. Hence, J(1+ i; x) =J(1; x) — 2i for all i< 
J(1; x)/2. Thus we have proved 


Lemma 3. Let x be in S(3,r). If J(1: x) = 2, then 
J(1 + i; x) =J(1; x) - 2i for alli < |JQ; x)/2|. 


3. Determination of Kaprekar convergence and loop convergence. According to 
Lemma 1, for every x in S(3,r), 


K(x) =(J-1,r-1,r-T). (9) 
Hence, K[S(3, r)] has exactly r — 1 elements, namely, (J —1,r-—1,r—J), I= 
1,2,...,7—1; moreover, for every n>1, K"(x) must be one of these r— 1 


numbers. For some n < r, therefore, K"(x) must be either a fixed point or a loop 
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element that starts a regenerative loop. It turns out that whether the base is even or 
odd determines which of these alternatives actually happens. 

Lemma 3 implies that J(1; x) decreases by 2 each time K is applied, eventually 
reaching 1 or 0 according as the base r is even or odd (due to Lemma 2), yet says 
nothing about the values of J( ; x) thereafter. By (5) and (6), however, 


J(n+1;x) =|J(n;x) — 2) forevery n > 1. 


Therefore, J(n; x) = 1 for all n > / if J(/; x) = 1 for some / > 1 whereas J(n; x) 
is alternately 0 and 2 for all n > / if J(/; x) = 0 for some / > 1. So the difference 
between the first and third digits of a fixed point, if it exists, is 1 whereas that of a 
loop element, if it exists, is either 0 or 2. 

Now, let x € S(3,r) and write k = J(1; x). 

Suppose k is odd. (Note that this happens if and only if 7 is even, by Lemma 2.) 
If k > 1, then, by Lemma 3, J(/; x) =1 for some / > 2 and the smallest such 
integer / is (k + 1)/2; hence, by (8), d((k + 1)/2,3; x) + 1 = d((k + 1)/2,1; x). 
Applying (4) to this and using (5) and (6), we see that K ‘**+°)/*(x) is ((r — 2)/2, 
r — 1, r/2) and is a fixed point of K. Since & > 1 if and only if J is neither r/2 nor 
(r + 2)/2, (k + 3)/2 is 


r/2+2—-I(ifI<r/2) or I—r/2+1(if > (r+ 2)/2) 


by (7). 
If k=1, there are two cases to consider: J=r/2 and J =(r + 2)/2. 
If J =r/2, then, by Lemma 1, K(x) is equal to ((r — 2)/2,r —1,r/2); if T= 
(r + 2)/2, then K?(x) is equal to ((r — 2)/2, r — 1, r/2). 

Thus, ((r — 2)/2, r — 1, r/2) is the fixed point to which every member of S(3, r) 
converges and hence, it is the Kaprekar constant if and only if r is even. Moreover, 
the smallest number of transformations K necessary for a three-digit number x in 
S(3, r) to converge to the Kaprekar constant is 


2-J+r/2 ifl<r/2, 
l+J—-r/2 iff>r/2. 


Next, suppose that k is even (this happens if and only if r is odd). If k > 2, then, 
by Lemma 3, J(/; x) = 0 for some / > 2 and the smallest such / is (k + 2)/2. 
Applying (4), we see that K (*+%/2(x) is ((r — 1)/2, r — 1,(r — 1)/2). And apply- 
ing K twice more, we can show that ((r — 1)/2, r — 1,(r — 1)/2) and ((r — 3)/2, 
r—1,(r + 1)/2) form a regenerative loop of length 2 as each is the image of the 
other under K. (Note the values of J( ; ) of these numbers are 0 and 2, respectively; 
and J(k/2; x) is also 2, but K*/?(x) is not a loop element.) 

If k = 0, then by (4), K(x) = (r — 1)/2, r—1,(r — 1)/2), a loop element. 

Thus, --- ~ ((r—-1)/2,r-1,(7—- 172) ~ (r — 3)/2,r—-1,(r + 1)/2) > 
(r — 1)/2,r—1,(r—1)/2) > --: form a regenerative loop of length 2 to which 
every member of $(3, r) converges if and only if r is odd. 

From Lemma 1, we see that 


K(x) = ((r-1)/2,r-—1,(r-—1)/2) ifandonlyif J = (r+ 1)/2; 
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whereas 
K(x) = ((r-— 3)/2,r—-—1,(r+1)/2) ifandonlyif I= (r—-1)/2. 


If J is neither (r + 1)/2 nor (r — 1)/2, then, by (7), k => 2 with the equality 
holding only if J = (r + 3)/2, and (k + 2)/2—the smallest number of transforma- 
tions K necessary for x to become a loop element in this case—is, by (7), 


(r+ 3)/2-I(GfI<(r-1)/2) or I-(r-1)/2 GfI> (r+1)/2). 


Accordingly, the smallest number of transformations K necessary for a three-digit 
number x in S$(3, r) to become a loop element is 


(r+ 3)/2-I if <(r-1)/2, 
1 if l= (r-1)/2, 
I-—(r-1)/2 iff >(r+1)/2. 
We have thus proved the main result, namely, 


THEOREM I. 

(A) The set S(3, r) has the Kaprekar constant ((r — 2)/2,r — 1, r/2) if and only if r 
is even. 

(B) The set S(3,r) is loop convergent with a unique regenerative loop of length 2 
consisting of ((r — 3)/2,r —1,(r + 1)/2) and ((r — 1)/2,r—1,(r — 1)/2) if 
and only if r is odd. 

(C) If x in S(3, r), where r is even, is not the Kaprekar constant, then its degree of 
Kaprekar convergence is given by: 

G)2-I+r/2 ifIl<r/f2 
Gi) 1+/J—r/2 wfl>r/2. 

(D) If x in S(3;7r), where r is odd, is not a loop element, then its degree of loop 

convergence is given by: 


(i) (r+ 3)/2-TI ifl<(r-1)72 
(ii) 1 iff =(r—1)/2 
(ii) J-—(r—1)/2 iff > (r—-1)/2. 
COROLLARY 1. 
(A) The degree of Kaprekar convergence of S(3,1r), where r is even, is 1+ r/2 for 
r>4andifr=2. 
(B) The degree of loop convergence of S(3, r), where r is odd, is (r+ 1)/2 forr > 5 
and 1 ifr = 3. 


Proof. The degree of convergence of the set S(3, 7) is equal to the maximum 
degree of convergence over all members of S(3,7r). Hence, from (C) and (D) of 
Theorem 1 follows the corollary. 


4. Additional examples. We combine (5) and (6) into 
K"(x) = (m(n - 1,2; x) —1,r-—1,m(n-1,1; x) +1) forn>2. (10) 


Then, we can use (9) and (10) as an algorithm to evaluate K"(x) for all n > 1. 
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Consider 787, for example. If it is in base 10, then using (9) and (10) we easily 
obtain the successive images: 787 — 099 —> 891 — 792 — 693 — 594 — 495 > 
495 > ---; if it is in base 15, its successive images are: 787 > 0 14 14> 13 141 
> 12142->11143>10144 59145 >8146>7147>6148 +7147 
>6148-—> .:-:. 

Examples like these, which can be readily obtained (by a computer or mentally), 
can be used to substantiate the formulas in Theorem 1 and its corollary as well as 
some properties of K described elsewhere in the paper. Notice that we chose the 
number 787 so as to exhibit an element of $(3,r) whose degree of convergence 
coincides with that of the set S(3, r). (K"[S(3, 10)] = {495} if and only if ” > 6, 
and Kn[S(3,15)] = {7 14 7, 6 14 8} if and only if n > 8.) 


The authors would like to mention, in passing, that the material presented in this 
paper provides answers to simple questions—perhaps suitable for exercise problems 
for elementary courses—such as: 


1. How many three-digit numbers in base 10 will become 892 under K? 297 
under K? 

2. Given a base, say 1986, determine the number of all three-digit numbers in 
S(3, 1986) whose degree of Kaprekar convergence is exactly that of S(3, 1986). 

3. If a three-digit number is randomly chosen in base 10, what is the probability 
that the number will become 495 after 2 Kaprekar transformations? After 5 
transformations? 
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The Editor’s Corner: The Quest for Normality 
HERBERT S. WILF 


The integral. Until now these columns have discussed research developments. 
This month the offering is more in the spirit of the ‘Teaching of Mathematics’ 
department: the discussion will concern a subject that has been well known to 
specialists for years, but deserves wider dissemination. We are going to do two 
things. First we will evaluate a certain n-fold definite integral. Second, although the 
evaluation is pretty enough to need no further justification, it has one: from it we 
can (and will) derive the form of the normal distribution function in 1 variables. 

The integral that you are about to see is available in various references, the 
application of it is also to be found in standard places (e.g. [1]), but hopefully some 
readers will find a new and interesting addition to a linear algebra course some- 
where in the paragraphs below. It is a relative of the well-known evaluation 


f edt = yr. (1) 

All of us who have taught calculus know the devices that combine to prove (1). 

We write down the integral twice, using different dummy variables, push together 

the two integrals, switch to polar coordinates, etc., etc. My students always think 

that they’re learning another general technique for integration. I try to tell them that 

there are virtually no other applications of this whole chain of events, but I don’t 
think they believe me. The related integral is given by the following theorem. 


THEOREM 1. Let A be ann Xn real, symmetric, positive-definite matrix. Then 


gp 8/2 


Pe Le AM dy, dey + de, = ————. 2 
i- i- ; ? " /det( A) ( ) 
n times 
In the above, x = (x,, Xy,..., x,), and (a,b) is the usual inner product of two 


vectors a and b. 
Before we get down to the proof, which is the really nice part, here is a small 
example. Suppose you wanted to calculate 


[= f f e 5x6 +4xY dy dy, (3) 


—CO” — wo 


First notice that the argument of the exponential is as in (2), where the matrix A is 
_ 5 -2 
A=( 5 ~2). 
A quick check shows that A is positive definite, and from (2) we find painlessly that 
7 
V26 
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Now let’s prove the theorem. First, since A is real and symmetric, we can write 
A = PAP’, where P is a real orthogonal matrix, A is diag(A,,..., A,,), and the ’s 
are the eigenvalues of A. Make a change of variables in (2) by letting y = P’x. 
Orthogonal matrices being what they are, this substitution is inverted by x = Py. 

What happens to the argument of the exponential on the left side of (2) is that 


(x, Ax) = (Py, APy) 
= (Py, PAP’Py) 
= (y, P’PAP'Py) (4) 
=(y, Ay) 
=Ayyp + Agvet FAL In 


which is to say that the quadratic form has (scarcely unexpectedly) been reduced to 


a sum of squares. 
Further, since P is an orthogonal matrix, the volume element dx, dx, --- dx, in 
the integral is simply replaced by dy, dy, --- dy,. 
Where we are at the moment is that the integral on the left side of (2) has 
become, after this rotation of coordinates, 


i _— ia evi Ain dy, dy, -*+ dy. (5) 


We pause briefly to express appreciation for the positivity of the A’s, and then we 
observe that (5) is just a product of n one-dimensional integrals, viz. 


fo oT Mt ay, Lee if” oT Ann wy,}. 


Each of these is trivially reducible to (1) above, and the left side of (2) has now been 


found to be equal to 
7 7 7 
VA Va, Va,’ 


The proof of the theorem is completed by noting that the product of the eigenvalues 
of a matrix is equal to its determinant. @ 


If you liked that one, or maybe knew it already, then try this generalization. 


THEOREM 2. Let w be a given vector, and let A be ann Xn real, symmetric, 
Positive definite matrix. Then 


(Q7)"/7 
ydet A 


feo Pe 2 40 dx = e1/% 0, A Nw) (6) 


where ‘{’ means the integral over all of E”. 


Equation (6) is, of course, the calculation of a Fourier transform. 
The appearance of the inverse matrix, A~', on the right side of (6) is quite 
extraordinary, as is the symmetry between the function whose transform is being 
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taken and the shape of the transform itself. If you haven’t ever had the experience of 
seeing this inverse appear before your very eyes then by all means do try the 
derivation. It begins in the same way as the proof of theorem 1, but after the 
rotation of the coordinate axes and the evaluation of the integral, the thing to do is 
not to stop there but to relate the mysterious looking sums that will appear to the 
inverse matrix of A. Be sure to remember that since A = PAP? it must be true that 


LPP s/h =(A™'),5 (4 5=1,n), 


and that is really all you'll need to prove (6). 


The ‘bell-shaped curve’ in n-space. My motives in discussing the integrals (2), (6) 
were not entirely pure (as opposed to ‘applied’). I also wanted to share a fruitful 
application of those integrals with you, namely to the generalization of the ‘bell- 
shaped curve’ to higher dimensional space. 

In one dimension we have the familiar ‘normal curve,’ given by 


— —(1/2)((x=p)/o)? 

f(x) = poe , (7 
Furthermore, in (7), we know the ‘meanings’ of the parameters p» and o: p is the 
average value of x and o” is the average value of (x — p)*. 

Today’s question is: what shall we replace (7) with when we get to n variables 
X = (X,, X5,..., X,)? In other words, what is the form of the n-variable normal 
distribution, and what are the meanings of the parameters that it contains? 

To begin answering this question, let A be an n Xn real, symmetric, positive 
definite matrix, and let 


Vdet A 


(217) n/2 


F(x) _ eh P2)(x, Ax) (8) 


According to (2) above, the integral of F over all space is 1. Since F is 
nonnegative-valued, it follows that F is a probability density function. 
Next, in terms of this F, the Fourier transform relation (6) takes the form 


fe F(x) dx = e7 1/2, Ae) (9) 


The thing to do now is to expand both sides of (9) in a power series in w and 
compare the coefficients. If we do this we find that the left-hand side of (9) is 


n 1 
1-1 Lo, fx,F(x) dx — = Lo, fx,x,F(x) dx ++::, (10) 
r=l1 2 r,s 
whereas the right-hand side is 


1 
LF (A), 500, to (11) 
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If we equate the coefficients of w, on both sides we discover the unsurprising fact 
that 


[x,F(x) dx = 0 (r =1,2,..., 7). (12) 


If we equate the coefficients of w,w, on both sides we discover the possibly 
surprising fact that 


fox, F(x) de = (At), , 9 (r,s =1,2,..., 7). (13) 


The conclusion? With respect to the probability density function F(x) of (8), the 
average value of each of the coordinates x, is 0, and the average value of each of the 
products xX, is (A)... 

Probabilists refer to the average of x,x, as the covariance of the two variables x, 
and x,. Hence we have shown that A~! is the matrix of covariances. Since that is 
the case, why don’t we change its name? Instead of A, let’s call our matrix C7!. 
Then C will be the covariance matrix. One final touch would be to allow for the 
possibility that the variables have mean values other than 0. If p, is the mean value 
of the ith variable and wp is the vector of all of the w,, then we can summarize our 
findings in the form of 


THEOREM 3. Let uw be a given n-vector of constants, and let C be a given positive 

definite n X n matrix. Then the function 
F(x) [-S(-w.c%-w)}) 04 
x) = oS expy — F(x - Bp), X— ph 

(27) * Mdet C 2 
is a probability density function (the ‘normal’ distribution) in E”. With respect to this 
density, the average value of the rth coordinate x, is p, (r=1,...,), and the 
average value of (x, — p(x, — Bs) SC, (7,5 =1,..., 7). 

Do take a minute to let n = 1 in (14) and compare with (7)! 
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What Pythagoras Could Have Done 


YORAM SAGHER 
Department of Mathematics, Statistics, and Computer Science, University of Illinois, Chicago 


Pythagoras allegedly proved that the square root of 2 is irrational. Plato’s 
dialogue Theaitetos tells of proofs by Theodorus of Kyrene of the irrationality of the 
square roots of all integers less than or equal to 17, with the obvious exceptions. 
Why Theodorus stopped at 17 is a subject of lively speculation, but the general tone 
of modern writers (see, e.g., [1]) is to sympathize with the giants of the classical age 
for not having had access to the properties of prime numbers. Had they but known 
that p divides a* implies p divides a, they would have seen through the problem at 
a glance. 

In an issue of the MONTHLY [2], W. C. Waterhouse discusses a plausibility 
argument for the square root of k being either an irrational or an integer, based on 
the assumption that the properties of prime numbers are as mysterious to our 
students as they were to the Greeks. However, the plausibility argument is but a 
rewording of the usual idea. 

In preparing a talk to high school students, the following proof occurred to me. 
The main point of the proof is that it does not depend on properties of prime 
numbers, and so was fully accessible to Pythagoras and to Theodorus. It is also 
accessible to high school students after one year of algebra. When shown to a 
number of bright ninth-graders, 1t caused some excitement. 

Suppose Vk = m/n, where m and n are integers with n > 0. If k is not a square, 
there exists an integer g so that g < m/n < g + 1. Now m? = kn? implies m(m — 
gn) = n(kn — qm) and, hence, m/n = (kn — qm)/(m — qn). From g < m/n < q 
+ 1 we get 0 < m— qn < n. Therefore we have: 


Vk = (kn — qm)/(m — qn), 


where the denominator is positive and smaller than the one in the original fraction. 
Continuing, we get an infinite decreasing sequence of positive integers, an impossi- 
bility. 
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A Question of Protocol 


I. BOUWER* and Z. STAR 
Department of Mathematics and Statistics, University of New Brunswick, Fredericton, Canada E3B 5A 3. 


1. Introduction. Relative to any ordering of the n vertices of the path graph 
aq(n),n > 2, the greedy coloring algorithm produces a 2-coloring or a 3-coloring. 
We determine the number of vertex orderings that result in 2-colorings. We also 
solve the corresponding problem for the cycle graph y() on n vertices, n > 3. 

Let G be any finite simple graph, and S any ordered set whose elements will be 
called colors. (For graph-theoretic terms not defined below, we refer the reader to 
[1].) For any given ordering I of the vertices of G, the greedy coloring of G relative to 
I’ is the coloring of G found by considering the vertices of G in turn, in the order 
provided by I’, and assigning to each vertex the lowest possible color, relative to the 
order on S, such that no two adjacent vertices are assigned the same color. Note 
that if the maximum degree of the vertices of G is equal to d, then any given 
ordering of the vertices of G results in a greedy coloring with at most d + 1 colors. 
An r-color vertex ordering of G is an ordering of the vertices of G relative to which 
the greedy coloring of G uses exactly r colors. If G has n vertices, then a vertex 
ordering of G will be indicated by a one-to-one assignment of the numbers 
1,2,..., to the vertices of G. 

For any natural number n > 2, let z(n) denote the labeled path graph on n 
vertices (see Figure 1). 


a(n): QO——-O-——_ --- —o 
v(1) v(2) v(n) 
Fic. 1 


The labeled cycle graph on n vertices, y(n), is the labeled graph consisting of the 
vertices and edges of an n-gon. Since the vertices of either graph have maximum 
degree equal to 2, it follows that any vertex ordering results in a greedy coloring 
with just two or three colors. 

An interesting and unexpectedly difficult problem (raised in [2]) is to determine 
the number of 2-color vertex orderings P(n) of a(n), and C(n) of y(n). The 
problem may be envisaged as follows: The chairs in a royal banquet hall are 
arranged along one side of a rectangular table (or around a circular table with an 
even number of chairs), and colored alternately red and blue. The king arrives first 
and takes any seat. The other members of the court arrive in descending order of 
rank and can sit anywhere, except that protocol requires a person to be seated in the 
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same colored seat as the king or else next to somebody already seated. If there are n 
persons altogether, and n seats, what is the number of possible seatings? 

We shall let O(n) (or E(n)) denote the number of 2-color vertex orderings of 
a(n) in which the number 1 is assigned to a vertex in an odd (or even) position. 
Thus, O(n) + E(n) = P(n). In Theorem 1, we derive simple recurrence relations 
for O(n), and, in Section 3, explicit solutions for O(n), and thence for P(n) and 
C(n) (see Proposition 1), generating functions yielding in terms of Bernoulli 
numbers. 


2, Recurrence relations. Parts (i) and (ii) of the following Proposition appear in 
[2]; however, the proof given here is shorter. 


PROPOSITION 1. For n > 1: 

(i) P(2n) = 20(2n), 

(ii) P(2Qn + 1) = 2n(2n + 1)O(Qn — 1), 
(il) C(2n + 2) = (Qn + 2)P(2n + 1), and 
(iv) C2Qn + 1)=0. 


Proof. Note that O(2n) = E(2n), since an odd position when counted from the 
left is an even position when counted from the right. This implies (1). (iv) is 
immediate. To prove (iti), we note that in a vertex ordering of y(2n + 2), any one of 
the 2n + 2 vertices can carry the label 2n + 2, while the first 2n + 1 labels 
determine a 2-color vertex ordering of 7(2n + 1). To prove (ii), we let two vertex 
orderings of y(2n + 2) be equivalent if one can be derived from the other by a 
rotation. One then obtains a bijection between the set of equivalence classes of 
2-color vertex orderings of y(2n + 2) and the set of 2-color vertex orderings of 
a(2n + 1) by deleting the vertex in the cycle that is assigned the number 2n + 2, 
and taking the remaining vertices in clockwise order, say. But in any vertex ordering 
of the cycle, the two vertices on either side of the vertex to which the number 1 is 
assigned may be labeled in 2”(2n + 1) ways, and the number of those orderings of 
the remaining 2” — 1 vertices that result in a greedy 2-coloring of y(2n + 2) is 
equal to O(2n — 1). H 


THEOREM 1. (i) For n > 2, 
n—-l 
O(2n +1) =4n0(Q2n-1)4+4¥. GyalaG ~ k)O(2k — 1)O(2n — 2k — 1) 
k=1 
and (ii) for n > 3, 
O(2n) = (2n — 1)O(2n — 2) + (2n — 1)(2n — 2)O(2n — 3) 
n—2 
23 Pn ')k(2n ~ 2k — 1)0(2k — 1)O(2n — 2k — 2). 
k=1 
Proof. We prove (i); (ii) follows similarly. The determination of O(2n + 1) 
reduces to two cases. In the first case, the number 1 is assigned to an end-vertex 


v(1) or v(2n + 1). When 1 is assigned to v(1), the vertex v(2) can be assigned any 
one of the 2” remaining numbers, and, in each instance, the number of orderings of 
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v(3),..., v(2n + 1) that result in 2-colorings is O(2n — 1). By the reflective symme- 
try of 7(n), we arrive at a similar conclusion when 1 is assigned to v(2n + 1). This 
case then accounts for the term 4nO(2n — 1). In the second case, 1 is assigned to an 
interior vertex v(2k + 1), 

l<k<n-l. 
The set of 2k numbers assigned to the first 2k vertices v(1),...,v(2k) may be 


| 
2k 


ways. For each such selection, v(2k) may be assigned any one of the 2k numbers 
selected, and the number of orderings of the first 2k — 1 vertices resulting in 
2-colorings of a(n) is then O(2k — 1). The vertex v(2k + 2) may be assigned any 
one of the remaining 2” — 2k numbers, and then the number of orderings of 
v(2k + 3),..., v0(2n + 1) resulting in 2-colorings of 7(n) is equal to O(2n — 2k — 
1). 


Since, in a 2-color vertex ordering of 7(”) or y(n), any neighbor of the vertex 
assigned 1 may be assigned any one of n — 1 labels, we remark that n — 1 divides 
each of E(n), O(n), P(n), C(n). 

The following table gives the values of O(n), P(n), and C(n), for n < 10: 


n O(n) P(n) C(n) n!} 
] 1 1 — ] 
2 ] 2 — 2 
3 4 6 0 6 
4 9 18 24 24 
5 56 80 0 120 
6 185 370 480 720 
7 1632 2352 0 5040 
8 7217 14434 18816 40320 
9 81664 117504 0 362880 
10 451089 902178 1175040 3628800 


3. Explicit solution. It is easy to obtain explicit generating functions for these 
sequences. Let 


G(t) = O(A)t/1! + O(3)3/3! + «+ 
Multiply (i) of Theorem 1 by t7"/(2n)! and sum from 2 to oo, to obtain 


G'(t) = (tG(t) +1)’. (1) 
The substitution 
y=-14+1t+1/G(t) 
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reduces (1) to the Bernoulli type differential equation 
y'+2y+y*=0, 


with solution: 


y = 2/(e*— 1). 
Thus 
1 1 ; 2t 
Gn 77 t— + 
Similarly, with 
2 4 


H(t) = 0(2)= + 0(4) = forse, 


one obtains, from Theorem 1: 
H’(t)- (t+ ?G(t)) A(t) =¢ + E(t). (2) 


The solution of the first order linear differential equation (2) is 


H(t) = =1 + ePexp{ ['r°6(r) dr)} 
0 
t? O(1) t4 = O(3) ¢° 
= —1+ exp} — + ——— + ——— = 
2 l! 4 3! 6 
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Examples of Infinite, Incongruent Exact Covers 


JOHN BEEBEE 
Department of Mathematical Sciences, University of Alaska, Anchorage, AK 99508 


Stein [1] asks: “If the set of integers is represented as the union of a denumerable 
number of (pairwise disjoint) incongruent arithmetic sequences (d;: b;), 1 <i < 0, 
and Yd>' = 1, does it follow that d,; = 2' ?” I show by example that the answer is 
“no,” and pose 3 similar problems. 

The symbol (d: b) represents the arithmetic sequence {x: x = b+ ad, a€ Z}, 
where 5 is the initial position, d is the modulus of the sequence. A finite collection 
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of arithmetic sequences {(d;: 5,)} with the property that each integer belongs to 
exactly one sequence is called an exact cover. For every exact cover, Yd>' = 1. 

A collection of arithmetic sequences is incongruent if no two moduli are the 
same. There are no finite incongruent exact covers (see [3]). A collection of 
arithmetic sequences satisfying the conditions in Stein’s problem can be called an 
infinite incongruent exact cover, or ITEC. A collection that satisfies all the condi- 
tions except incongruence is called an IEC. A summary of results about exact covers 
and related systems is given in [2], which has an extensive bibliography. 

It can easily be verified that the set of arithmetic sequences {(2': 2'~!):1<i< co} 
is an ITEC, so Stein’s problem asks if there is an IEC with different moduli from 
this one. I call this particular ITEC the infinite Gray cover, because of its close 
connection with the binary Gray code. 


DEFINITION. The Gray cover with parameters p and k has moduli 


Dy Dyvvey Dh Pry Dryeeey Poy ss De, py... p* p* 
-~k-17><k-1> <-~k-1-> 
and corresponding initial positions 


1,2,...,p —1; p,2p,...,(p —1)p;...3p*"1,2p*"1,...,(p — 1) p*74; p*. 

The Gray cover with parameters p and k is an exact cover. If we let p = 2 and 
imagine k to be infinite, we get the previously mentioned infinite Gray cover. If we 
let p = 3 and imagine k to be infinite, then each integer belongs to precisely one of 
the arithmetic sequences 


{(3':3'!): 1 <i < co} VU {(3!:2-3'"):1<i< o}, (1) 


and 


so (1) is an IEC. 


Construction. Let C, = {(d,: ;)}, CG, = {(e,: ¢;)} be exact covers. Then C, = 
{(d;:b,):i #I} VU {(e,d,: b; + c,d;)} ts an exact cover. I call C, the exact cover 
formed by covering the /th sequence of C, by C,. 

This construction can be extended somewhat. If C, or C, is an IEC, so 1s C3. 
Also, the construction can be applied repeatedly and iteratively (C, = C,). 


Proof. To prove that C,, constructed as described, really is an exact cover, let x 
be an integer. Since C, is an exact cover, there is a unique index i and number «a 
such that x = b, + ad,, so if i#J then x €(d,:b,)& CG. If i=TJ, then x = 
b, + ad,, and C, is an exact cover, so there is a unique index j and number f such 
thata=c, + Be,, and, hence, 


x = b, + (c, + Be,)d, = 6, + cd; + Ble,d;). 
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Thus, x © (e,d,;: b; + c,d;) © C;. Since the indices i and j are unique for each 
x, x belongs to precisely one sequence in C,. Thus C, is an exact cover. 

If we cover each of the sequences (3': 3'~/) in (1) by the infinite Gray cover, we 
get the collection 


{ (3/24: 3/1 + 2771-3): 1 <j <oco,l<i< oo} U {((3':2-3!):1<i<o}. (2) 


The method of construction suggests that (2) is an IEC, and in fact it can be 
verified that it is an ITEC with moduli different from the infinite Gray cover. 
Obviously the smallest modulus is 3 and 3 is a divisor of all the moduli. 

Using the same idea, we can construct an ITEC with smallest modulus 4, but 4 
does not divide all the moduli. We begin with the infinite Gray cover {(2':2'~'): 
1 <i < o} and cover the first sequence (2:1) with the ITEC (2), getting the IEC 


{(2-3'- 24:1 42-31 +23/):1<j<0,1l<i< o} (3) 
U{(2- 34:14 4-3) "):1 <i < oo} U {(2):2'"):2 <i < ow}, 


which is seen to be an ITEC. 

Using the ideas outlined here, it is possible to construct an infinity of IIECS, all 
with smallest modulus 2, 3, or 4; and none with 4 a divisor of all the moduli. 

I have not been able to solve the following problems. 


1. Construct an TIEC with smallest modulus larger than 4. Note that if the 
smallest modulus is 5 or any prime, then it divides every modulus. In fact, it can be 
shown that if the smallest modulus is a prime p, then there are IIECs with smallest 
modulus gq, for all g < p. 

2. Construct an ITEC for which 1 is the greatest divisor of the moduli. 

3. Let 


dig = 2'3/, 1<i,j<o, 
do jy = 3/5*, l<j,k < 0, 
dion = 2'5*, 1<i,k <0, 


dij. = 2'3/5", li, j,k < @. 


Then Ld;,, = 1. Find 5, ,, so that (d; ,,: b;;,) is an IIEC, or show that none exist. 


I 


A positive answer to this would solve 1 and 2. 
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THE TEACHING OF MATHEMATICS 


EDITED BY JOAN P. HUTCHINSON and STAN WAGON 
The Absolute and Uniform Convergence of Infinite Improper Integrals 


DaviD J. HALLENBECK AND KATARZYNA TKACZYNSKA 
Department of Mathematical Sciences, University of Delaware, Newark, DE 19716 


In standard advanced calculus texts the most common test for establishing the 
uniform convergence of an infinite improper integral is the Weierstrass M-test: 
Suppose f(x, t) is Riemann integrable over [a,c] for all c > a and all x € [a, B]. 
Suppose that there exists a positive function M(t) defined for ¢ >a such that 
| f(x, t)| < M(t) for t > a and x © [a, 8], and such that /{°°M(t) dt exists. Then for 
each x €[a, B], [2f(x,t) dt is absolutely convergent and the convergence is 
uniform on [a, £] [1, p. 268], [3, p. 348]. 

The Weierstrass M-test also gives the uniform convergence of /,%| f(x, t)| dt. For 
this reason students often assume that if /{°f(x, 7) dt converges uniformly and 
absolutely then /{,%| f(x, t)| dt also converges uniformly. In this note we present an 
elementary example to show that this assumption is in general false. The following 
mean value property is the main tool in the proof of our theorem. 


LEMMA. Let f be a positive decreasing function on [0, 00) and let g be defined on 
[O, 00) by 


(x) ={ 1 *k<x<2k+1 k =0,1,... 
g -1 2k-l<x<2k k=1,2....’ 


Let [a, b] be contained in {0, 00). Then there exists a real number c & [a, b] such that 


[800 at = f(a) f(s) ae (1) 


Proof. Let 
[a0 ae 
ofa) 


Since g is integrable on [a, b] we know that G(x) is a continuous function of x on 


[a, b] [2, p. 133]. Hence G has the intermediate value property and (1) will hold if 
we show that 


and G(x) = Or 


min G(x)<A< max G(x). (2) 
xE[a, b] x€[a, b] 


We call the intervals [2k,2k +1) even (k =0,1,...) and the intervals 
[2k — 1,2k) odd (k = 1,2,...). If a = b then (1) holds trivially so suppose a # b. 
Clearly either a belongs to one of the even intervals or one of the odd intervals. 
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We first consider the case when a belongs to an even interval. Let a* be the 
smallest natural number larger than a, and let d= a* — a. It follows from the 
definition of g and the fact that f is decreasing that 


—(1= a) sat) < f'f(n)g(o) at < af(a). (3) 
Since —f(a*) > —f(a), (3) implies 
(—1 + d)f(a) < f'f(1)g(e) dt < af(a), (4) 


and so we have —1+d<A<d.If G attains the values —1 +d and d in the 
interval [a, b] then (2) holds and the proof is complete. 


We next consider the case when max, <;4 ,;G(x) < d and (or) min, </,, ,)G(«) 
> —1+d. Let b* = a* + 1 and note that if x € [a, b] then -1+ d< G(x) <d. 
Furthermore, b > b* implies that G attains both —1 + d and d. Therefore we need 
only consider the case when b € (a, b*). Note that 0 is always one of the values 
attained by G. 

Assume max, <;,,5)G(x) = D and 0 < D < d. Then b < a® since if b > a* the 
maximum value of G is d. Hence b — a = D and this implies that {’f(t)g(t) dt < 
Df(a). So we have A < D. 

Now assume min, 2), »;G(x) = D where —1+d< D <0. Then b & (a*, b*) 
and if we let dG = a* + d we have b — @ = — D. This implies that {?f(t)g(t) dt > 
Df(a), and so A > D. 

It follows that if —1 + d is attained by G and D <d then -—1+d<A<D. 
Also, if d is attained by G and D > —1+d, then D < A < d. Finally, if D <d 
and D> —1+d wehave D < A < D. Hence, 

min G(x) <A< max G(x), 
x€[a, b] x€[a, b] 
and the proof is complete in the case a belongs to an even interval. 

When a belongs to an odd interval substitute —g for g and apply the even case 

to obtain 


PUM) 8(9} d= f(a) f(-8(0) at 5) 


for some c € [a, b]. This is equivalent to (1) and completes the proof. 


The following theorem gives an example of a function f(x, t) defined for ¢ > 0 
for which /{5°f(x, t) dt converges uniformly on x > 0, /5°| f(x, ¢)| dt converges on 
x > 0, but the convergence of this last integral is not uniform for x > 0. This cannot 
occur in the cases where the Weierstrass M-test applies to /5°f(x, t) dt since in these 
cases we have {5°| f(x, t)| dt converging uniformly. 


THEOREM. If g(t) is defined as in the previous lemma, then [@xg(t)/(x* + t*) dt 
converges uniformly for x > 0 while {[@\xg(t)/(x* + t*)| dt converges but not uni- 
formly on x > 0. 
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Proof. Let 0 < a < b and note that by the previous lemma we have 


[pal dt = a flv) (6 


for some c € [a, b]. Since | {£g(t) dt| < 1, (6) implies that 


[asl at < (7) 


Since x/(x* + a”) <1/2a for all x > 0, it follows from (7) that given any e > 0 
there exists a real number Q such that if a > QO then 


b x ( d 
t t 
lap P4728 ) 


It follows from the Cauchy criterion for uniform convergence [1, p. 267] that 
[exg(t)/(x? + t?) dt converges uniformly on x > 0. It is easy to verify that if 
F(x) = [@x/(x* + t*) dt then F(0) = 0 and F(x) = 7/2 if x > 0. Since F(x) is 
not continuous on [0, 00) it follows that the convergence of [%°|xg(t)/(x* + t*)| dt 
is not uniform on [0, co) {1, p. 270]. This completes the proof. 


~ x2 4 gq? 


<€. (8) 
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Special Integration Techniques for Trigonometric Integrals 


ASHOK K. ARORA, SUDHIR K. GOEL, AND DENNIS M. RODRIGUEZ 
Department of Applied Mathematical Sciences, University of Houston- Downtown 
Houston, TX 77002 


In this paper we present two basic properties of definite integrals. These 
properties are in common use and can be easily proved. We use these properties to 
evaluate certain complicated integrals some of which are normally evaluated using 
contour integration. 


Property 1. If f is an integrable function over the closed interval [0, a], then 
a a/2 a/2 
[oP (<) de = [OP () de + [F(a = x) ae. 
0 0 0 


The following two facts are immediate from property 1. (A) If f(a — x) = f(x), 
then 


[ P(x) dx = 29" P(x) des 
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and (B) If f(a — x) = —f(x), then 
ffs) dx = 0. 
0 


One should observe that a function satisfying the hypothesis of fact (A) is 
symmetric over the interval [0, a] across the line x = a/2, and a function satisfying 
the hypothesis of fact (B) is symmetric over the interval [0, a] through the point 


(a/2, 0). 


Property 2. If f is an integrable function over the closed interval [0, a], then 
[tx) dx = [ta — x) dx. 
0 0 


We illustrate the use of the above properties with several examples. 

The first example we present appears in most complex analysis books (see [3], 
[4]). The integral in this example 1s generally evaluated using contour integration 
and the procedure is quite involved (see [4, pp. 113-114)). 


Example 1. 


ere) In x 
il 5 dx = 0. 
o l+x 


Solution. If we let I denote the value of this integral, then using the substitution 
u = 1/x and simplifying using In(1/u) = —In u, we obtain 
o Inu 


[= —- 5 du= —T, 
o ltu 


whence I = 0. 
An alternative way to solve example 1 is to use the trig substitution x = tan @ 
and then use property 2. 


Example 2. 


ore) In x T 
a 
0 (1+ x?) 4 


Solution. Let x = tan@; then dx = sec*6d6. Thus 


00 In x a/2 In(tan 6) 5 
r= | aon dx = f ectg S000 0 


= [7’’cos6 In(tan 0) dé 
0 


_ i 
, 


ft n/2 qt a /2 
= 7 In(tan 8) dO + sf cos 20 In(tan 8) dé. 


1 + cos26 
2 


jintan 0) do 
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Now, {%/* In(tan 6) d@ = {7/* In(sin 8) dé — {5/* In(cos 0) dO = 0 by property 2. 
We evaluate the second integral by parts, letting u = In(tan@) and dv = cos 26d. 
Then 


sec? 2 1. 
du = and d@ = an2d d@ and v= 5 sin 26. 
Therefore, 
[= zsn26in(and)}) ~ 5 [7sn2 26. ab 
4 n26 


Using L’HOpital’s rule, we obtain J = 0 — 7/4 = —7/4 as desired. 


Examples 1 and 2 above appear as problems in [3, p. 183]. The hint provided to 
solve these problems is to use the branch 


7 39 
7 


Inz=Inr+i0 [r>0-T<a< 3 


of the logarithmic function, and integrate the functions 


In z In z 
a «and —~— 
z“ +] (z* +1) 


around the closed contour shown below: 


Our next example appears in the book Complex Analysis by Ahlfors [1, p. 159]. 
Its evaluation using contour integration is fairly involved. We noticed that this 
integral is important since it is the main result needed in proving Jensen’s formula 
(see [1, p. 205]). The solution given in example 3 below also appears in [5]. 


Example 3. 
foin(sin 6) d6 = —mln2. 
0 
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Solution. Using (A) above, we get 
[inGind) d@ = 2("”" In(sin 8) dé. 
0 0 
Using property 2 above, we obtain 
[= [O in(sin 6) do = [° in(cos 6) dé. 
0 0 
Therefore, 


21 = [°" TinGin 8) + In(cos )| dé 
0 


7 1 
-f in| sin 20 | d0 
0 2 
= ["in(sin26) do - [""n240, 
0 0 
Substituting ¢ = 26 in the first integral yields 
1 7 WT 
2] = al In(sin ¢) dt — 3 In2. 
Using (A) again, we have 
_ mn /2 . a 7 a 
21= f In(sin ¢) dt sIn2=I 5 In2. 
Therefore, J = —(7/2)ln 2. Hence, 
[nin d) dé =2I = —rln2, 
0 
as desired. 


Our next example was proposed as a problem by Suresh Ailawadi [2, p. 439]. A 
solution, different from the one we present below, appeared in the September 1985 
issue of the College Mathematics Journal. 


Example 4. Evaluate 


Sani 

n/2 Sin x 
r= f Sanit nt 
0 sin’x + COSs"x 


Solution. Using property 2 above, we get 


Sail n 

n/2 sin x n/2 COS XxX 
IT =f _- hl hl dx =f yo ; hl 
0 sin’x + Cos’x 0 cos’ x + sin'x 


It follows that 


7 T 
2I= / /? lx = 3 > Whence I = 
0 


| 3 


Note that J is independent of n. 
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There exist many other integrals that can be evaluated using properties 1 and 2, 
three of which are listed below: 


font + cos @) dé, 
0 


fom 
0 


7 a”sin?x + b"cos2x 


9 a-"sin?x + b*"cos*x 


] 
x+- 
x 


dx 
1+ x?’ 


dx. 
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E 3249. Proposed by Toma Albu, University of Bucharest, Romania. 


Show that, for all primes p and all natural numbers m, the sum 
m k 
y 


k= 1 + k(p—1) 
is not an integer. 


E 3250. Proposed by Lennart Bondesson, Umea, Sweden. 


Suppose N > 1 and a, <a, < -::: < ay are given real numbers. Consider the 
partial fraction decomposition 
NJ NA, 
j=1 A; + xX 7 d a,+ x 
It is familiar that the numbers 4,, A4,,... alternate in sign. What can we say about 


the signs of the partial sums A,, A, + A,, A, + A, + Az,..., Ay + Ad t+: +Ay? 


E 3251. Proposed by Clark Kimberling, University of Evansville, Indiana. 


Given a triangle ABC, it is possible to construct three circles Ty, [,, [. inside 
ABC in such a way that each one is externally tangent to the other two, I, is 
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tangent to the two sides AB and AC, I 1s tangent to the two sides BC and BA, 
and I. is tangent to the two sides CA and CB. (Cf. A. S. Winsor, Modern Higher 
Plane Geometry, Christopher Publishing House, Boston, 1941, pp. 206-209 or H. G. 
Forder, Higher Course in Geometry, pp. 244-245.) Let D be the point of contact of 
l, and [., & the point of contact of [. and [,, and F the point of contact of T, 
and Ip. 
(a) Prove that the lines AD, BE, and CF are concurrent. 

*(b) Prove that the lines A’D, B’E, and C’F are concurrent, where A’, B’, and C’ 

are the excenters of the triangle ABC. 


E 3252. Proposed by David G. Winslow, Louisiana State University, Baton Rouge, 
Louisiana. 


(a) Suppose that h: R — R is continuous and that h” is bounded for some 
positive integer n, where h?(x) = h(h(x)), h?(x) = h(h(h(x))), etc. Prove that 
either h or h? is bounded. 

(b) Show that for each positive integer n there is a continuous function 
h: R* —> R? such that h* is unbounded for 1 < k <n — 1 but h” is bounded. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Brocard Revisited 


E 2905* [1981, 619]. Proposed by R. J. Stroeker, Erasmus Universiteit, Nether- 
lands, 


Inside any triangle with vertices A, B, and C, a point P exists such that 
«PAB = «PBC = x«PCA =:w. The point P is called a Brocard point and the 
angle w is called its Brocard angle. If a, 8, and y are the angles of triangle ABC, 
then prove the inequalities: 


wit<a t+ Bo! + y! < ze and ro <a 7+ B~? + y? <w’. 


Editorial Comment. Partial solutions were received from P. S. Bruckman, H. H. 
Guggenheimer, and the proposer. Later the following relevant paper appeared: 


R. J. Stroeker and H. J. T. Hoogland, “Brocardian geometry revisited or some 
remarkable inequalities,’ Nieuw Archief voor Wiskunde (4) 2(1984), 281-310. 


This paper gives complete proofs for the two assertions: 


a + Bo + y > wt, a2 + B~? + y < wy? 
and provides numerical evidence in support of the other two inequalities 
3 ; 3 
ao} + Bo + y < 58 a ~+ B~? + y? > 7° 
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These last two conjectures were proved by V. Mascioni in the following two 
papers: 

Zur Abschatzung des Brocardschen Winkels, Elem. Math. 41 (1986) 98-101; 

Zur Abschatzung des Brocardschen Winkels II, Elem. Math. 42 (1987) 35-42. 

Improved versions of the inequalities of the problem have been obtained in a 
forthcoming paper of F. F. Abi-Khuzam and A. B. Boghossian. The relevant parts 
of their results may be summarized as follows: 


3 
(1) wo 143-4 <cail+t Bo} ae y} < 5e 3 
3 


equality holds in each case if and only if the triangle is equilateral. They also show 
that there exists a number 6 € (1, 2) such that 


a+ BpX+y- <3(2w) * forall AeE[0,d]; and 
a+ B+ y-> > 3(2w) * forall A¢ (0,8). 


Sums of Consecutive Square Roots 


E 3010 [1983, 483]. Proposed by F. David Hammer, Redwood City, CA. 
Prove that for all n > O the relation 
| nl/? +(n+1)'7+4+(n+4 2)77| = |(9n + g)'/7| 
holds. Here [-] denotes the greatest integer function. 


Solution I by J. C. Binz, University of Bern, Switzerland. Let x = yn + ¥n +1 + 
yn + 2. Then 


x*°=3n+3+4 2(/n(n +1) + yn(n +2) + f(n+1)(n + 2) ). 


For n> 1 the inequalities (n + 2/5)? < n(n + 1) < (n+ 1/2)’, (n+ 7/10)? < 
n(n + 2) <(n+4+ 1)’, and (n+ 7/5)? < (n + 1)(n + 2) < (n + 3/2)? lead to 9n 
+8 <x? <9n+9, and, therefore, [x] =[V9n + 8]; the case n = 0 is verified 
directly. 


Solution II by James Propp, Berkeley, California. The relation is easily checked 
by hand in the cases n = 0, 1, and 2. When n > 3, we have 
3 


8 
n(n +1)(n + 2) > n+ > 


(straightforwardly verifiable by estimating the roots of the quadratic polynomial 
(x — 1)x(x + 1) — (x — 1/9)°), so that by the arithmetic mean-geometric mean 
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inequality, 
vn +¥nt+1+yn+2 3 
> Winn Tn 2 
> Vynt — 
9 
and 


vn +¥nt+1+yV¥nt+2>yon+8. 


But by the root-mean-square inequality, 


vn +y¥nt+1+ynt+2 n+(n+1)+ (n+ 2) 
CO —————— 
3 3 


and 


vn t¥n+1+vVn+2 <V9n+9. 


Since no integer can lie between V9n + 8 and V9n + 9, both yn + Vn +1 4+ 
vn + 2 and y9n + 8 must have the same integer part. 

A solution involving the application of Taylor series methods (for yn + x ) was 
submitted by O. P. Lossers. A. M. Rockett showed that both V9n + 8 and yn 
+yn+1 + yn + 2 (but no integer) lie between y9n + 7 and y9n + 9. A. Tissier 
(France) showed that if a, = L__,(n + q)'” and b, = {(2k + 1)*n — 1}'”, then 
[a,,] = [6,] for all sufficiently large n. In the last assertion P. D. McCray showed 
that the powers 1/2 and 2 could be replaced with r and 1/r, respectively, and R. B. 


Nelsen showed that n + g could be replaced with n + hq where h is an integer. 


Also solved by 39 other readers and the proposer. 


Ups and Downs 


FE 3042 [1984, 257]. Proposed by A. M. Fink, Iowa State University, and Max 
Jodeit, Jr., University of Minnesota. 


Which real-valued functions / defined on [0,1] can be expressed as h(x) = f(x) 
+ g(x), where f(x) > 0 is nonincreasing, and g(x) > O is nondecreasing? 


Solution by the proposers. A function fh can be so expressed if and only if 
V(h) < h(QO) + AQ), where V(/) is the variation of h on [0, 1]. 

If the condition holds, the decomposition h(x) = (h(0) — N(x)) + P(x) works 
(P(x) and M(x) are, respectively, the positive and negative variations of h on 
[O, x]). For PQ) + NQ) = V(A) < h(O) + kh) = 2h(0) + PCL) — N(1), which im- 
plies N(x) < NQ) < h(O0). This decomposition is the only one with g(0) = 0, 
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fQ) = 172)(h(0) + AQ) — V(A)) (by the minimum property of the Jordan decom- 
position). 
If h =f + g, as in the problem, then 


V(h) < V(f) + Vg) = f(0) — FQ) + g(Q) — g(0) 
= h(0) + h(1) — 2f(1) — 2e(0) < h(0) + h(A). 


Thus, if V(h) = h(O) + AQ), g(0) = 0 = f(1), so f and g are unique in this case. 
These functions may oscillate; (1/2)(1 + cos? 27x) is an example. It is always 
possible to add cx to g(x), c > 0, and subtract cx from f(x) to get h(x) = f(x) 


+ Zo(x) with go(0) = 0 = fo(1). 


Also solved by G. Behrendt (West Germany), A. Bondesen (Denmark), R. Davies, T. Jager, B. G. 
Klein, M. Meyerson, W. A. Newcomb, E. Ordman, R. Pemantle, S. Philipp, R. Shepler and V. Konecny, 
J. C. Tripp, U. of South Alabama Problem Group, and P. Y. Wu (Republic of China). 


A Minimum Length by Fours 


E 3049 [1984, 437]. Proposed by Jordi Dou, Barcelona, Spain. 


Determine a planar region of area 4 which can be partitioned into four subre- 
gions of unit area in such a way that the total length of all bounding arcs is a 
minimum. For example, a square of area 4 can be partitioned into four unit squares 
so that the bounding arcs have total length 12, while a circle of area 4 can be 
partitioned into four sectors of unit area so that the total length of the bounding 
arcs 1S 


(4 + 8)/Va = 11.6. 


Solution. The only solution that appears correct was that of the proposer. He 
asserts that the minimizing figure must satisfy the following conditions. 

(i) All the boundary arcs in the minimizing figure must be straight segments or 
arcs of circles. 

(ii) At each vertex there meet three arcs forming equal (120°) angles. 

(111) The appropriately signed curvatures of the three arcs meeting at a vertex sum 
to zero. 

(iv) The figure has two perpendicular axes of symmetry. 

The first three conditions are shown to be necessarily satisfied by an optimal 
figure in the paper “Isoperimetric problem with application to the figure of cells” 
by H. P. McKean, M. Schreiber, and G. H. Weiss, Journal of Mathematical Physics, 
6(1965) 479-484. The fourth condition was not proved, but was supported by some 
heuristic reasons. 
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Assuming that the minimizing figure is as in the diagram below, its proportions 
are calculated as follows. 


Let O, and O, be the centers of PA and PB and r1, ’ the respective radii. Let 7, 
be the radius of MP and 2e (in degrees) the central angle subtended at center QO. 
Assume for the while that chord MP has length 1; the figure will be renormalized 
below. 

Then 7, = 1/(2sin e€), r, = sin(60 — e)/sin(90 — 2e). From (ii) we have 1/r; = 
l/r, — l/r, sor, =7r,/ — 2r, sin e). 

Let a be the area of the region MAPM, so a = S, + T, + o, where S, is the area 
of the sector AO,P, T, the area of triangle O,PM, and o the area of the segment 
bounded by MP and MP. Let B be the area of region OMPBO = S, + T, + T—«, 
where S, is the area of sector BO,P,T, that of triangle PO,M, and T that of 
triangle O,OM. 

Then S, = ar7(45 — €)/180, T, = sin(30 — e)r,/2, o = (we — 90 sin 2e)r7/180, 
S, = ary(30 + €)/180, T, = sin(30 + e€)r,/2, T = [sin(60 — e) — r, sin(30 — 2e)] - 
[r, sin(60 + 2e) — sin(30 + €)]/2. The condition a(e) = B(e) implies e = 
1°92349794. The total area S = 8a = 86 = 5.99867234. The total length 


L = 4(AP + BP + MP) + 20M 
= 4[ar,(45 — €)/90 + ar,(30 + €)/90 + 232/90] 
+2[r, sin(60 + 2e) — sin(30 + e)]. 


L = 13.70068728. The total length A for a total area 2 =4is A= /4/S -L= 
K - L = 11.1878022. 
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The corresponding parts of the rescaled figure (with the same lettering) are K 
= MP = 0.816586932; OM = 0.22949299; MO, = 0.385194625; OO, = 0.36835908; 
r, = 0.694647354; r, = 0.736718161; r; = 12.1642382. 

(a(e) = B(e) was solved with a pocket Casio FX-702P.) 

Three nonoptimal configurations were also received. 


Where J Is 


E 3071 [1985, 58]. Proposed by K. Satyanarayana, Hyderabad, India. 


If in triangle ABC, angle C > angle B > angle A, then prove that J lies inside 
triangle OBH where, as usual, O, 17, H denote the circumcenter, incenter and 
orthocenter, respectively, of triangle ABC. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Let a, 8, and y denote the angles of the triangle. Because AH | BC and BH 1 AC, 


we find 


1 
ZCAH = ZCBH = 57 — "| 
Furthermore, because ABO is isosceles, 
1 
ZBAO = ZABO = 5 (a —2y)|= 57 "| 


Therefore, AJ is simultaneously the angular bisector of ZCAB and of ZHAO. In 
the same way, BI is the angular bisector of ZCBA and 27 HBO. Let AJ intersect OH 
in P and let BI intersect OH in Q. Evidently OA = OB, and AH > BH because 
a < £8. Now we apply the well-known partition theorem of angular bisectors 
AH: AO= PH: PO, BH: BO = QH: QO. 

So we see that PH > QH, or that O, P, QO, and H are in that order on OH, i.e., I 
lies inside triangle OBH. 

Also solved by L. Bankoff, J. Dou (Spain), N. Elkies, J. Fukuta (japan), W. Janous (Austria), L. 


Kuipers (Switzerland), W. A. Newcomb, C. R. Pranesachar (India), M. Vowe (Switzeland), and the 
proposer. 

It was brought to the attention of the editors that this problem appeared as Problem 794 in Math. 
Mag., where it was proposed by L. Bankoff. Also see the article by A. P. Guinand, “Euler Lines, 
Tritangent Centers, and Their Triangles”, this MONTHLY, 91 (1984) 290-300. 


An Integral Root Test 


E 3072 [1985, 58]. Proposed by Louis Funar, Craiova, Romania. 


Let P(x) =a) + a,x +--+ +a,x” be a real polynomial of degree n > 2 such 
that 
[n/2] 1 
O< < — ; 
“0 Xu 2k +1074 


Prove that P(x) has a real zero r such that |r| < 1. 
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Solution by K. F. Andersen, University of Alberta, Edmonton, Canada. Since 


[7/2] 
f P(x) dx = ¥Y. 2a,,/(2k +1) 
—i k=0 


the hypothesis shows that the integral of P(x) over the interval (—1, 1) is negative. 
It follows that P(x) < 0 for some x € (—1,1). Since P(O) = a, > 0, the Inter- 
mediate Value Theorem shows that P(x) has a real zero in (—1, 1). 


Also solved by 27 other readers. 


Ptolersy Applied 


E 3073 [1985, 147]. Proposed by Calin P. Popescu, Bucharest, Romania. 


In the hexagon A,A,A,A,A;A, the triangles A,A,A, and A,A,A, are equi- 
lateral. Is it true that A,A,,3(k = 1,2,3) are congruent if their sum equals the 
perimeter of the hexagon? Also, consider the converse. 

Solution by Jiro Fukuta, Shinsei, Japan. In quadrilateral A,A,A,A,, the follow- 
ing inequality holds: 

A,A,:A,A; + A,A;°A5A, > A,A;z° AAs, 


and so we have 


A,A, + A,A, > 4,4; (1) 
Similarly, 

A,A,+ A,A,; > AqAj. (2) 

A,A, + AgA, > Ag A>. (3) 


Using (1), (2), and (3), we obtain 
(perimeter of the hexagon) > A,A, + A,A, + A3A,g. (4) 


If equality holds in (4), then equality also holds in (1), (2), and (3). If equality holds 
in (1), A, is on the circumcircle of the triangle 4,A,A,. Similarly, 4, and A, are 
on the same circle. Thus, it is easy to prove that 4,A,,3(k = 1,2, 3) are congruent. 


Also solved by the proposer. 


A Symmetric Inequality in Three Variables 


E 3074 [1985, 147]. Proposed by M. J. Pelling, London, England. 


Prove the inequality 


Ld+p)A+e)p-ry(q-r’ e210 +p) Tp -9), par eR. 


1988] PROBLEMS AND SOLUTIONS 139 


where the sums and products are over cyclic permutations of p, g, r and determine 
all the cases where equality holds. 


Composite solution by Jiro Fukuta, Shinsei, Japan, and the proposer. Define a, b,c 
as follows: 


a=(1+p*)(1+q’)(r- p)(q-1r) 
b=(1+q*)(1+r?)(p — q)(r-p) 
c=(1+r°*)(1+ p*)(q-r)(p- 4). 
Substituting into (a + b + c)* = Ya* + 2Lab > 0 we get 
(Si (1 + p2)(1 + a2 )r- pilq—r)V 
=Y(1+ p+ er = py(q-ry 
+ 2{T 10 +p?) }{T](p - a} {LA + 7) -p)} 
= V(1+p?)'(1+ 92) (r-p)(q-r) - 2710+ ep?) - ¢) = 0 


which is the required inequality. 
Equality holds if and only if a+ b+ c= 0, Le., 


L(+ p?)(1 + a? \r-phq-r) =09. (1) 


Introducing parameters x = (rg + 1)/(q —1r), y=(rp + 1)/(r — p), we have 


r> +1 r°+1 —(r?+1)(x+y) 
’ yep? rr a a a (ytr)(x-r) ? 
r°+ 1)(1 + y? r>+ 1)(1 + x? 
1 pro eV) pe gee TOE) 
(y +r) (x —r) 
which, on substituting into (1) and simplifying, gives 
(r2 +1) 


Cairn (lat ty? tay + 2] ret = y)} = @) 


so that the general case of equality in parametric form is given by 


1 — xy(x* + y* + xy + 2) ry —1 rx +1 
a ia pr ~~; q = 
(x—y)(x+y) yrr x~r 


(3) 
where in (3) x # +y. There is also a special case in (2) when x = y = +1/ 73 and 
r is arbitrary, except r# +1/ 73, with 


r — y3 r+ y3 r+ 3 r— V3 4 
P= Ta?) fo 8 POT) Oa 
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Finally, there is the case p = g =r, not included above because x, y are assumed 
finite. 
The inequality was originally discovered in trigonometric form as 


y cos? a cosec*(B — y) > 2 (sum cyclically over a, B, y), 


which reduces to the algebraic version of the problem on setting p = tana, 
gq = tan B, r = tany and clearing fractions. The parameters x, y of (3) are x = 
cot(B — y) and y = cot(y — a). 


Continuous Functions are Constant 


E 3077 {1985, 148]. Proposed by Nicholas Passell, University of Wisconsin-Eau 
Claire. 


Let N be the natural numbers with the topology 7 consisting of @, N and the 
complements of finite sets. Characterize the continuous functions from R, the reals 
with the usual topology, to N with the topology T. 


Solution by Alan Schuchat, Wellesley College. A function f from R with the usual 
topology to N with T is continuous if and only if f is constant. 


Proof. Constant functions are always continuous. Conversely, if f is continuous 
then R is the disjoint union of the closed sets F = f-*({j}), for 7 in N. By a 
connectedness and category argument (e.g., F. Hausdorff, Set Theory, Chelsea, 
1957, p. 178), only one set F; is nonempty. So f is constant. 


Also solved by M. Bowron, F. S. Cater, Chico Problem Group, J. A. Frohliger and A. L. Thorsen, 
J. W. Grossman & D. Schmidt, D. Hamlin (student), G. M. Kellar, J.-C. Leccia (France), N. J. Lord 
(England), O. P. Lossers (The Netherlands), L. A. Lucas, M. D. Meyerson, W. A Newcomb, M. J. Reed 
& C. R. Diminnie, C. R. Rosentrater, T. Salat (Czechoslovakia), F. Siwiec, T. A. Summers, N. Y. Wong 
(Hong Kong), and the proposer. 


Comment. Cater, Frohliger and Thorsen, and Lucas pointed out that N can be replaced by any 
countable 7, -space. 


An Application of Gauss-Lucas 


E 3078 [1985, 215]. Proposed by Zalman Rubinstein, University of Colorado, 
Boulder, and the University of Haifa. 


What is the largest positive integer n for which the nonzero solutions of the 
equation (1 + z)” =1 +z” all lie on the unit circle? 


Solution by W. O. Egerland, University of Baltimore. Let p,_,(z) = (z + 1)" — 
z" — 1. The derivative p’_,(z)=n((z+1)""'-—2"') has a root at z= 
(exp(27i/(n — 1)) — 1)7!. Since, if n > 8, |z| = (2sinf{a/(n — 1)})7* > 1, p42) 
cannot have all nonzero roots on the unit circle by the theorem of Gauss-Lucas; and 
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since p,.(z) = 7z(z + 1)(z*7 + z+ 1)’, n = 7is the largest positive integer for which 
all nonzero roots of p,,_,(z) lie on the unit circle. 


Also solved by C. K. Caldwell, B. P. Carter, N. Elkies, Z. Franco (student), D. Hamlin (student), A. 
A. Jagers (Netherlands), W. Janous (Austria), S. V. Kanetkar, K. Kearnes, I. E. Leonard, N. J. Lord 
(England), O. P. Lossers (Netherlands), J. M. Monier (France), K. L. Stellmacher, G. Sylvester, and the 
proposer. 


A Congruence Argument 


E 3080 [1985, 215]. Proposed by Loren C.. Larson, St. Olaf College. 
Can the following equations be satisfied with integers? 
(x +1) +a? =(x+2) +b? =(x +3) +02 =(x4+ 4) +d? 
Solution by Mark Bowron, Charlottesville, Virginia. The equations cannot be 
satisfied with integers. For any integer h we have 
O(mod 8) if h = 0 (mod 4), 
h?=(1(mod8)_ if h = 1 or 3 (mod 4), 
4(mod 8) if h = 2 (mod 4). 
Hence, 
0,1, or 4 (mod 8) if h = 0 (mod 4), 
h?+k*=(1,2,o0r5(mod 8) if h = 1 or 3 (mod 4), 
0,4,or5 (mod 8) if h = 2 (mod 4), 


for any integers h and k. Therefore, since x + 1,x+2,x+3,x+4 form a 
complete set of residues modulo 4, the congruences 


(x+1)? 4+ a?2=(x4+2) +b? =(x +3) +02 = (x + 4)’ =d? =n (mod 8) 
are satisfied only if n € {0,1,4} M {1,2,5} NM {0,4,5}, which is impossible. 


Also solved by J. Binz (Switzerland), R. Breusch, D. M. Broline, C. K. Caldwell, P.-C. Chuang, N. 
Elkies, Z. Franco (student), C. Groenewoud, J. H. Halton, R. Heller, J. Hook, I. M. Isaacs, F. H. 
Kierstead, Jr., B. G. Klein, L. Kuipers (Switzerland), O. P. Lossers (The Netherlands), S. Marivani, J.-M. 
Monier (France), D. Nuenschwander (student, W. Germany), M. Nutt (Canada), N. Plotkin, R. A. Simon 
Chile), U. of South Alabama Problem Group, J. T. Ward, K. Woerner, and the proposer. 


An Odd / Even Power Sum Inequality 


E 3102* [1985, 507]. Proposed by J. P. Lambert, University of Alaska, Fairbanks. 


Prove or disprove: For all integers m,n > 1, 


m(27-1)" | 2m 
m (jy 2m+1 


2n+1 


142 PROBLEMS AND SOLUTIONS [February 


No correct solutions were received. The following argument was prepared by 
Harold G. Diamond of the editorial staff. (It is the policy of the MONTHLY Problem 
Section to provide solutions whenever possible. In the case of the present problem, 
this policy was severely tested.) 

The stated inequality is true. Indeed we have 


THEOREM 1. Let 


p+ m m1 


-YeQ-y'/ Len’ (1) 


j=l =] 


det { 2m 
A,(m) = 5 +] 


Then A,(m) > 0 for all positive integers m and all real v > 4. Also, A,(m) > 0, 
A,(m) > 0, and A,(m) < 0 for all positive integers m. 


We shall later give an asymptotic estimate, due to Paul T. Bateman, for A,(m) 
for fixed »v as m > oo. This will show that if m is sufficiently large, depending on », 
then A,(m) < 0 for each real » < 2 and A,(m) > 0 for each real »v > 2. Possibly 
A,(m) > 0 holds for each vy > 2 with no restriction on m. 

The inequality of Theorem 1 is remarkably sharp, considering its simplicity and 
uniformity. The identities that follow (3) (below) show that A,(m) = O(m7?) as 
m — co for vy = 2,3. The asymptotic analysis will show that this O estimate holds 
for each vy > 2 with an O-constant that depends on ». 

We begin the proof of (1) by noting that for relatively large v a simpler inequality 
gives a better estimate. We have 


mi(2j — 1)" [—) | 2m 
Or OT? < 
rei (2 J) 


p+ 


7 (vy > 2m). (2) 


The first inequality holds for all positive values of v since 


1 3 (2m — 1)’ 
—S> <x ——— << eee < —_— TFT? 
2” > AP (2m) 


The second inequality holds for vy > 2m by the binomial expansion 


1 \~? yp 1 
1- 5 >1+ 521+. 
4m 4m 2m 


We shall henceforth assume that vp < 2m. 
Inequality (1) is equivalent to showing 


2m p+l1 aw i 
2m+1 


— Om yan > 
ie (27) 
We verify the cases vy = 2 and v = 3 directly: Familiar formulas for sums of squares 


0. (3) 
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and sums of cubes show that the left-hand side of (3) 1s 


2m \3 4m+1 4m+1 
Eas ~ 2m+2~— (2m +1)'(2m + 2) (=?) 
| 2m \4 ,_ Amt 1) _ 16m? + 22m? + 8m + 1 » =3) 
2m +1 2(m + 1)° 2(m + 1)°(2m + 1)* 


Henceforth, we assume p > 4. 

We proceed by induction on m. In view of (2), (1) holds for m = 1 and m = 2 
for all vy > 4. To obtain adequately sharp estimates we also verify (1) directly for 
m = 3. It suffices to show that 


T\" (21\"  (35\" 
ed rd 4 -°& 
1 v 9) v ° 
rea ~| +{=-] +1 ’ 
3 3 


We do this by breaking [4, oo) into a finite number of intervals [a, b) on each of 
which we have 


T\? (21\" (35\" 7\¢@ (21\"% (35\4 
—} +a] +1 —} +1] +(=> 
Eq rq | Ls rd 4 6 
sup 1\? V\» < 1\2 7 \ 6 SF: 
a<v<b _ + ),— + ] __ + 4{— +] 
3 3 3 3 


One set of such intervals has end points at 4, 4.3, 4.7, 5.3, 6.4, 11, oo. 
Suppose now that (3) holds for some m > 3. Then by our inductive assumption 


2m+2 
yj’< f + 


j=l 


p+] m 
2” ’ 4 (2m+1)° + (2m 4 2)’ 


and we must show that the last expression is at most 
Im+? p+l1 m+1 

1+ 2” Sj’. 
2m + 3 j=l 


Equivalently, we must show that 


1\’ 2m+2\"*} , 
m+—| — (m + 1) 
2 2m + 3 (4) 
Im +2\741 amy) 
< — - 
or Pres = 


We give an upper estimate for the left side of (4) and lower estimates for each of the 
factors on the right side of (4). 
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LEMMA 1. Let A denote the left-hand side of (4). Then 


(m+ 1/2)’ y 
< ———__——-{] - (5) 
2m + 3 2m + 2 
for any positive v. 
Proof. We have 
A ; (2m + 1)(2m + 3)\ ° 2m+2 
(m+1/2)” | (2m+2)(2m+2)/ 2m+3 
; f 1 | 2m +2 
(Qm+2)} 2m+3 
1 1 p 2m + 2 1 p 
<1-— (1 + ——— } —— _ = —— _ — 
| aE A CEE 2m+3 (2m+4 2)(2m + 3) 


LEMMA 2. Suppose that 3 < v < 2m +1. Then 


Sn’ > (m + 1/2) 1 Seer 
p+ 27(m + 1) 


n=l 


(6) 


Proof. We have by two integrations by parts 
n n+1/2 
f rd + fF Oe dt =n? 
n—1/2 Hn 
v(y — 1) 


_ J f" t—-n+1/2) td 
2 Lt /?) 


+ fo“ —n- 1/2" at| 


i 


= en — 1/2)°{(n —u)’ > +(n4+ uy’ *} du 


vip — 1 _ 
< ree, —1/2)’?+(n4+1/2)’ YPC —1/2)° du, 
2 0 
where we have used the fact that (n — u)’~? + (n + u)”~? is an increasing function 
of u on [0,1/2]. By addition it follows that 


m 2 ” viy— 1 ” y- y— 
f Ow dt < nr VEY S 1, 12) *+(n+1/2) ae 
1/2 n=l 48 
By convexity 


yr fx dx (y 31/2), 


1988] PROBLEMS AND SOLUTIONS 145 


SO 


p+l p+l 


(m + 1/2) — (1/2) ay viv —1) (¢ ¢m 7 m+1 
y+ < bm ta pera rat. 


If we evaluate the last integrals and note that 2~’~!/(v + 1) — v/48 < 0, we obtain 


“y 


For fixed m, the expression in braces is an increasing function of v. For 
y<2m + 1 it follows that 


(m+1/2)""" my 4 
ae yn + gmt 1/2) ( 


n=l1 


yd 2m+ 2 
bres 


2m+ 1 


p+il 


m+1/2 m yp 2m \2m 2m+2\2™ 
(m+ 172). yn’ + —(m+1/2)’" | | + , 
y+ nol 48 2m+1 2m+1 
We complete the proof of the lemma by showing that 
| 2m \2m 2m+2\?" 8/2m4+1\? 
_ > 3. 
2m +1 Eee o\mei}? ™?? (7) 
This relation is verified directly for m = 3,4,5,6. For m > 7 we have 
2m \2m 2m+2\?™ 2m \2mtl 2m +2\2m+1 
< 
Pees 2m+ 1 2m+1 ees 
1 25 8/2m+1)\? 
<—+e< — — 
e 8 9\ m+1 


Now (7) fails for m = 1,2, but (6) holds by direct calculation for m = 1, v = 3 and 
form=2,3 <v<5. 

The latter verification can be accomplished by the aforementioned device of 
breaking [3, 5] into a finite number of subintervals. 


LEMMA 3. Suppose that 4 < v < 2m andm > 3. Then 


2m+2\""} 2m \?t 2(v+1)B 
— > TY 
Ss | (2m + 1)(2m + 3)’ 
where 
p/2 vip—1 y*(p—1 
B>l1- / , 2d _ 77) 
m+1  8(m+1) 48(m + 1) 
Proof. Let 
od 2m 2m+2 
PC), Fay te ag 7 
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and let c= (a+ b)/2. Noting that f ’”” > 0 on[a, b], we have by Taylor’s formula 
f(b) =f(c) + (b-c)f(c) + (b- ¢) f"(c)/2 + (b- ce) f'”(€)/6, 
f(a) = fc) + (a-e)f'(c) + (a- 0) f'(c)/2 + (a- ec) fF" (0) /6. 


We obtain, upon subtracting, 


f(b) = Ha) > (b= a) (0) = Boao +B 
where 
gue Ami + Om+ tm 
(2m + 1)(2m + 3) (2m + 1)(2m + 3) _ 
Let 
2m + 2 
© (2m + 1)(2m + 3)’ 
Then 
1 1 1 


yey ooo & OF ED FT 3, 
2m+2 (2m + 2) (2m + 1)(2m + 2) (2m + 3) 


say. By the binomial theorem 
B=(1—e)’>1—ve+ v(v — 1)e*/2 — v(v — 1)(v — 2) 8° 6, 


where the truncation estimate is valid since ve < 1. 
Let S, = 1 — ve + v(v — 1)e*/4. Then 


S,>1—-v(e, + &) + v(y — 1)e?/4. 


Let S,(t) = ¢*/2 — (v — 2)t?/3. Since S,(t) is increasing for 0 < ¢ < e, we have 
S3(€) > S3(€;). 


Thus 
p(y —1) 
B>S,+ Sale) 
y y v(v —1) v(v — 1)(v - 2) 
2m+2  (2m+2)  2(2m + 2)” 6(2m + 2)° 


Finally, for »v > 4 we have 
—y—v(vy—-1)(v-2)/6 —-v*(v-1)/6 
——, —. = 
(2m + 2)° (2m + 2) 


which completes the estimate of B. 
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We now use Lemmas 1, 2, and 3 to establish the validity of (4). By these lemmas 
it suffices to show that 


aay p 


Im +3  —mt+2 


. 2(v + 1) j= y v(v—1) | 
(2m + 3)(2m + 1) 2m+2 8(m+1)  48(m+1)° 
(m+1/2)"** ; 2v(v + 1) 
pt) teeceeerd 


for 4 < vy < 2m, m > 3. Equivalently, we must show that 


p <1 yo p(y — 1) 7 p*(y — 1) ie 2v(vy + 1) } 


2m + 2 — Im+2— 8(m+1)P  48(m+1}\ 27m +0) 
or that 
(lly — 43)p (2lv+75)p*p*(v-1)(vt+1) (P-1)(¥ 4+ 1)? 
216(m +1)? 1296(m +1) —-:108(m + 1° 648(m + 1)” ° 


The first term in the last expression is positive for vy > 4. The second plus the fourth 
terms exceed the third term, by the arithmetic-geometric mean inequality. This 
establishes (4) and completes proof of the induction. 


An asymptotic estimate. 


LEMMA 4. If vy € R, v # —1, then 


1 p 
j= | + ym + pn’ + ¢(-v) + O(m’~?), 
j=l 


where § denotes the Riemann zeta function. 


This is a particular case of the Euler-Maclaurin sum formula and may be found 
in B. C. Berndt’s Ramanujan’s Notebooks, Part I, Springer-Verlag, New York, 1985, 
p. 150. 


If » > —1, then two applications of Lemma 4 and a division yield 


2m fom yt+1l  (vtl1)(v+2)) (v+1)(v + 6) 
LI) Qf) =2- 5 + — 3 - — 5 
~ | m om 48m 


j=l ] 


or meene =), of 2, 1 1) 


mt} 
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For fixed v, the binomial theorem gives 


Im p+1 1 —py-—l 
+ =1+ {1+ — 
2m+1 2m 
yp+1  (vt+1)(vt+2)  (v+1)(v + 2)(v + 3) 
= —_— el + Tl —— — 
2m 8m? 48m? 
] 
+O sal (9) 
m 


If we subtract (8) from ” we obtain, for vy > —1, 


2m 
A =1+ - acd 
vip+1 y+1)e(—v)(2’t' - 1 1 1 1 
— rth) (4 Dente + O + ——— + —]. 
94m? ya mt m’t2 m2t2 m4 


For » < —1 we estimate A,(m) by approximating £7”, j” and ©” ,(2j)” di- 
rectly. Together these estimates give 


THEOREM 2. As m —> © 


A,(m) ~ v(v + 1)/{24m?}, if v>2, 
v+1 
A,(m) ~ tO" =?) a 9). if -l<pv<2, 
| —2 log2 
A_,(m) ~ oem 
A (m) ~ 2-27", iff v<-l. 


In particular, A,(m) is asymptotically positive if and only if v > 2 
A Product Inequality 


E 3132 [1986, 132]. Proposed by Robert E. Shafer, Berkeley, CA. 


For integers n > 2 and real s > 0, show that 


Te +0 ES <(n+1) TI 


1 
s+ k= 5] 
2 


Solution I by San Bernardino Problem Solving Group, San Bernardino, CA. By 
defining f(s) =TI%_ (s + k), the desired inequality becomes f’(s) < f(s + >) — 
f(s — 5). Writing f as a Taylor series about s yields 


- fs al Fs) + DPMS 5 Ja 


s+ 


N\st 5 
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where the summation is over odd values of 7 from 3 ton + 1. For n > 2 and s > QO, 
the higher order derivatives are positive (f is a polynomial with positive coeffi- 
cients), which establishes the inequality. 


Solution IT by Michael Vowe, Therwil, Switzerland. First we prove by induction the 
inequality 

nt n nt 
[](s +i) < [s+ 5) 0 
i=0 


A=1 


s+k- 5). (1) 


For n=1, this is s(s + 1) <s?+s5+4=(s + 4)*. For the induction step, it 
suffices to show 


n+ 1 1 
(s+ n/y(stne1)<[s+ 5 [(s+n+5), 


which reduces to n + (1/2) > an. 
Now we prove the original inequality by induction. For n = 2, the inequality 
becomes 


1 1 
s(s + 1)(s + 2)|- + —— + 
5 


9 
= 357+ 684+2< 357+ 654+ — 
s+] s+2 4 


3 

+=—](s+—~). 
ST Oi" 9 
Now assume that n> 2 and that the original inequality holds for n. By the 
induction hypothesis and (1) we have 


= 3 


n+1 n+1 n+1 


Itoh 5 Horo sot H+) 


n 1 
<(stn+1)(n+1) [] s+k- 5 
k=1 2 
n I] k 
tls+— tk-—- 
[ 5) I: 2 
n+l 1 
=(n+2)[| s+k- 5] 
=1 


Solution III and generalization by BehdZet A. Mesihovic, University DZemal Bijedic, 
Mostar, Yugoslavia. For a sequence (a;) of positive numbers, the Arithmetic- 
Harmonic Mean Inequality is 


A,(a) = 


n n 
4; 2 n = H,(a), 


with equality if and only if a, = --- = a,. With this, we can generalize the original 
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claim to 


2 


for a sequence of positive numbers satisfying 
(k + 1)a, —(k -1)a,4,; > 2A4,(a), (3) 
with equality in (2) if and only if a, = --- =a,,, or n=1. Note that the 
condition (3) holds for any concave sequence, and that if a, = s + k — 1, then it 
holds with equality. 
If n = 1, then (2) holds with equality. For n > 1, we apply induction. Letting 


L,,,R,, denote the left side and right side of (2), we rewrite 2n(L, — R,,) and then 
use the induction hypothesis to obtain 


Hn n+1 
Ay, + aAnyy 
= 


(n+ ) IL La 


He} 


Hl 


2n(L, 7 R,,) — (n + 1)(a, + Ani) Ly-1 7 2n Any tRy-1 + I] ay 
k=1 


2 ((n + l)a, 7 (n 7 Lana) Raa 7 2n I] ay 
2 2A,(a)R,_; _ 2H,(a)R,,-1 2 0. 


The final inequality is strict unless a, = --- =a,, which completes the proof 
of (2). 


Also solved by 25 other readers and the proposer. 


ADVANCED PROBLEMS 


For instructions about submitting solutions of these Advanced Problems, which should be mailed before 
June 30, 1988, see the inside front cover. Please place the solver’s name and mailing address on each 
(double-spaced) sheet. Include a self-addressed card or label is an acknowledgement is desired. 


6566. Proposed by Jonathan E. Spingarn, Georgia Institute of Technology. 


Let r: R? > R* bea rotation about the origin by an irrational multiple of z. Let 
f: R* > R? be defined by 


f(x,y) =(x,y) if x <1, f(x,y) =(2-x,y) if x>1. 
Does there exist (x), yy) € R? such that 
k 
ref) (Xo, yo) | >1 


for all positive integers k? (Here (r° f)* denotes the kth iterate of ro f and || || 
denotes the ordinary Euclidean norm.) 


6567. Proposed by Gerald A. Edgar, Ohio State University, and Lee A. Rubel, 
University of Illinois at Urbana-Champaign. 


Let us say that a function u(x, y) is algebraic if there exists a nontrivial 
polynomial P in three variables such that P(x, y, u(x, y)) = 0. Suppose u(x, y) is 
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a harmonic and algebraic function on a region G in R?. Must its harmonic 
conjugate v also be algebraic? 


SOLUTIONS OF ADVANCED PROBLEMS 
Monotonic Norms 


6511 [1986, 218]. Proposed by M. Luisa N. McAllister, Moravian College, 
Bethlehem, PA. 


Let R”*” be the vector space of all m X n matrices with real entries. Suppose 
that A = (a,,), B =(6;;) are in R™*” and satisfy 0 < a,,; < b,,; for all i, j. For 
which norms v on R”*” does the inequality 

0<v(A) < v(B) 
always hold? 


Editorial Comment. No solutions were received. The problem essentially asks 
which norms F(Xx,,, X15,---;X,,,) 1 mn-dimensional Euclidean space are non- 
decreasing functions of each coordinate separately for nonnegative values of 
Nyis X129+++9 Xn 

The max norm, the max row-sum norm, the max column-sum norm, and all 
p-norms with p > 0 certainly have this monotonicity property. The square root of a 
positive definite quadratic form in x,,, X,5,---, X,,, has this monotonicity property 
if and only if all the coefficients of the form are nonnegative. For example 
7 


mi 


{ x2, + Xft -+- +x2 0 4+ e(x1y, - X15) 


does not have this monotonicity property for any positive e. 


6512 [1986, 218]. Proposed by Alfonso Villani, Universita di Catania, Italy. 


A Measured Response 


Let (&2, S, w) be a positive measure space. Let A be a Borel subset of C (the 
complexes) and let M(A) denote the subset of L'(w) defined by 


M(A) = { f © L'(u):u(f-1(A)) > Of. 
(a) Assume that A is a set of first category not containing zero. Prove that M(A) 
is of first category. 
(b) Assume that A is of second category. Prove that M(A) is of second category. 
(c) For which measures p is the assumption “O ¢ A” in (a) superfluous? 


Solution by Kenneth Schilling, University of Michigan, Flint. We first establish 
three lemmas. 


LEMMA 1. Suppose A € C is closed, 0 € A, and ¢ > 0; then the set 
{fe Lu): hl f(A) < 2} 


is open. 
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Proof. Fix f € L}(w) such that u( f—1(A)) < e. We will find 6 > 0 such that 


NA) = (ELM uw): [if slau <3} c (feu(fH(A)) <e}, 
Let 
Q,={w EQ: d(f(w), A) < 1/k} 


where d(x, A) denotes the Euclidean distance from the point x to the set A. Since 
A is closed, 


f(A) = () Q,. 
k=1 
Since A is bounded away from 0, p(Q,) < oo for all sufficiently large k. Therefore, 
p(Q,) =e <e 
for some k. Choose 6 = (e — e’)/k. Then for all g © N;(f) we have 
(e-e)/k> fif-sidu> fi. Wf sid 


“MAN Q, 
> [u(g"(A)) - e] /k, 
so u(g '(A)) < € as desired. 
LEMMA 2. If A C C is open and ¢ > OQ, then 
({feL(u): (f(A) > &} 
is open. 
The proof of Lemma 2 (omitted) is similar to that of Lemma 1. 
LEMMA 3. If U CC is open, 0 € U, and A © U is dense in U, then the set 
{fe Lp): f(Q)\NU c A} 
is dense in L'(1). 
Proof. Fix g € L'(u), and e > 0. We will construct an f © L'() such that 


f(Q)NU CA and [\f- gldu <e. 


First, let a,, i = 1,2,3,..., be a sequence of elements of A that is dense in U. Next, 
define a function 


h:C€ ~C 


as follows. If z € U, set h(z) =z. If z © U, let h(z) =a, where i is the least 
integer such that 


a,-2| < elzi/ fislay. 
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(It is here that we use the fact that 0 € U.) Then h is Borel-measurable, h(U) C A, 
and 


Iz — h(z)| < elzi/ fila 


for all z € C. Finally, let f= ho g be the composition of h with g. By the above, / 
is p-measurable, 


f(Q)NU C A, 


and 


[if-gldu= fiheg— gidp 


< f ei/ f la du =e. 


This proves Lemma 3, and now we can solve the problem. 
Proof of (a). If A is of the first category, then 
Ac UG, 


where each C,, is closed and nowhere dense, and 0 € C,,. Since 
M(A) c UM(C,), 
it suffices to show that M(C,) is of the first category, for each n. By Lemma 1, 


Du) /M(C,) = {fF © D(H): oF (G,)) = Of 


{f:u(f-XC,)) <1/m} (*) 


1 


728 


is a Gy set. Also, by Lemma 3 with 
U=C\ {0} and A=C\C, 
(this is dense in C), we see that 
(f= Lm): f(Q)\ {0} CC\G,} 
= {f:f(@) CC\G,} 
(note that 0 © C \ C,) is dense. Since it is also contained in 


L'(u)\ M(C,) 


we conclude that M(C,) is of the first category. 
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Assertion (b) is incorrect if (2 has no subset of finite, positive measure. For if we 
let A =C \ {0}, then A is of the second category, but M(A) = ©. 

However, (b) is true provided there exists a measurable set X C Q such that 
O < u(X) < oo. To prove this, suppose that A is not of the first category. Then 
there exists an open set U C C such that 0 is not in the closure of U, and U\ A is of 
the first category. Therefore, there exist open sets B, D B, D B,D --- such that 
U > B,, each B, is dense in U, and 


() B,C A. 


n=1 


Choose e such that 0 < e < n(X). By Lemma 2, the set 
F,= {f:p(f-(B,)) > &} 
is open for each n. Since the assertions 
ex<p(f-4(U)) and f(Q)NUCB, 


together imply that »(f~‘(B,)) > «, it follows by Lemma 3 that F’, is dense in the 
nonempty open set 


O={f:p(f'(U)) > e}. 


(To prove that Q is nonempty, choose x, € U and let f(w) = x, for w © X, and 0 
otherwise. Then f € Q.) Hence, f), F, is the complement of a set that is of the first 
category in Q. Hence, it is not of the first category itself. Finally, since each B, C U 
is by construction bounded away from 0, for all f © L\() we have 


p( f-1(B,)) < 0. 


Therefore, 


(VF, < a (02) > | C m2, ] Cc M(A), 
so M(A) is not of the first category. 


Solution to (c). The condition 0 €¢ A in (a) is superfluous if and only if the 
measure p is o-finite. 

Suppose that p is not o-finite. From the well-known fact that, for all f © L'(), 
the set 


O\f-({0}) 
is of o-finite u-measure, it follows that M({0}) = L'(u). Of course, {0} is of the 
first category and L(y) is not. On the other hand, suppose that p is a o-finite 
measure. In order to eliminate the hypothesis “O € A” from (a), we shall (1) 
reformulate Lemma 1 so as to avoid using this hypothesis at line (*) of the proof of 
(a), and (2) show how to eliminate the condition “0 € U” from Lemma 3. In 1 and 
2 below, let Q = U,Q,, where the sets Q,, are disjoint and of finite u-measure. 


nen? 
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1. Lemma 1 as stated is false if the hypothesis “O € A” is omitted, but a proof 
similar to the one given proves that if A C C is closed, e > 0, and 2’ C ) is a set of 
finite u-measure, then the set 


(fe Lu): u(QNF-(A)) <e} 


is open. Now in the proof of (a), replace the lines immediately following (*) with 


“EM(u)\ M(C,) = {fe Lu): w(f-(C,)) = 0} 
= A A { f: p(Q,Nf-(C,)) < 1/m } is a Gy set.” 


k=1 m=1 


2. To eliminate the condition “O € U” from Lemma 3, replace the single 
function h(z) in the proof of Lemma 3 by a sequence of functions. 
For n = 1,2,3,... define 


h,:C ~C 


as follows. If z € U, set h,(z) =z. If z © U, let h,(z) =a, where i is the least 
integer such that 


la, — z|<e-2>"/n(Q,,). 


Finally, define f: Q > C by f(w) =h,(g(w)), where w € Q,. Thus, 


[\f-sldu =f th,eg ~ sidu 


< [ 62° "/u(Q,)du = 627”. 
Q, 


Summation over n yields the conclusion of Lemma 3, as desired. 


Also solved by Victor Pambuccian (Romania) and the proposer. 


Choice on Squares 


6518 [1986, 403]. Proposed by Vladimir N. Akis, California State University at 
Los Angeles. 


Let S be a closed square of area 4. Denote by @ the collection of all squares of 
area 1 contained in S whose centers are points of S, and whose sides are parallel to 
the sides of S. 

Let f: @—> S be a choice function, where f(A) € A for each A € @. Suppose 
that f is continuous with respect to the Hausdorff metric on @. Show that f(A) is 
the center of S for some A € @. 


Solution by David C. Kay, University of North Carolina, Asheville. Let c denote 
the center of S. Suppose f(A) # c for all A € @. Then no sequence { A;} of sets in 
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@ can Satisfy 
lim f(.A,) =. 


Were this to happen, some subsequence {B,} of {A;} would converge in the 
Hausdorff metric to a square B € @, and then by continuity of f, we would have 


f(B) = f(lim B,) = lim f(B,) = ¢, 


a contradiction. Hence there exists a disc D of positive radius, centered at c, such 
that f(A) € D, all A € @. But this allows us to construct a retraction g of D to 
its boundary defined as follows: For each x € D consider the line segment joining x 
and f(A,.), where A, is the unique square in @ centered at x. This line segment 
meets the circular boundary of D at a unique point g(x). Clearly g(x) 1s continu- 
ous. But this contradicts the No Retraction Theorem. 

All other solutions were rather similar to the above, and relied on either the 
Brouwer fixed point theorem or some closely related result. The solutions of John A. 
Frohliger and John Henry Steelman both extended the result to n dimensions. In 
addition, Frohliger remarks that if the square S$ is replaced by the closed disc of 
radius 2 and @ by the internal closed discs of radius r (where r < 1 is fixed), then 
the range of f even contains a centered closed disc of radius 2 — 27. 


Also solved by G. D. Chakerian, William Geller, Victor Pambuccian (Romania), and the proposer. 


A Nonhypergeometric Sum 


6519 [1986, 403]. Proposed by Ira Gessel, Brandeis University. 


Let 
"| b+m 
J m—jy? 


where m and n are nonnegative integers. Show that F(a, b, m,n) = F(b, a, n, m). 


m 


F(a,b,m,n)= >. 
j=0 


at+m+n—2j 


nT] 


Solution I by Jay Hook, Champaign, Illinois. Let X and Y be two. disjoint sets 
with 
|X| =a+n, and |Y| =b+™m. 
Consider the class of all partitions of each of X and Y into three disjoint subsets, 
say, 
X= X,UX,UX, and Y= Y,UY,UY;, 


such that 

(i) [My] = |X| + 6 
and 

(ii) |X| = |X| + a. 


We refer to this class as the set of bipartitions of X and Y with parameters 
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(a,b, m,n). All such partitions may be constructed by the following four-step 
procedure: 

(1) Choose a nonnegative integer j to be the cardinality of X,. By (i) we have 
J<m. 
(2) Choose X;, a subset of X. This can be done in 


atn 
Ca] 
different ways. 
(3) Choose Y,, a subset of Y, so that (i) is satisfied. This can be done in 


b+m\ [b+m 
J+tb} \m-j 
different ways. 


(4) Observe that the sets X,, X;, Y,, Y; are uniquely determined by specifying the 
set 
S = X,UY, c (XUY) \ (X,UY,). 
Since 
[S| = |X] — |X] — [XQ + 1] 
=(at+n)—j+ (Y|- 1X5), 
condition (ii) is equivalent to |S| = n — j. Hence the number of different ways of 
choosing an S that satisfies (11) is 


at+mt+n—2] 
n—J . 

This procedure shows that the cardinality of the class of partitions satisfying (1) 
and (11) is F(a, b, m, n). Now clearly there is a one-to-one correspondence between 
the above class and the class of bipartitions of Y and X with parameters (b, a, n, m). 
This proves the result. 


Solution IIT by C. L. Mallows, AT&T Bell Laboratories, Murray Hill, New 
Jersey. Expand the first binomial coefficient by Vandermonde’s theorem to obtain 


at+m+n—2j ~y[otrrs en) 
T\n-sJa—k k 
and then rearrange the implicit factorials to obtain 


n—j 
Facto Tn) 


Now (by Vandermonde again) summation over j shows the above is 


(> * m+n | ba mart = F(b,a,n,m). 
k 


Proofs somewhat similar to Mallows’ were also provided by Jay Hook and the proposer. According to 
the proposer, “unlike most binomial coefficient identities, this one is not equivalent to a well-known 
identity for hypergeometric series.” 


Reviews 


EDITED By JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Discrete Mathematics. By John A. Dossey, Albert D. Otto, Lawrence E. Spence, and 
Charles Vanden Eynden. Scott, Foresman and Company, Glenview, IL 1987. 
482 pp. 


RICHARD D. PORTER 
Department of Mathematics, Northeastern University, Boston, MA 02115 


Computers became a part of everyday life sometime between 1972, when I saw 
an early version of the remarkable computer film “The Hypercube” by Thomas 
Banchoff and Charles Strauss [1], and last Saturday morning when Jim Henson’s 
Muppet Babies demonstrated the wonders of computer graphics to those who watch 
Saturday morning cartoons. In the Muppet show a six-year-old cartoon character, 
pounding away on his PC, confides to the audience that the graphics is all 
mathematical and very precise. At the same time, a second character runs around 
the screen randomly placing splotches of color on the graphics. The actions of the 
second character, while keeping the children’s attention, suggested that precision in 
mathematics implies a lack of imagination. 

A consequence of the pervasive use of the computer from science to word 
processing to Max Headroom and the Saturday morning cartoons is that students 
are taking courses in discrete mathematics. What is to be gained from a course in 
discrete mathematics? In a typical course at the freshman-sophomore level, students 
are looking for the background material needed for courses that follow in mathe- 
matics, computer science, engineering, etc. At the junior-senior level, students 
should be examining a mathematical structure in depth (group theory, for instance) 
and applying the results to a variety of problems. The goal of the second course is to 
expose students to the nature and power of mathematics. 

The book under review is a text for the first of the two courses. The list of topics 
for such a course suggested by the MAA panel on Discrete Mathematics [3] includes 
Sets, the nature of proofs, Formal Logic, Functions and Relations, Recursion, 
Combinatorics, Graphs, Trees and Order, and Algebraic Structures. 

The first criterion of a text for this course is that the material moves beyond 
definitions. Students need time to work with ideas, and see applications. A course at 
the freshman-sophomore level should achieve as many of the goals of the second 
course as possible. In the words of the MAA panel [3] “Adequate time should be 
allowed for the students to DO a lot on their own: they should be solving problems, 
writing proofs, constructing truth tables, manipulating symbols in Boolean algebra, 
deciding when, if and how to use induction, proofs by contradiction, etc.” Judging 
from the books on my shelf, a text that covers topics in the right depth won’t have 
space for all of the topics on the panel’s list. 
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The book by Dossey, Otto, Spence, and Eynden passes the first test by choosing 
not to cover algebraic structures. The first chapter is an engaging introduction to 
combinatorial problems through examples of matching problems (Is it possible for 
an airline to meet requests from pilots for specific flights?) and Knapsack Problems 
(Which experiments should be packed into a satellite of given capacity in order to 
provide the greatest potential return to science?). The approach is one of exploring 
the problems. Various strategies are considered along with their complexity. The 
book moves on to consider a variety of problems related to graphs and develops 
algorithms to solve them. There are algorithms for shortest paths, searching, sorting, 
optimal matching, and network flows. The chapter on counting includes an al- 
gorithm for enumerating permutations. Finite State Machines, Logic and Proof 
Techniques are covered, but only lightly. 

While the authors’ choices of which topics to cover and emphasize are not right 
for everybody, the text succeeds in presenting mathematics in an inviting way and in 
greater depth than a text whose goal is to cover all the topics listed by the MAA 
panel. As noted by Lynn A. Steen [2] “The challenge for mathematicians every- 
where...is to stimulate in all students the intellectual acumen that inheres in 
mathematical habits of thought.” Discrete mathematics especially at the freshman- 
sophomore level, where there is a wide sampling of students, offers an excellent 
vehicle for meeting this challenge to the benefit of all. Hopefully, students will 
discover that imagination along with precision is essential in mathematics. 
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Constructive Analysis. (Grundlehren der mathematischen Wissenschaften; 279), by 
Errett Bishop and Douglas Bridges. Springer-Verlag, Berlin, Heidelberg, New 
York, Tokyo, 1985. 


Ray MINES 
Department of Mathematics, New Mexico State University, Las Cruces, NM 88003 


Constructive Analysis is a substantial reworking of Errett Bishop’s Foundations of 
Constructive Analysis, which greatly influenced the modern resurgence of construc- 
tive mathematics. Indeed, most of the 93 references in the new edition owe their 
existence to it. The effect of the book has been felt by logicians and computer 
scientists as well as mathematicians. Logicians have been attempting to obtain a 
formal system that captures Bishop’s constructive framework and several computer 
scientists have attempted to computerize portions of the book, while others are 
trying to see how much mathematics they can develop under more restrictive 
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constructive conditions than Bishop’s. Mathematicians have been proving theorems 
using Bishop’s constructive methods. (See Stolzenberg, Bulletin of the American 
Mathematical Society, 76(1970) pp. 301-323, for a good discussion of the original.) 

Bishop’s book had an advantage over earlier expositions of constructive mathe- 
matics in that it was interesting and easy to read. This feature has been preserved in 
the new edition. Mathematicians will have no trouble understanding the material. 
Anyone not aware of the constructivist philosophy behind the book might wonder 
why the authors do some things. For example the constructive version of trichotomy 
is the following. If a, b and c are real numbers with a < b, then either a < c or 
c<b. 

The differences between classical mathematics and constructive mathematics 
stem from the different interpretations of the phrase ‘there exists.’ Classically, the 
phrase ‘there exists’ has come to mean ‘there cannot fail to exist.’ Thus most proofs 
of existence begin with ‘assume not’ and continue to arrive at a contradiction, even 
when it is easier to show directly that the object exists. Proofs by contradiction have 
become so common that they seem to be the proof of choice. Take a text off your 
bookshelf and open it at random. Look at the proof you find. More than likely it 1s 
a proof by contradiction. In fact, it is apt to be of the following form. First the 
author says assume not, then he gives a direct proof, and finally he derives a 
contradiction to the fact that he has a proof of what he assumed to be false. 

Constructively, the phrase ‘there exists’ means there is an algorithm that con- 
structs the desired object in a finite number of steps. (Here the word algorithm is 
undefined. It is not meant in the sense of recursive function theory where algorithm 
has a definite meaning. In fact several of the algorithms that are used in recursive 
function theory are not algorithms in our sense.) This difference in interpretation of 
‘there exists’ is carried over to disjunction. Thus, classically to show (A or B) it is 
shown that —A and —B leads to a contradiction. Constructively one must prove 
either A or B. 

One can best appreciate this difference by considering a proof of the Bolzano- 
Weierstrass Theorem. This theorem states that an infinite set S contained in the 
interval [0,1] has a limit point. The key to the classical proof of this theorem which 
uses interval halving is the observation that one of the sets S  [0,1/2] or S O 
[1/2,1] must be infinite. The reason being that they cannot both be finite. 
Constructively this is not helpful. Knowing that they cannot both be finite does not 
tell us where to look for an approximation of the limit point. (Maybe this is why 
some students have a hard time understanding this proof: They want to know which 
half.) The logical principle involved here is the law of the excluded middle (for any 
proposition p either p or —p). In constructive mathematics the law of the excluded 
middle is accepted in finite situations but not in infinite ones. Thus if we know that 
n is an integer and n is either 0 or 1, then to show that nv is 1 it is enough to show 
that n # Q. On the other hand if {a,,} is a sequence of integers that are either 0 or 1 
then to show that there exists an n with a, = 1 we cannot argue that it is impossible 
for all a, to be 0. We must instead name an integer n and show that a, = 1. 

In order to clarify this point let’s consider the following experiment. Imagine a 
computer programmed to take as input an integer 1 and in a finite amount of time 
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to compute either 0 or 1 as output. Suppose further that it does it in such a way that 
once there is an n with output 1, then the output 1s 1 for all m > n. Thus as we feed 
in the sequence 1, 2,... we get a sequence a, < a, < ::: as output. This sequence 
has the property that given an integer 1 we can determine the value of a, but we 
will not know the values of all the a,’s unless we find an n with a, = 1, or prove 
that a, = 0 for all an. 

Suppose that you were to watch the output and you notice that a, = 0 for all 
n < 10°. You might wonder if there is an n with a, = 1. As a mathematician you 
could examine a copy of the program and attempt to prove or disprove that a, = 0 
for all n. As a classical mathematician you would more than likely begin by looking 
for an indirect proof. Suppose you were able to give such an indirect proof to the 
statement that there is an 7 with a, = 1, then your proof would have shown that it 
is impossible for all of the values to be 0. As a classical mathematician you would be 
happy as you know that there must be an n with a, = 1. Now suppose that you 
were told that the first person to predict the value of n with a, = 1 would win an all 
expense trip for two to Hawaii. What would you do? Your proof has not produced a 
method for finding such an n. If you wanted to go to Hawaii you would have to go 
back and attempt to find a constructive proof. This situation was nicely summarized 
by a colloquium speaker when he said ‘This is only an existence theorem, it does not 
say how to get it.’ 

This is the situation in which the constructivist finds himself when presented with 
a nonconstructive proof of the existence of some mathematical object: It is nice to 
know it must be there, but how do we find it? Knowing that there cannot fail to 
exist an n with a, = 1 does not allow one to do further computations. In the above 
situation the constructivist would try to compute a value of n and then prove that 
a, = 1. He might not be successful because it might be a much harder task than to 
show that such an n cannot fail to exist. 

To relate this discussion to the proof of the Bolzano-Weierstrass Theorem, let 


S = {b,:b, = 1/2 — 1/n unless a, = 1 in which case b, = 1 — 1/n}. 


It is clear that S is an infinite subset of the interval [0,1]. Which of the two sets 
SO [0,1/2] and SO [1/2, 1] is infinite? To answer this question we must know if 
there is an n with a, = 1. Notice that the sequence {5, } gives us an example of an 
increasing sequence bounded above by 1 for which we are unable to compute a 
limit. 

You might think that the above is trivial. After all given such a program we must 
surely be able to answer the question on the existence of an integer n with a, = 1, 
as a program is a finite list of instructions. The following is a simple algorithm that 
will produce a sequence for which we do not know if such an n exists or not. Let 
a, = 1 if there exists an odd perfect number less than or equal to n and 0 otherwise. 

An example of the problems we face when trying to prove a disjunction is given 
by the following classical proof of the fact that there exists two irrational numbers a 
and £ with a® rational. Let y = ¥2 V2 Bither y is rational and we take a = 2 = B, 
or y is irrational and we take a = y and B = y2 to get a® = 2. Most mathemati- 
cians, while admiring this slick proof, would like to know which of the two pairs (y, 
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v2) and (V2, ¥2), gives the example. A constructivist carries this wondering to such 
an extent that he would not consider this a proof of the stated theorem. He would 
need to have a proof of the fact that y is irrational. To a constructivist the above is 
a proof of the following statement. 


It is impossible that for each pair (a, B) of irrational numbers the number a® is 
irrational. 


In the Prolog on page 3 of the book under review we find the following. ‘Every 
theorem proved with idealistic methods presents a challenge: to find a constructive 
version, and to give it a constructive proof.’ This is what most constructive 
mathematicians have been doing. On the surface it looks like an easy task. Just take 
some well-known theorem and reprove it in a constructive way. Nevertheless, the 
task of finding a constructive version can be hard. Usually there are several 
constructive versions. This is the case for the Big Picard Theorem on essential 
singularities. On pages 179 and 180 we find two versions. 


(6.21) THEOREM. If f is a differentiable function on the open annulus 
A= {zE€C:0<|z-2z)|<r} 
with 0 and I not in the range, then f has a pole of determinate order at Zo. 


(6.22) THEOREM. Let f be differentiable on the open annulus 
A={z€C:0<|z-2| <r}, 


with an essential singularity at z,, and let § and §’ be distinct complex numbers. Then 
in any neighborhood of zy, f takes at least one of the values § and &’. 


Classically, these two statements are equivalent. Constructively, they are quite 
different. Version (6.21) says that we can compute an integer n so that the 
coefficient of (z — z))” in the Laurent expansion of f is zero for all m < n. While 
in (6.22), given a neighborhood of z), we are able to compute a complex number z 
in this neighborhood for which the value of f(z) is either ¢ or ¢”. 

At first blush the reader might think that by considering (1/n)-neighborhoods of 
Z, we could conclude that one of the values § or ¢’ is taken on infinitely often. All 
that we can prove is that it is impossible that each is taken on only a finite number 
of times. 

Many people have expressed the view that there should be close ties between the 
theory of algorithms and constructive mathematics. Perhaps because algorithms are 
innately constructive. One concern of people who study complexity theory is the 
existence of efficient algorithms. In the past the existence of polynomial time 
algorithms has been shown by giving an algorithm and then showing that it can be 
carried out in polynomial time. Times have changed. Mike Fellow has used 
Wagoner’s conjecture to show that there are polynomial time algorithms for solving 
certain problems in graph theory. His methods are highly nonconstructive. In fact 
the existence of these algorithms is probably independent of ZF. So there are now 
pure existence proofs for computer programs that solve real problems, and no hope 
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of ever programming them. This might be compared to the situation that existed 
when Hilbert gave his proof of the Hilbert Basis Theorem which led Gordan to 
make his famous statement. 


Das ist nicht Mathematik. Das ist Theologie.’ 


It seems to me that the interest in constructive mathematics will increase with the 
growing influence of computers on mathematics. There is already a return of 
interest in the original problems that led to the abstract structures that most of us 
have been concerned with. If you want to get in on the ground floor, then the book 
under review is an excellent starting point. 


1For an interesting discussion of the controversy that arose over Hilbert’s proof of the Basis Theorem 
see Constance Reid, Hilbert, Springer-Verlag, New York, 1970. 
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General, S. Succeed With Math: Every Student’s 
Guide to Conquering Math Anziety. Sheila Tobias. 
The College Board (45 Columbus Ave., NY 10023), 
1987, xviii + 252 pp, (P). [ISBN: 0-87447-259-8] A 
popular math-fitness book for students and adults 
who must learn more mathematics than they think 
is possible for them. Practical hints on reading and 
thinking about mathematics, on solving problems, 
and on uses of mathematics in social science, biol- 
ogy (population genetics), and business. Designed to 
build self-confidence, this sequel to Overcoming Math 
Anziety covers many of the mathematical skills re- 
quired to study freshman-level mathematics. Should 
be in every college’s study skill center. LAS 


Mathematics Appreciation, L**. Mathematical 
Recreations and Essays, Thirteenth Edition. W.W. 
Rouse Ball, H.S.M. Coxeter. Dover, 1987, xvii + 
428 pp, $8.95 (P). [ISBN: 0-486-25357-0] Revision 
of the 12th Edition (TR, May 1975) published by 
the University of Toronto Press in 1974. New ma- 
terial features the role of the computer in extend- 
ing the list of Mersenne primes and in the resolution 
of the problem of “squaring the square.” An ad- 
dendum on space-filling with “golden” rhombohedra 
and the connection with quasilattices. Every public 
or school library should have a copy. No one inter- 
ested in recreational mathematics can afford to pass 
up this modestly-priced delightful classic. JK 


Mathematics Appreciation, T*(13:1). A Math- 
ematics Sampler: Topics for Liberal Arts Students. 
William P. Berlinghoff, Kerry E. Grant. Ginn Pr, 
1986, 400 pp, (P). [ISBN: 0-536-05450-9] Ambitious 
book for a freshman course for liberal arts students. 
Well written. Covers topics such as primes, perfect 
numbers, geometry, probability and statistics, un- 


countability of reals, basics of group theory. Looks 
like an interesting course. MZ 


Mathematics Appreciation, T*(13-16: 1). For 
All Practical Purposes: Introduction to Contempo- 
rary Mathematics. COMAP (60 Lowell St., Arling- 
ton, MA 02174), 1988, xii + 450 pp. (ISBN: 0- 
7167-1830-8] An innovative textbook for a survey 
course in mathematics for liberal arts students pre- 
senting mathematics through contemporary applica- 
tions. Written to accompany a telecourse due to 
be shown on public television beginning in January 
1988, this text can also be used alone in a tradi- 
tional lecture course. Treats management science, 
statistics, social choice, size and shape, and comput- 
ers with minimal mathematical prerequisites (just a 
glimmer of first year high school algebra). Features 
full-color graphics, interviews with users of mathe- 
matics, and many stimulating examples of contempo- 
rary mathematics applied to practical problems. JAS 


Education, P, L. Developments in School Math- 
ematics Education Around the World. Ed: Izaak 
Wirszup, Robert Streit. NCTM, 1987, x + 725 
pp, $20 (P). [ISBN: 0-87353-249-X] Proceedings of 
a March 1985 international conference sponsored 
by the University of Chicago Schoo! Mathematics 
Project. One-third of the papers report on math- 
ematics education world wide; others, mostly by 
Americans, deal with instructional strategies, cur- 
riculum design, and technology-supported learning. 
Includes helpful mini-biographies of all authors. LAS 


History, P, L**. The Rhind Mathematical Pa- 
pyrus: An Ancient Egyptian Text. Gay Robins, 
Charles Shute. British Museum, 1987, 88 pp, (P). 
(ISBN: 0-7141-0944-4] A complete explication (in 
modern notation) of the Rhind Papyrus, the most 
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important “textbook” of Egyptian arithmetic con- 
taining rules for adding (unit) fractions followed by 
problems on solving equations, squaring the circle, 
calculating volumes and prices. Concludes with full- 
color plates of the entire papyrus which are linked 
to the text by schematic diagrams. Contains mini- 
mal commentary—just enough to make the text clear 
without overpowering it. LAS 


History, S*(15-17), P, L**. Studies in the His- 
tory of Mathematics. Ed: Esther R. Phillips. Stud. 
in Math., V. 26. MAA, 1987, 308 pp. (ISBN: 0- 
88385-128-8] Ten samples of recent research on the 
history of recent mathematics, principally in the pe- 
riod 1870-1950: from Dedekind’s invention of ideals 
through Killing, Brouwer, Bieberbach, etc., to John 
von Neumann’s marriage of mathematics and com- 
puting at the Institute for Advanced Study. Splendid 
essays suitable as background reading by upperclass 
mathematics students. LAS 


Group Theory, S(18), P. Lecture Notes in 
Mathematics-1261: Finite Presentability of S-Arith- 
metic Groups Compact Presentability of Solvable 
Groups. Herbert Abels. Springer-Verlag, 1987, vi 
+ 178 pp, $16.30 (P). [ISBN: 0-387-17975-5] 
Group Theory, T(16-17), S, P, L*. Group The- 
ory. W.R. Scott. Dover, 1987, 479 pp, $10.95 
(P). [ISBN: 0-486-65377-3] Unabridged, corrected re- 
publication of a classic 1964 text. Basic definitions 
and results; abelian groups; p-groups; supersolvable 
groups; free groups; extensions; permutation groups; 
representations; products of subgroups; the multi- 
plicative group of a division ring; topics in infi- 
nite groups (in view of the Feit-Thompson theorem). 
Hundreds of exercises. RB 


Algebra, P. Linear Topologies on a Ring: An 
Overview. Res. Notes in Math. Ser., V. 159. 
Jonathan S. Golan. Longman Scientific & Tech (US 
Distr: Wiley), 1987, 104 pp, $36.95 (P). [ISBN: 0- 
470-20842-2] Study of the structure of the set of 
topologizing filters (those inducing a linear topology) 
of left ideals on a non-commutative ring R. Major 
theme is the interplay between the order structures 
and the algebraic structures. RM 


Algebra, P. Lecture Notes in Mathematics-1229: 
Derivations, Dissipations and Group Actions on C*- 
algebras. Ola Bratteli. Springer-Verlag, 1986, vi 
+ 277 pp, $23.60 (P). [ISBN: 0-387-17199-1] Cov- 
ers theory of derivations, starts at beginning with 
bounded derivations, then non-commutative vector 


fields, and finished with dissipations. MZ 


Calculus, P*, L*. Calculus for a New Century: A 
Pump, Not a Filter. Ed: Lynn Arthur Steen. MAA 
Notes No. 8. MAA, 1988, xiv + 258 pp, $12.50 (P). 
[ISBN: 0-88385-058-3] Expanded proceedings of an 
October 1987 colloquium sponsored by the National 
Academies of Sciences and Engineering intended to 
stimulate national discussion among scientists, math- 
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ematicians, and engineers about revitalizing calcu- 
lus. Contains plenary papers presented at the col- 
loquium, background papers prepared in advance of 
the colloquium, surveys of calculus experiments, re- 
ports from discussion groups, samples of calculus fi- 
nal examinations, and related readings. Dozens of 
excellent yet diverse ideas with little focus except 
discontent with the status quo. A valuable resource 
for would-be calculus reformers; a natural sequel to 
Towards a Lean and Lively Calculus (MAA, 1986, 
TR, March 1987). JAS 


Real Analysis, T(17). Real Analysis. William O. 
Ray. Prentice-Hall, 1988, x + 307 pp. [ISBN: 0-13- 
762386-0] Aimed at beginning graduate students who 
have had a course at the level of Goldberg’s Methods 
of Real Analysis, this book covers the Lebesgue inte- 
gral, Banach spaces, integration over abstract spaces, 
and introductory functional analysis. AWR 


Real Analysis, P. Seminar on Geometric Measure 
Theory. R. Hardt, L. Simon. DMV Seminar, Band 7. 
Birkhauser Boston, 1986, 117 pp, $16.50 (P). [ISBN: 
0-8176-1815-5] Ten lectures, five by each author, in- 
tended to introduce the main ideas of geometric mea- 
sure theory to analysts, given over a period of one 


week at Schloss Mickeln, Dusseldorf. AWR 


Differential Equations, S(16). An Introduction 
to Kalman Filtering With Applications. Kenneth S. 
Miller, Donald M. Leskiw. Robert E Krieger, 1987, 
vi + 113 pp, $16.50. [ISBN: 0-89874-824-0] Start- 
ing with a stochastic differential system (the system 
model) that predicts the state of a system at time ¢, 
we assume that a series of observations are made (the 
measurement model) which can be used to modify 
the theoretic model. The Kalman filter is a recursive 
method that enables one to use all information avail- 
able up to time ¢’ in order to make a best estimate 
at time t* > ¢’. Prerequisite for reading: differential 
equations, some probability, some mechanics. Read- 
able, attractive text. AWR 


Partial Differential Equations, S(17-18), L. 
The Mathematical Theory of Huygens’ Principle. 
Bevan B. Baker, E.T. Copson. Chelsea, 1987, vii + 
193 pp, $16.95. [ISBN: 0-8284-0329-5] Reprint with 
minor changes of the 1950 edition. Huygen’s princi- 
ple is a geometric model of light wave propagation in 
optics. The book deals with the mathematical the- 
ory of Huygens’ principle in optics, its application 
to the theory of diffraction, and the general theory 
of the solution of the partial differential equations 
governing light propagation. AM 


Partial Differential Equations, T(18). Nonlin- 
ear Partial Differential Equations and Free Bound- 
artes, Volume I: Elliptic Equations. J.I. Diaz. Res. 
Notes in Math., V. 106. Pitman, 1985, 323 pp, $44.95 
(P). [ISBN: 0-273-08572-7] A readable introduction 
to the area of nonlinear partial differential equations. 
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Focuses on two methods of solution, energy methods, 
and sub-super solution methods. MZ 


Partial Differential Equations, P. Linear Differ- 
ential Equations of Principal Type. Yu. V. Egorov. 
Transl: Dang Prem Kumar. Contemp. Soviet Math. 
Consultants Bureau, 1986, vii + 301 pp, $85. (ISBN: 
0-306-10992-1] The properties of a differential or 
pseudo-differential operator of principle type are in- 
dependent of low-order terms and determined only 
by the principal symbol. The book presents the top- 
ics necessary for the study of differential equations 
and boundary value problems involving such oper- 
ators. These include distribution theory, pseudo- 
differential operators, wave front sets, Fourier inte- 
gral operators, and results on local solvability of dif- 
ferential equations. AM 


Partial Differential Equations, T*(16-17: 1, 
2), L. Introduction to Partial Differential Equations 
and Hilbert Space Methods, Second Edition. Karl E. 
Gustafson. Wiley, 1987, xix + 409 pp, $46. [ISBN: 0- 
471-83227-8] Enlarged edition features an expanded 
treatment of first-order systems, an introduction to 
computational methods, and some aspects of topical 
research. At earlier reviewers’ behest some straight- 
forward exercises have been added along with four 
novel, several-page “Pauses” designed to give read- 
ers a chance to recollect their thoughts. “Pauses” 
are intended to be “helpful and hopefully interest- 
ing auxiliary material to aid in practice and under- 
standing.” Considerably expanded Answers Section. 
(First Edition, TR, February 1981.) JK 


Numerical Analysis, P. Tables for Lagrangian 
Interpolation Using Chebyshev Points. Herbert E. 
Salzer, Norman Levine, Saul Serben. Applied Sci- 
ence, 1984, $28 (P). [ISBN: 0-915061-00-7] Two ta- 
bles for use in n-point Lagrangian polynomial inter- 
polation of a function tabulated at the Chebyshev 
points. Table I lists the Chebyshev points z,,; for 
every n, and s,,; = sin[(2i — 1)a/2n] for odd n, 
for n = 2(1)25(5)50(10)100, to twenty-five significant 
figures. Table II lists the interpolation coefficients for 
n = 20,2 = —1(.001)1, to twenty significant figures. 
Introductory text includes a discussion of interpola- 
tion and use of the tables and references. RH 


Numerical Analysis, C(14-17), L. Nonlinear 
Parameter Estimation: An Integrated System in BA- 
SIC. MSDOS. John C. Nash, Mary Walker-Smith. 
Stat.: Textbooks & Mono., V. 82. Dekker, 1987, xiii 
+ 493 pp, $65. [ISBN: 0-8247-7819-7] A collection of 
software tools in Basic (on an MSDOS disk inside the 
cover and reproduced in the text) to estimate non- 
linear parameters in mathematical models arising in 
diverse applications. Focuses on methods minimiz- 
ing loss functions that operate well on small com- 
puters. Text provides commentary on the methods 
and illustrative applications, but little on algorithms 
or theory. Extensive bibliography (also available on 
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disk) provides references to literature on the algo- 


rithms. LAS 


Operator Theory, T(18: 1), S, P. Spectral The- 
ory of Linear Differential Operators and Compari- 
son Algebras. H.O. Cordes. Math. Soc. Lect. Note 
Ser., V. 76. Cambridge U Pr, 1987, ix + 342 pp, 
$29.95. [ISBN: 0-521-28443-0] First four chapters 
supply necessary background in spectral theory and 
second-order elliptic differential operators followed 
by six chapters treating various aspects and applica- 
tions of comparison algebras, defined as C*-algebras 
of singular integral operators with special properties. 
References, index. JS 


Functional Analysis, $(18), P, L. Theory of Lin- 
ear Operations. S. Banach. Transl: F. Jellett. Math. 
Lib., V. 38. Elsevier Science, 1987, x + 237 pp, 
$75.50. (ISBN: 0-444-70184-2] An English transla- 
tion of a classic work, supplemented by fifty pages 
on “Some Aspects of the Present Theory” by Pel- 
czynski and Bessaga. Extensive bibliography. JS 


Functional Analysis, P*. Introduction to Various 
Aspects of Degree Theory in Banach Spaces. E.H. 
Rothe. Math. Surv. & Mono., No. 23. AMS, 
1986, vi + 242 pp, $60. [ISBN: 0-8218-1522-9] A 
nice survey of different topics in degree theory, such 
as Leray-Schauder degree, Poincaré-Boh] theorem, 
product theorem, finishing with the linear homotopy 
theorem. MZ 


Functional Analysis, S(18), P. Semigroups of 
Linear Operators: An Introduction. A.C. McBride. 
Res. Notes in Math. Ser., V. 156. Longman Scien- 
tific & Tech (US Distr: Wiley), 1987, 134 pp, $37.95 
(P). [ISBN: 0-470-20824-4] Lecture notes intended 
to serve as an introduction to the subject, develop- 
ing the theory as far as existence and uniqueness for 
Cauchy problems. Applications, references. JS 


Functional Analysis, P. FHigenvalues and s- 
Numbers. Albrecht Pietsch. Stud. in Adv. Math., 
V. 13. Cambridge U Pr, 1987, 360 pp, $59.50. [ISBN: 
0-521-32532-3] Developed on a foundation of abstract 
operators on Banach spaces, the author identifies as 
the most important result of the monograph a gen- 
eralization of Weyl’s theorem identifying an opera- 
tor ideal as being of an optimum eigenvalue type. 
He then builds a theory of traces, determinants, and 
other familiar concepts for operators. AWR 


Analysis, P. Generalized Integral Transformations. 
A.H. Zemanian. Dover, 1987, xvi + 300 pp, $7.95 
(P). [ISBN: 0-486-65375-7] The subject matter deals 
with generalizations following the lead of Schwartz’s 
work with generalized functions, applied to such clas- 
sic integral transformations as Fourier, Laplace, and 
Hilbert. Assumes familiarity with advanced calculus, 
some Lebesgue integration, and some complex vari- 
able theory. (1968 Interscience edition, TR, June- 
July 1969.) AWR 
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Analysis, P. Operational Calculus Based on the 
Two-Sided Laplace Integral, Third Edition. Balth. 
van der Pol, H. Bremmer. Chelsea, 1987, xiii + 415 
pp, $24.95. [ISBN: 0-8284-0327-9] Intended for appli- 
cation of operational calculus in its modern form to 
problems in mathematics, physics, and engineering. 
Originally published in 1950. LC 


Analysis, T(17-18: 2). Measure Theory and In- 
tegration. M.M. Rao. Wiley, 1987, xii + 540 pp, 
$59.95. [ISBN: 0-471-82822-X] An attractive new 
entry in the field of texts for the one-year graduate 
school real variable course. Author gives explanation 
as to how his book differs from other standard texts, 
is thoroughly up-to-date, includes good exercises. By 
comparison, it is a good value for the price. AWR 


Analysis, P. Lecture Notes in Mathematics-1225: 
Inverse Problems. Ed: G. Talenti. Springer-Verlag, 
1986, vii + 204 pp, $19.40 (P). [ISBN: 0-387-17193- 
2] Texts of lectures and seminars at a CIME ses- 
sion on inverse problems. Focuses on ill-posed and 
inverse problems, covering inverse eigenvalue prob- 
lems, Fredholm integral equations of first kind and 
regularization methods, problems motivated by med- 
ical imaging, and others. MZ 


Analysis, P. Jordan Algebras in Analysis, Opera- 
tor Theory, and Quantum Mechanics. Harald Up- 
meier. CBMS Reg. Conf. Ser. in Math., No. 
67. AMS, 1987, vii + 85 pp, $13 (P). [ISBN: 0-8218- 
0717-X] Lectures on Jordan algebras and their appli- 
cations to quantum mechanics, dynamical systems, 
kernel functions and harmonic analysis, Hua opera- 
tors, Toeplitz operators, and Toeplitz C*-algebras. 
Gives good overview of the topics. MZ 


Analysis. Lecture Notes in Mathematics-1259: 
Desingularization Strategies for Three-Dimensional 
Vector Fields. Felipe Cano Torres. Springer-Verlag, 
1987, ix + 194 pp, $16.30 (P). [ISBN: 0-387-17944-5] 
Written with the well-initiated expert in mind, the 
forbidding jargon and notation is complicated by an 
awkwardness in use of the English language. AWR 


Differential Geometry, P. Lie Groupoids and Lie 
Algebroids in Differential Geometry. K. Mackenzie. 
London Math. Soc. Lect. Note Ser., V. 124. Cam- 
bridge U Pr, 1987, xv + 327 pp, $34.50 (P). [ISBN: 
0-521-34882-X] A groupoid is an object which gen- 
eralizes the algebraic structure of a principal fiber 
bundle. This book develops the theory of topolog- 
ical groupoids, Lie groupoids, and algebroids, then 
uses this theory to discuss the cohomology of Lie al- 
gebroids and criteria for the existence of connections 
with a prescribed curvature form. Contains work 
which is published here for the first time. AM 


Differential Geometry, P. Lecture Notes in 
Mathematics-1255: Differential Geometry and Dif- 
ferential Equations. Ed: Gu Chaohao, M. Berger, 
R.L. Bryant. Springer-Verlag, 1987, xii + 243 pp, 
$26.80 (P). [ISBN: 0-387-17849-X] Proceedings of the 
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sixth symposium on differential geometry and differ- 
ential equations held from June 21 to July 6, 1985 at 
Fudan University, Shanghai, China. The book con- 
tains texts of invited lectures and papers presented at 
the symposium, many of which are neither published 
nor due to be published elsewhere. AM 


Differential Geometry, T(17-18: 1), P. Differ- 
ential Systems and Isometric Embeddings. Phillip A. 
Griffiths, Gary R. Jensen. Annals of Math. Stud., 
No. 114. Princeton U Pr, 1987, xii + 225 pp, $35; 
$15 (P). (ISBN: 0-691-08429-7; 0-691-08430-0] Illus- 
trates how the theory of characteristic varieties for 
exterior differential systems may be used to study 
local isometric embeddings. Part of the book’s goal 
is to discuss the notions of involutivity and prolon- 
gation through a careful study of the differential 
exterior system associated to the isometric embed- 
ding problem. This study also discusses the essen- 
tial pointwise algebraic conditions necessary for these 
embeddings and applies the theory to the problem of 
embedding Cauchy-Riemann structures. AM 


Geometry, P. Analytical and Geometric Aspects 
of Hyperbolic Space. Ed: D.B.A. Epstein. Lon- 
don Math. Soc. Lect. Note Ser., V. 111. Cam- 
bridge U Pr, 1987, 323 pp, $29.95 (P). [ISBN: 0-521- 
33906-5]; Low Dimensional Topology and Kleinian 
Groups. Ed: D.B.A. Epstein. London Math. Soc. 
Lect. Note Ser., V. 112. Cambridge U Pr, 1986, 321 
pp, $34.50 (P). [ISBN: 0-521-33905-7] Proceedings 
of two 1984 symposia at the Universities of Warwick 
and Durham, UK, which gathered top researchers 
in hyperbolic geometry, Kleinian groups, and three- 
dimensional topology. JK 


Operations Research, S(16-17), P. Linear Pro- 
gramming in Infinite-Dimensional Spaces: Theory 
and Applications. Edward J. Anderson, Peter Nash. 
Ser. in Discrete Math. & Optimiz. Wiley, 1987, xi 
+ 172 pp, $44.95. [ISBN: 0-471-91250-6] Intended 
to be a comprehensive introduction to linear pro- 
gramming problems posed over infinite-dimensional 
vector spaces which occur in approximation theory, 
optimal control theory, etc. Specific examples from 
a variety of fields are discussed; attention is given 
to algorithms but not numerical analysis. Directed 
to graduate students and research workers in mathe- 
matical programming and operations research. AWR 


Operations Research, P. Mathematical Modelling 
Courses. Ed: J.S. Berry, et al. Math. & Its Applic. 
Halsted Pr, 1987, 281 pp, $95. [ISBN: 0-470-20836-8] 
Directed at those who teach, or hope to teach a mod- 
elling course at the undergraduate level. Twenty-five 
articles, all by different authors, tell what has and has 
not worked in a variety of schools and in a variety 
of courses. Useful resource, but a high price for 280 
pages. AWR 


Optimization, P. Lecture Notes in Control and 
Information Sciences-93: Stability of Solutions to 
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Convez Problems of Optimization. K. Malanowski. 
Springer-Verlag, 1987, ix + 134 pp, $20.60 (P). 
[ISBN: 0-387-17589-X] Deals with both global sta- 
bility (allows the whole domain of values of parame- 
ters) and differential stability (only local changes of 
parameters), but is restricted to problems that are 
strongly convex, subject to point-wise constraints, 
satisfying linear independence conditions. Strictly 
research level. AWR 


Optimization, S(16-17), P. Introduction to Opti- 
mization. Boris T. Polyak. Transl. Ser. in Math. & 
Engin. Optimization Software, 1987, xxvi + 438 pp, 
$75. [ISBN: 0-911575-14-6] A comprehensive look 
at optimization of functions of one or more vari- 
ables: unconstrained, both differentiable and non- 
differentiable; constrained, both linear and nonlin- 
ear. Addresses mathematicians and computer scien- 
tists, attempting to give mathematical insight with- 
out rigorous proof, explanations of how things work 
without algorithmic details. The introduction itself 
gives a grand overview. AWR 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-98: Stochastic Adaptive Con- 
trol Results and Simulations. A. Aloneftis. Springer- 
Verlag, 1987, xii + 120 pp, $20.60 (P). [ISBN: 0- 
387-18055-9] Study of adaptive control theory (esti- 
mators used to produce information that, if known, 
would enable proper feedback) for single-input time 
systems. Emphasis on Auto-Regressive Moving Av- 
erage models with exogenous input for methods of 
self-timing parameter control. Extensive computer 
simulations compare theory with practice. RM 


Systems Theory, T(16-17: 1), P, L? Linear Dy- 
namical Systems. John L. Casti. Math. in Sci. & 
Eng., V. 135. Academic Pr, 1987, xv + 351 pp, 
$59.95. [ISBN: 0-12-163451-5] Revision of Dynam- 
ical Systems and Their Applications: Linear The- 
ory, 1977 (TR, December 1978). Foundations of lin- 
ear systems treated mathematically but with moti- 
vational examples. Discussion of system dynamics, 
controllability, observability, realization, chapters on 
theoretical treatment via structure theorems, canon- 
ical forms, geometric-algebraic view, and infinite di- 
mensional systems. RM 


Probability, P. Fuzzy Sets Theory and Applica- 
tions. Ed: André Jones, Arnold Kaufmann, Hans- 
Jirgen Zimmermann. NATO ASI Ser. C, V. 177. D 
Reidel, 1986, xii + 403 pp, $69.50. [ISBN: 90-277- 
2262-5] Proceedings of the NATO Advanced Study 
Institute on fuzzy sets theory and their applications. 
Topics include Radon-Nikodym theorem for fuzzy-set 
valued measures, decisions with usual values, medical 
applications with fuzzy sets. MZ 

Probability, T(16: 1), S*, P**, L. Statistical 
Independence in Probability, Analysis and Number 
Theory. Mark Kac. Carus Math. Mono., No. 
12. MAA, 1959, xiv + 95 pp, (P). [ISBN: 0-88385- 
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025-7] A paperback reprint of Kac’s very popular 
1959 Carus monograph that uses techniques of clas- 
sical analysis to link random numbers, the normal 
law, statistical independence, prime numbers, kinetic 
theory, continued fractions, and ergodic phenomena. 
Kac’s style—deep, elegant, and crisp—reveals clues 
to the profound unity of mathematics. A marvelous 
source for a senior seminar. LAS 


Stochastic Processes, P. Lectures on Stochastic 
Flows and Applications. H. Kunita. Springer-Verlag, 
1986, 118 pp, DM 20 (P). [ISBN: 0-387-12878-6] 
Brownian flows; stochastic differential equations; ap- 
proximations and limit theorems. Photocopied on 
grey paper. TAV 


Stochastic Processes, P. The Malliavin Calcu- 
lus. Denis R. Bell. Mono. & Surv. in Pure & 
Appl. Math., V. 34. Longman Scientific & Tech (US 
Distr: Wiley), 1987, x + 105 pp, $62.95. [{ISBN: 
0-470-20749-3] An introduction to the subject de- 
veloped first by Paul Malliavin in 1976. Ideas are 
drawn from probability theory and functional analy- 
sis, creating a tool for developing probabilistic proofs. 
Gives accounts of approaches of both Stroock and 
Bismut. AWR 


Stochastic Processes, P*. Eztreme Values, Reg- 
ular Variation, and Point Processes. Sidney I. 
Resnick. Appl. Prob., V. 4. Springer-Verlag, 1987, 
xil + 320 pp, $71. [ISBN: 0-387-96481-9] The topic 
of extreme values and records is of great current in- 
terest. This book lays careful ground work for the 
point process approach. Well written, with numer- 
ous exercises and a substantial bibliography. TAV 


Stochastic Processes, P. Seminar on Stochastic 
Processes 1985. Ed: E. Cinlar, K.L. Chung, R.K. 
Getoor. Prog. in Prob. & Stat., V. 12. Birkhauser 
Boston, 1986, 324 pp, $29. [ISBN: 0-8176-3331-6] 
The proceedings of a seminar held in Gainsburg, 
Florida in March 1985. Seventeen papers on a wide 
variety of subjects in modern stochastic processes re- 
search. TAV 


Statistics, P. Lecture Notes in Statistics-42: Top- 
ics in Statistical Information Theory. S. Kullback, 
J.C. Keegel, J.H. Kullback. Springer-Verlag, 1987, 
ix +158 pp, $18.90 (P). [ISBN: 0-387-96512-2] In- 
formation theoretic methods are used to study the 
representation of discrimination information, in par- 
ticular linear discriminant functionals and informa- 
tional properties of sub-sigma algebras of the funda- 
mental probability space are used to study stochastic 
processes and random functions. RM 


Statistics, P. Optimal Paired Comparison De- 
signs for Factorial Ezperiments. E.E.M. van 
Berkum. CWI Tract, V. 31. Math Centrum, 1987, 
ii + 153 pp, DA. 24.20 (P). [ISBN: 90-6196-311-7] 
In paired comparison experiments, observations are 
made by presenting pairs of objects to one or more 
judges. In a round robin, all pairs are presented to 
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each of the judges. If there are n judges and t objects, 
then (7) comparisons must be made. The aim of 
this book is to construct designs that are more effi- 
cient than the round-robin design. LC 


Programming, T(14-15: 1), S, P, L. Soft- 
ware Engineering with Ada, Second Edition. Grady 
Booch. Ser. in Ada and Software Engin. Ben- 
jamin/Cummings, 1987, xx + 580 pp, $26.95 (P). 
[ISBN: 0-8053-0604-8] A textbook on Ada as a tool 
enabling and promoting sound software engineering 
principles (careful design, structured coding, soft- 
ware life-cycle management), by a teacher of De- 
partment of Defense Ada short courses. Stress 
on good examples, object-oriented development 
method. Top-down organization allows brief intro- 
duction or intensive study. New edition: new design 
problems; more complete programs, exercises; chap- 
ter summaries. RB 


Programming, P. Advanced Turbo Pascal. Her- 
bert Schildt. Osborne/McGraw-Hill, 1987, xii + 331 
pp, $21.95 (P). [ISBN: 0-07-881283-6] An advanced 
text on programming that assumes that the reader 
is quite familiar with the Pascal language in gen- 
eral and the Turbo Pascal compiler in particular. 
The topics covered include advanced data structures 
(stacks, queues, linked lists), statistical processes, 
parsing, and use of the Turbo Pascal toolbox. (1986 
McGraw-Hill edition, TR, January 1987.) MS 


Languages, T(15: 1). Program Translation Fun- 
damentals: Methods and Issues. Peter Calingaert. 
Software Eng. Ser., V. 22. Computer Science Pr, 
1988, xvii + 366 pp, $36.95. [ISBN: 0-88175-096-4] 
An introductory treatment of programs that trans- 
late other programs. This includes such software as 
compilers, assemblers, macro processors, and inter- 
preters. It covers all the major phases of translation, 
including syntactic analysis, parsing, code genera- 
tion, and optimization. It would be appropriate for a 
one-semester course at the advanced undergraduate 
or graduate level. MS 


Algorithms, T(16-18: 1), P, L. Probabilistic 
Analysis of Algorithms: On Computing Methodolo- 
gies for Computer Algorithms Performance Evalu- 
ation. Micha Hofri. Texts & Mono. in Com- 
put. Sci. Springer-Verlag, 1987, xxv + 240 pp, 
$25. [ISBN: 0-387-96578-5] Mathematical treatment 
of algorithm complexity analysis assuming probabil- 
ity distributions on the input data. Emphasis on an- 
alytic, probabilistic aspects rather than development 
of algorithms. Asymptotic analysis, combinatorics 
used as tools; considers algorithms on permutations 
(e.g., sorting), communication networks, bin packing 
heuristics. RM 

Computer Systems, S(14-16), P, L. Real-Time 
Software Design: A Guide for Microprocessor Sys- 
tems. Philip Heller. Birkhauser Boston, 1986, xii 
+ 116 pp, $19.95 (P). [ISBN: 0-8176-3201-8] A brief 
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presentation of design and implementation concepts 
for small real-time systems, for the novice with pro- 
gramming experience. Introduction to Intel 8086 as- 
sembly language, graphical design languages (e.g., 
structure charts, state diagrams), implementation of 
data flow diagrams using circular queues and multi- 
ple process model, detailed examples (control of in- 
telligent keyboard), optimization maxims. RB 


Computer Systems, P. Lecture Notes in Com- 
puter Sctence-254 & 255: Petri Nets: Central Models 
and Their Properties. Ed: W. Brauer, W. Reisig, G. 
Rozenberg. Springer-Verlag, 1987, (P). Part I, x + 
480 pp, $34.60 [ISBN: 0-387-17905-4]; Part II, x + 
516 pp, $37.70. [ISBN: 0-387-17906-2] Petri nets and 
related elementary net systems are used to model 
distributed systems and processes. These volumes 
contain papers based on lectures given at a Septem- 
ber 1986 advanced course in Bad Honnef, Germany, 
including a prologue and an epilogue by Carl Adam 
Petri. Includes basic theory of nets, diverse appli- 
cations, and relations to other models of concurrent 
processes. LAS 


Computer Systems, P*. Managing Program- 
ming People: A Personal View. Philip W. Metzger. 
Prentice-Hall, 1987, xxv + 157 pp, $20.33. [ISBN: 0- 
13-551094-5] A memorable essay on the management 
of people in the context of a programming project’s 
“life cycle;” companion to the author’s previous book 
Managing a Programming Project. Humorously writ- 
ten and easily browsed, this brief book offers experi- 
enced practical advice and food for thought to man- 
agers, particularly to data processing and software 
engineering professionals. RB 


Computer Systems, P. Computer Capacity Plan- 
ning: Theory and Practice. Shui F. Lam, K. Hung 
Chan. Academic Pr, 1987, xix + 215 pp, $24.95. 
[ISBN: 0-12-434430-5] The objective of computer 
capacity planning is to match computer resources 
to computer workload in a cost-effective manner. 
This monograph introduces fundamental concepts 
and evaluates various approaches and techniques in 
the field, then reports on results of the authors’ re- 
search study which surveyed computer capacity plan- 
ning experience in various settings. RB 


Computer Graphics, P. Computer Graphics 1987: 
Proceedings of CG International ‘87. Ed: Tosiyasu 
L. Kunii. Springer-Verlag, 1987, xi + 490 pp, 
$113.30. (ISBN: 0-387-70022-6] Proceedings of the 
fifth international conference held in Japan. Thirty 
papers give a survey of the state-of-the-art in graph- 
ics, including fast algorithms, rendering techniques, 
hardware, computational geometry, image genera- 
tion, interfaces, hierarchical modeling. RM 


Theory of Computation, P. Lecture Notes in 
Computer Science-267: Automata, Languages and 
Programming. Ed: Thomas Ottmann. Springer- 
Verlag, 1987, x + 565 pp, $41.10 (P). [ISBN: 0-387- 
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18088-5] A collection of 46 papers delivered at the 
Fourteenth International Conference on Automata, 
Languages, and Programming held at the Univer- 
sity of Karlsruhe in July 1987. The papers cover 
the following theoretical topics: logic, functional pro- 
gramming, semantics, concurrency, automata theory, 
complexity theory, and Petri nets. MS 


Artificial Intelligence, P. Pattern Recognition 
Theory and Applications. Ed: Pierre A. Devijver, 
Josef Kittler. NATO ASI Ser. F, V. 30. Springer- 
Verlag, 1987, xi + 543 pp, $100. [ISBN: 0-387-17700- 
0] Tutorial and advanced papers from NATO Ad- 
vanced Study Institute in Belgium, June 1986. Ma- 
jor themes are clustering and relaxation techniques, 
Markovian and connectionist models, graph theory 
and geometry, structural methods, hybrid methods 
and fuzzy sets, knowledge-based techniques, machine 
vision and image processing. RK 


Computer Science, T(18), P. Stochastic Analysis 
of Computer Storage. O.I. Aven, E.G. Coffman, Jr., 
Y.A. Kogan. Math. & Its Applic. Kluwer Aca- 
demic, 1987, viii + 254 pp, $89. (ISBN: 90-277- 
2515-2] A monograph on mathematical analysis of 
standard problems in computer memory allocation, 
including fragmentation, page replacement policies, 
etc., by an American and two Russians. The authors 
adopt a stochastic statistical model (“sequences of 
‘arrivals’ and ‘departures’ ”) for analysis, shunning 
combinatorial models. Buffer storage; primary com- 
puter storage; paged two-level memories; secondary 
storage. No exercises. RB 


Computer Science, P. Lecture Notes in Com- 
puter Science-248: Networking in Open Systems. Ed: 
Gunter Muller, Robert P. Blanc. Springer-Verlag, 
1987, vi + 441 pp, $30.60 (P). [ISBN: 0-387-17707- 
8] Proceedings of a one-week seminar, part of IBM 
Europe Institute at Oberlech, Austria, August 1986. 
26 papers on issues concerning open networking 
standards, some research topics: broad standards 
issues; emerging end-user applications; program- 
ming tools for creating such applications; operating 
system-communication system interaction; specifica- 
tion, testing. Open standards vs. proprietary sys- 
tems (such as IBM’s). RB 


Computer Science, P. Lecture Notes in Computer 
Science-246: Graph- Theoretic Concepts in Computer 
Science. Ed: Gottfried Tinhofer, Gunther Schmidt. 
Springer-Verlag, 1987, vii + 306 pp, $25 (P). [ISBN: 
0-387-17218-1] Proceedings of the twelfth annual in- 
ternational workshop on the topic, held in a monas- 
tary (Bernreid) near Munich, June 1986. Twenty- 
two papers on specific algorithms, graph gram- 
mars, graph manipulation, nets, complexity issues, 
algorithmic and network considerations, outerpla- 
nar graphs, graph isomorphism, parallelism and dis- 
tributed systems, graphs and geometry, randomness 
consideration, applications in chemistry. RB 
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Applications (Engineering), P. Lecture Notes 
in Engineering-14: The Boundary Integral Equation 
Method in Azisymmetric Stress Analysis Problems. 
A.A. Bakr. Springer-Verlag, 1986, xi + 213 pp, 
$18 (P). (ISBN: 0-387-16030-2] On the boundary el- 
ement method, an efficient and accurate numerical 
technique and its application to axisymmetric en- 
gineering problems. Compares solutions with those 
obtained analytically or by finite element methods. 
Ties with problems in elasticity, thermoelasticity, 
centrifugal, and fracture mechanics. Over 100 ref- 
erences to recent related work. Many practical ex- 
amples with numerical data and graphs. Authorita- 
tively written. JK 


Applications (Management), P, L. Information 
Systems Education: Recommendations and Imple- 
mentation. Ed: Richard A. Buckingham, et al. 
British Comput. Soc. Mono. in Informatice. Cam- 
bridge U Pr, 1987, x + 256 pp, $39.50 (P). [ISBN: 0- 
521-31234-5] A collection of papers presenting a ma- 
jor British revision of the decade-old IFIP curriculum 
recommendations for majors in information systems 
(or what in the United States is often called MIS: 
management information science). Half the papers 
discuss curriculum and course outlines; the other half 
describe teaching methods, social factors, and vari- 
ous concrete case studies. LAS 


Applications (Physics), S*(13-15), L. Space, 
Time and Gravitation: An Outline of the General 
Relativity Theory. Arthur Eddington. Cambridge U 
Pr, 1987, xii + 218 pp, $14.95 (P). [ISBN: 0-521- 
33709-7|] Reprint of a 1920 classic, one of the most 
popular science books ever written. A masterpiece 
of vigorous writing, Eddington’s exposition reveals 
the revolutionary insights of Einstein’s theory of rel- 
ativity without using any technical mathematics or 
physics. That’s one factor in its abiding popular- 
ity. LAS 
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CALCULUS WITH ANALYTIC 
GEOMETRY 

Alternate Edition 

ROBERT ELLIS and DENNY GULICK 
Carefully revised ordering of topics of this 
popular text. 


CALCULUS 

Fourth Edition 

STANLEY I. GROSSMAN 

A thoroughly revised and streamlined text with many 
modern features. 


CALCULUS FOR THE 
MANAGEMENT, LIFE, AND SOCIAL 
SCIENCES 

Second Edition 

BERNARD KOLMAN and CHARLES 
DENLINGER 

For one-term, non-major calculus course. Careful 
blend of theory and applications. 


APPLIED CALCULUS 

BERNARD KOLMAN and CHARLES 
DENLINGER 

Comprehensive coverage in an expanded version for a 
two-term course. 


A COURSE IN LINEAR ALGEBRA 
DAVID DAMIANO and JOHN LITTLE 
A clear, readable introduction with an emphasis on 
theory of vector spaces. 


LINEAR ALGEBRA AND ITS 
APPLICATIONS 

Third Edition 

GILBERT STRANG 

A best-selling text in an up-to-date revision with a 
modern, computational treatment. 


AN INTRODUCTION TO LINEAR 
ALGEBRA WITH APPLICATIONS 
Second Edition 

STEVEN ROMAN 


Presents a careful balance between abstract, 
computational, and applied ideas. 


PREREQUISITE ALGEBRA 
CHARLES P. McKEAGUE 


Prepares students for precalculus, finite math, 
statistics, and brief calculus courses. 


ALGEBRA WITH TRIGONOMETRY 
FOR COLLEGE STUDENTS 

Second Edition 

CHARLES P. McCKEAGUE 


For students who need an introduction to 
trigonometry for future college work. 


TRIGONOMETRY 
Second Edition 
CHARLES P.- McKEAGUE 


A right-triangle approach, with strong emphasis on 
trigonometric identities. 


COLLEGE ALGEBRA 

and 

COLLEGE ALGEBRA AND 
TRIGONOMETRY 

ADELBERT HACKERT and GENE 
SELLERS 


Thorough explanations, carefully constructed problem 
sets, and many pedagogical features. 


MASTERING INTERMEDIATE 
ALGEBRA 

YOSHIKO YAMATO and MARY JANE 
CORDON 


Text/workbook with numerous step-by step examples. 


HARCOURT BRACE JOVANOVICH, INC. 
College Sales Office 
7555 Caldwell Avenue 


Chicago, IL 60648 


COLLEGE@RPUBLISHING 


SAUNDERS 


YOU CAN COUNT ON 
COLLEGE 
PUBLISHING 

FOR SUPERIOR 
MATH TEXTBOOKS! 


CALCULUS, 2/E 


Dennis D. Berkey, Boston University 


A new edition of this popular text which combines lucid 
development of concepts with a strong emphasis on prob- 
lem solving, using Berkey’s unique strategy/solution for- 
mat. The text features both formal and informal treatment 
of limit, and separate chapters on the antiderivative and 
definite integral. The Second Edition of Calculus includes 
many new examples and problems containing applications 
from the sciences and engineering. It continues to provide 
everything you'll need for the traditional three-semester 
course for students of mathematics, science, and engineer- 
ing science. 


ISBN 0-03-008899-2 1988 1,141 pp. (approx.) 


DIFFERENTIAL EQUATIONS: 
A FIRST COURSE, 2/E 


Martin M. Guterman and Zbigniew H. Nitecki, both of Tufts 
University 


An introduction to differential equations for students of 
mathematics and engineering. The Second Edition features 
new material on qualitative theory, numerical approxima- 
tion, and power series in computers. It also examines 
existence and uniqueness theorem and stability. 


ISBN 0-03-009617-0 1988 752 pp. (approx.) 


STATISTICS AND 
PROBABILITY IN 
MODERN LIFE, 4/E 


Joseph Newmark, The College of Staten 
Island, CUNY 


Provides an introduction to the basic 
concepts of statistics and probability for 
students in all academic areas. Applica- 
tions are drawn from areas as diverse 
as sociology, business, ecology, and 
medicine. 


ISBN 0-03-008367-2 1988 704 pp. 


(approx.) 


TECHNICAL 
MATHEMATICS, 4/E 


Jacqueline Austin and Margarita Isern, 
both of Miami-Dade Community College 


Intended for students with varying 
mathematical backgrounds, this text 
helps develop problem-solving ability 
through the use of models and exam- 
ples featuring a step-by-step approach. 


ISBN 0-03-013233-9 1988 736 pp. 
(approx.) 


APPLIED CALCULUS 


Dennis Berkey, Boston University 


Designed for a two-semester course for 
students of business, biology, and the 
social sciences. 


ISBN 0-03-070733-7 1987 662 pp. 


INTERMEDIATE 
ALGEBRA, 2/E 


Jack Barker, James Rogers, and James Van 
Dyke, all of Portland Community College 


Intermediate algebra with a focus on 
manipulative skills and their applica- 
tions. Taking an adult approach, the 
authors present concise explanations 
written with the student in mind. 


ISBN 0-03-007068-6 1987 868pp. 


ARITHMETIC, 4/E 


Jack Barker, James Rogers, and James Van 
Dyke, all of Portland Community College 


A thorough presentation of arithmetic 
with an emphasis on manipulative skills 
and their applications. Students are 
provided insight into the problem-solv- 
ing procedures and presented with an 
opportunity to discover the arithmetic 
concept. 


ISBN 0-03-007033-3 1987 648 pp. 


BASIC ALGEBRA, 2/E 


Jack Barker, James Rogers, and James Van 
Dyke, all of Portland Community College 


Awork text on elementary algebra witha 
special approach that is flexible enough 
to be adaptable for both the classroom 
lecture format and individualized in- 
struction. 


ISBN 0-03-007052X 1987 750 pp. 


SAUNDERS COLLEGE PUBLISHING 
111 Fifth Avenue, New York, N.Y. 10003 


$37-8AMM-2 


STATISTICS SOFTWARE 
for STUDENTS 


TRUE EPISTAT 


Do you spend more time teaching software than teaching statistics? 
Do your students have to learn a complicated computer language before they can 
even begin to learn the value of statistics? 


For students of biology, botany, dentistry, education, epidemiology, genetics, 
marketing, medicine, nursing, psychology, quality control, sociology, veterinary sci- 
ence, zoology, and many others ... 


The best software when learning statistics is TRUE EPISTAT. 


TRUE EPISTAT is accurate, thorough, well-documented, well-organized, 
but most of all, easy-to-use. Just look at these features! 


Menu-driven 
Over 25 Graphs 
Over 115 Procedures 
Clear, concise Manual 
Example Data for every Test 
CGA, EGA and Hercules Graphics 
8087 /80287 math coprocessor Accuracy 
References/formulas for every Procedure 
Easy to Transfer Data to/from other Software 


SPECIAL SITE LICENSING AGREEMENTS 
for COLLEGES and UNIVERSITIES 


Annual site licenses for computer labs from 5 to 100 computers are available 
for as little as $25 per computer! 


TRUE EPISTAT runs on all IBM‘™ PC, XT, AT and System II microcom- 


puters, and most compatibles. Write or call today for more information: 


EPISTAT SERVICES 
2011 Cap Rock Circle 
Richardson, TX 75080 
(214) 680-1376 


Move through the maze to 


NEW FOR 1988 


ARITHMETIC 
Martha M. Wood, DeKalb Community College 
Peggy Capell, Clayton Junior College 


ALGEBRA FOR COLLEGE STUDENTS 
James W. Hall, Parkland College 


INTERMEDIATE ALGEBRA, FOURTH 
EDITION 
Alfonse Gobran, Los Angeles Harbor College 


INTRODUCTION TO DISCRETE 
MATHEMATICS 


Steven C. Althoen, University of Michigan - Flint 
Robert J. Bumcrot, Hofstra University 


TRIGONOMETRY 
Jerome E. Kaufmann, Western Illinois University 


PRECALCULUS 
Jerome E. Kaufmann, Western Illinois University 


COLLEGE MATHEMATICS, SECOND 
EDITION 
S.T. Tan, Stonehill College 


MATHEMATICAL ANALYSIS FOR 
BUSINESS AND ECONOMICS, SECOND 
EDITION 

Charles W. Schelin, University of Wisconsin - 

La Crosse 

David Bange, University of Wisconsin - La Crosse 


FOUNDATIONS OF HIGHER 
MATHEMATICS 

Peter Fletcher/ C. Wayne Patty 

Both from Virginia Polytechnic Institute and 
State University 


The Prindle, Weber & Schmidt 
Series in Mathematics 


ELEMENTS OF MODERN ALGEBRA, 
SECOND EDITION 

Jimmie Gilbert, University of South 
Carolina - Spartanburg 

Linda Gilbert, University of South Carolina - 
Spartanburg 


INTRODUCTION TO REAL ANALYSIS 
James Kirkwood, Sweet Briar College 


NEW CALCULUS 
TEXTS FOR 1988 


CALCULUS WITH ANALYTIC 
GEOMETRY, SECOND ALTERNATE 
EDITION 

Earl W. Swokowski, Marquette University 


CALCULUS WITH ANALYTIC 
GEOMETRY, FOURTH EDITION 
Earl W. Swokowski, Marquette University 


CALCULUS WITH ANALYTIC 
GEOMETRY, SECOND EDITION 
Dennis G. Zill, Loyola Marymount University 


The Prindle, Weber & Schmidt Series in 
Mathematics 


F< PWS-KENT Publishing Company 
20 Park Plaza 

, Boston, MA 02116 

ad A Division of Wadsworth, Inc. 


Partners in Education 
1-800-343-2204 (in Mass. 617-542-3377) 


MACSYMA 


automates symbolic mathematics. 
And yields enormous improvements in 
Speed, accuracy and modeling power. 


If you work with quantitative models 
in scientific or engineering dis- 
ciplines, MACSYMA can increase 
your modeling power. MACSYMA 
combines symbolic and numeric 
computation. And enables you to 
accurately manipulate symbolic 
expressions in a fraction of the time 
required manually. 


Wide range of capabilities 

MACSYMA offers the widest 
range of capabilities for symbolic 
computations in applied mathe- 
matics of any commercially available 
program. For example: 


Algebra: MACSYMA can Manipu- 
late large expressions, expand, sim- 
plify and factor expressions, handle 
matrices and arrays, and solve 
systems of equations. 

Calculus: MACSYMA can differen- 
tiate, perform definite and indefinite ‘ie 
integration, take limits, expand func- 


You can solve problems... 


ions | : Symbolically... 
tions in Taylor or Laurent series, solve y y computer systems 
differential equations, and compute 

(C1) DEPENDS(Y,T)$ . . 
Laplace transforms. (C2) DIFF (Y,T) + ¥°2 + (2T + A)e¥ 67-274; Current systems include: 


Numerical analysis: You can perform Fiymmmrhs 
numerical analysis using MACSYMA® dT 
language, compute with arbitrary 


¢ Symbolics 3600™Series 
e VAX & MicroVAX || 
e SUN-2 & SUN-3 


ry S ee ee 1 
(C3) SOLN:ODE(D2,Y,T); 
(D3) vy. %CT%ET 1. 


precision arithmetic, use a library of msc one? 4 e Apollo 
Generate Fonte outines and (C4) SOLVE(SUBST([Y — 1, T- Tatiism  ° Masscomp 

enerate coae. (D4) [%C = 0.5518192] ; . . 
Graphics and interfaces: (C5) SPECIFIC SOLN:SUBST(D4,SOLN); Other versions will be following soon. 

e oy T 7 . . . 

MACSYMA can generate report: (D5) Y ~ - 0.5518192T v 77! For an Moey aon kit about all the 
quality graphics in 2D or 3D, with 0-5518192%E 1 ways MACSYMA can work for you, 
perspective, hidden line removal, and just call 


captions. MACSYMA also interfaces 


with the mathematical text processors JR Numerically . 1-800-MACSYMA. 


‘TeX’ and ttroff. In Mass., Alaska or Hawaii only, call 


oo FORT (617) 621-7770. 
Broad base of applications seen g S318 92-7. EXPT) 1 - WB Or please write to us at 
Throughout the world thousands 1 /(0.5518192+*EXP(T) — 1) 
of scientists, engineers and mathe- Dept Manned Mathematics Group 
maticians are using MACSYMA in electromagnetic field problems, Symbolics, Inc. 
such diverse applications as aero- plasma physics, atomic scattering Eleven Cambridge Center 
nautical design, structural engineer- cross sections, control theory, Cambridge, MA 02142 
ing, fluid mechanics, acoustics, CAD, maximum likelihood estimation, 
electronic and VLSI circuit design, genetic studies, and moré. 


MACSYMA 


mm 7 ™ 
The most comprehensive software for Sy b Ol / CS 


mathematical computing. 


MACSYMA®, SyMbDOKCS , Symbolics 3600 are trademarks of Symbolics, Inc VAX and MicrovAX II are trademarks of the Digital Equipment Corporation SUN-2 and SUN-3 are trademarks of 
SUN Microsystems, Inc Apotio® is a trademark of Apollo Computer, Inc Masscomp is a trademark of the Massachusetis Computer Corporation TeX is a trademark of the American Mathematcal Society 
© Copyright 1987 Symbolics, Inc 
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On Knots 
Louis H. Kauffman 


On Knots is a journey through the theory of knots, starting from the 
simplest combinatorial ideas—ideas arising from the representation 
of weaving patterns. From this beginning, topological invariants are 
constructed directly: first linking numbers, then the Conway poly- 
nomial and skein theory. This paves the way for later discussion of 
the recently discovered Jones and generalized polynomials. 

The central chapter is a miscellany of topics and recreations. 

Here the reader will find the quaternions and the belt trick, a devilish 
rope trick, Alhambra mosaics, Fibonacci trees, the topology of DNA, 
and the author’s geometric interpretation of the generalized Jones 
Polynomial. 

Annals of Mathematics Studies 

Paper: $18.95 ISBN 0-691-08435-1 

Cloth: $50.00 ISBN 0-691-08434-3 


Gauss Sums, Kloosterman Sums, and 


Monodromy Groups 
Nicholas M. Katz 


The study of exponential sums over finite fields, begun by Gauss 
nearly two centuries ago, has been completely transformed in recent 
years by advances in algebraic geometry, culminating in Deligne’s 
work on the Weil Conjectures. It now appears as a very attractive 
mixture of algebraic geometry, representation theory, and the sheaf- 
theoretic incarnations of such standard constructions of classical 
analysis as convolution and Fourier transform. 

The book is simultaneously an account of some of these ideas, 
techniques, and results, and an account of their application to con- 
crete equidistrioution questions concerning Kloosterman sums and 
Gauss sums. 

Annals of Mathematics Studies 
Paper: $20.00 ISBN 0-691-08433-5 
Cloth: $52.50 ISBN 0-691-08432-7 


Differential Geometry of Compiex 


Vector Bundles 
Shoshichi Kobayashi 


In recent years holomorphic vector bundles have become objects 
of interest not only to algebraic and differential geometers and com- 
plex analysts but also to low dimensional topologists and mathe- 
matical physicists working on gauge theory. This book grew 
out of the author’s lectures and seminars in Berkeley and Japan. 
Publications of the Mathematical Society of Japan 
Cloth: $57.50 ISBN 0-691-08467-X 


Note: These titles not available in Japan. 


AT YOUR BOOKSTORE OR ; ; 
Princeton University Press 


41 WILLIAM ST. * PRINCETON, NJ 08540 e (609) 896-1344 


Look to McGraw-Hill in 1988! 


CALCULUS AND ANALYTIC GEOMETRY, 4/e 

CALCULUS AND ANALYTIC GEOMETRY, 4/e BRIEF EDITION 
Sherman K. Stein, University of California at Davis 

1987, 0-07-061159-9 (Text); 0-0 7-061162-9 (Brief Edition); 

Student Solutions Manual 0-0 7-061161-0; 

STEINTEST— A Computerized Test Bank 


Capturing the essence of calculus, this text blends both intuitive and 
formal approaches with high-interest applications, margin notes and 


extensive chapter summaries. 


ELEMENTARY LINEAR ALGEBRA 
William L. Perry, Jexas A&M University 


1988, 0-07-0494351-2 


This brand-new text features early treatment of numerical methods for 
the solution of linear systems. Numerical implementation of Gaussian 
Elimination and both LU decomposition and iterative methods are 
treated in Chapter One. Also, complex numbers are used from the 
beginning of the text, so the problem of learning the structure of vector 
Spaces Is eased by considering complex vector spaces as soon as the 


topic is broached. 


PLANE TRIGONOMETRY, 6/e 


E. Richard Heineman, /exas lech University 


1988, 0-07-027945-7 


Still the best approach to helping students prove identities through its 
listing of six different identities and its suggestions as to how to 

begin in each new case. These suggestions still separate this classic text 
from all others. This edition features 1,000 new problems and retains its 
emphasis on the analytical aspects of the subject. 


Also Avai!$|-L—_ __— | _ i iW _asseaasesas=S=s 


ALGEBRA: AN ELEMENTARY COURSE, 2/e 
Raymond A. Barnett, Merritt College and 
Thomas J. Kearns, Northern Kentucky 
University 

1987, 0-07-004924-2 (text); 

Student Solutions Manual 0-07-003770-1: 
Instructor's Answer Manual 0-07-003925-9; 
BARNETTEST— A Printed Test Battery 
Features the hallmark “spiralling learning 
technique’ that takes students from 

current understanding to mastery of more 
complex topics. 


FOURIER SERIES AND BOUNDARY 


VALUE PROBLEMS, 4/e 

Ruel V. Churchill, deceased, formerly 

of University of Michigan and 

James Ward Brown, University of Michigan 


1987, 0-07-010881-1 
Mt 


ALGEBRA: AN INTERMEDIATE COURSE, 2/e 
Raymond A. Barnett, Merfitt College and 
Thomas J. Kearns, Northern Kentucky 
University 

1987, 0-07-004749-3; 

Student Solutions Manual O-07-004884-8: 
Instructor’s Answer Manual 0-07-003924-0: 
BARNETTEST— A Printed Test Battery 

This follow-up text has the same learning 

feature as ALGEBRA: AN ELEMENTARY COURSE 
plus expanded supplements and 

streamlined presentation. 


REAL AND COMPLEX ANALYSIS, 3/e 
Walter Rudin, University of Wisconsin 
1987, 0-07-054234-1 


The Mathematical Symbol for Quality 


7a McGraw-Hill Book Company « College Division ¢ 
a i 1221 Avenue of the Americas ¢ New York, NY 10020 


GREAT SCIENTISTS OF OLD AS HERETICS IN “THE 
SCIENTIFIC METHOD” 
C. Truesdell 


“His defense of a mathematical description of the world, and 
especially of mathematical mechanics, is much to be applauded to- 
day, when even mathematicians too frequently lose touch with the 
greatness of their craft in pursuit of baroque refinement.” —James 
Serrin, Professor, School of Mathematics, University of Minnesota. 


By examples drawn from the writings by differences, then applying mathematical 
Leonardo da Vinci, Galileo, the Bernoullis, process to the result.... By elementary ap- 
Euler, Einstein, Dirac, and others, Trues- plications of ideas from Galileo’s Two New 
dell shows that some great advances in Sciences the author destroys the common 
mathematical physics were made with little claim that classical mechanics are always 
or no basis in experiment... . He lays deterministic and always reversible. He pro- 
weight upon the value of hypothesis and vides a mathematical model to show that 
mathematical deduction, and concludes determinism interrupted by acts of free 
that some of the great theories were ob- choice is compatible with predestination in 
tained by accepting the common features of Galileo’s mechanics. 

earlier, special theories, then discarding the 96 pages, 37 illus. $19.95 


To order call (804) 924-3469 or write 


UNIVERSITY PRESS OF VIRGINIA 
Box 3608 University Station, Charlottesville, VA 22903 


Let f(z)’s animated 
screens show you what 
a textbook cannot. 


The Complete 
Complex Variables 
Graphing Package 


Lascaux Graphics 
3220 Steuben Ave. 
s Bronx, NY 10467 


(212) 654-7429 


Explain this apparent convergence 


If you have a PC compatible, write for a FREE demo disc! 


JOIN MAA TODAY! 


Send this application to: MAA Membership Department 
1529 18th Street, N.W. 
Washington, D.C. 20036 
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Have you been a member of the MAA before? (] Yes (] No 


Your membership may begin on July 1 or January 1. You may elect to receive any of the listed combinations of 
the three MAA journals. All members receive Focus, the MAA newsletter. 
Please circle the appropriate box corresponding to your initial membership period and the selection of journals 
you wish to receive. (Rates guaranteed for the indicated periods only.) Subscription prices are included with 
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THE AMERICAN MATHEMATICAL MONTHLY (M) 
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8 The history of the mathematical 
: sciences in all parts of the world 
from all historical periods 


Historia Mathematica 


EDITOR: Eberhard Knobloch 


Technische Universitat Berlin, West Germany 


MANAGING 
EDITOR: Helena Pycior 


University of Wisconsin, Milwaukee 


Historia Mathematica is concerned with the history of all aspects of the 
mathematical sciences in all parts of the world and from all historical periods. 
The journal includes occasional biographies of mathematicians and historians, 
articles on organizations and institutions, essays on historiography, and the 
interactions between all facets of mathematical activity and other aspects of 
culture and society. 


In addition to original research, Historia Mathematica includes the following 
departments: Notes, Sources, Meetings-Announcements, Meetings-Reports, 
Projects, Reviews, Abstracts, and Letters to the Editor. 


Published under the Auspices of the International Commission on History of 
Mathematics of the Division of the History of Science of the International Union 
of the History and Philosophy of Science 


In the U.S.A. All Other 

4 and Canada Countries 
Volume 15 (1988), 4 issues Institutional rate: $70.00 $84.00 
ISSN 0315-0860 Personal rate: 35.00 45.00 


For more information or a free sample copy, please write or call: 


ACADEMIC PRESS, INC. 


Journal Promotion Department 
1250 Sixth Avenue, San Diego, CA 92101 U.S.A. (619) 230-1840 


$8002 


Toward | 
62 Lean 
oc and Lively 
IRS Calculus 


Report of the Conference/Workshop to Develop Curriculum 
and Teaching Methods for Calculus at the College Level, 
Ronald G. Douglas, Editor. 

MAA Notes #6 

249 pp., 1987, Paperbound, ISBN-0-88385-056-7 

Catalog Number - NTE-07 


Price: $12.50 


Should calculus be taught differently? Can it? Common wisdom says 
‘*no’’—which topics are taught, and when, are dictated by the logic of the 
subject and by client department. The surprising answer from a four-day 
Sloan Foundation-sponsored conference on calculus instruction, chaired by 
Ronald Douglas, is that significant change is possible, desirable, and neces- 
sary. Meeting at Tulane University in New Orleans in January, 1986, a diverse 
and sometimes contentious group of twenty-five faculty, university and foun- 
dation administrators, and scientists from client departments, put aside their 
differences to call for a leaner, livelier, more contemporary course, more 
sharply focused on calculus’s central ideas and on its role as the language 
of science. 

This volume contains the results of that conference and the papers pre- 
sented to the conferees. These are certain to be the point of departure and 
basis for efforts to strengthen and reshape calculus in the next decade. 


Order from 
The Mathematical Association 
of America 
1529 Eighteenth St., NW 
Washington, DC 20036 


MATHEMATICS: 
Queen and Servant 


of Science 


by E. T. Bell 


“Bell is a lively, stimulating writer, inoffensively crotchety 
and opinionated, with a good sense of historical circum- 
stance, a fine impatience with humbug, a sound grasp 
ot the entire mathematical scene, and a gift for clear and 
orderly explanation.” 

James R. Newman, The World of Mathematics 


“Bell's writing stvle makes vou feel as If you are a for- 
tunate student spending time with a charming and gifted 
teacher. Not a history of mathematics, but a wide rang- 
ing exposition of ideas in the subject, the book ts a classic 
and should be required reading for every teacher of 
mathematics.” 

Harold Jacobs, Grant High School, Van Nuys, California 


“This book deserves a place in today’s market. It is a 
much more popular work than most histories of the sub- 
Ject, and that is exactly what makes it accessible to stu- 
dents as well as to non-mathematicians. It is rewarding 
reading for... teachers and students of mathematics 
at all levels.” 

Morris Kline, Courant Institute 


ATHEMATICS 
Queen SServant 
or Science 


ERIC TEMPLE SELL 


xv + 437 pp. Paperbound 
Republished 1987. 
ISBN-0-88385-446-3 

List: $18.50 

MAA Member: $14.00 


This is an absorbing account of the development 
in pure and applied mathematics from the geom- 
etry of Euclid more than two-thousand years ago 
to some of the more recent developments in math- 
ematical physics. Published 30 years ago, MATHE- 
MATICS: QUEEN AND SERVANT OF SCIENCE 
remains a valuable tool for teachers. students, and 
all those enthusiastic about mathematics. The 
book includes 20 chapters on selected areas in 
pure and applied mathematics, including discus- 
sions on topics such as geometry. algebra, logic. 
probability. infinity, transformations, rings, groups, 
topology, and continuity. Also covered are sub- 
jects such as: Mersenne primes, quantum mechan- 
ics, and three-valued logic. The book is not a 
history of mathematics, but rather an enticing 


sampling of selected topics in mathematics and 
science. 


The book contains fascinating information on the 
personalities and philosophies of the great math- 
ematicians, plus perceptive resumes of their spe- 
cific contributions to the moving force of this great 
science. 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


Riddles 
of the 


Sphinx 


mathematical 


puzzle tales. 


by Martin Gardner 

Volume 32 in the New Mathematical Library 
184 pp., Paper. ISBN-0-88285-632-8 

List: $14.50 © MAA Member: $12.50 


Martin Gardner has charmed readers for over fifty 
years with his delightful books and articles on sci- 
ence. He is best known for the popular column, 
“Mathematical Games," which appeared in Scien- 
tific American for twenty-five years. Generations 
of scientists and mathematicians have been 
inspired by his writing and the MAA Is proud to 
include his name in its list of authors. 


This book was drawn from Gardner's column 
in Isaac Asimov's Science Fiction Magazine. The 
riddles presented here incorporate the responses 
of his initial readers, along with additions sug- 
sested by the editors of the New Mathematical 
Library. Each chapter (riddle) poses a problem 
answered in the First Answers section. The solu- 
tion in turn raises another problem that is solved 
in the Second Answers section. This may suggest 
a third question and in several instances there is 
a fourth. Gardner draws us from questions to 
answers always presenting us with new riddles— 
some as yet unanswered. There are 125 different 
pieces altogether. 


NEW MATHEMATICA: 


LIBRARY SD 
THE MATHE 
THEMATICAL ASSOCIATION OF AMERICA 


Solving these riddles is not simply a matter of 
logic and calculation, although these play a role. 
Luck and inspiration are factors as well, so begin- 
ners and experts alike may profitably exercise their 
wits on Gardner’s problems, whose subjects range 
from geometry to word play to questions relating 
to physics and geology. 


We guarantee that you will solve some of the 
riddles, be stumped by others, and be amused by 
almost all of the stories and settings that Gardner 
has devised to raise these questions. 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


THE MATHEMATICAL ASSOCIATION OF AMERICA 


1225 Connecticut Avenue, N.W. 
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Angle Trisection, the Heptagon, and the Triskaidecagon 


ANDREW M. GLEASON!, Harvard University 


Dr. Gleason graduated from Yale in 1942 and then served four years 
in the Navy. After the war he went to Harvard as a Junior Fellow in the 
Society of Fellows. Except for an interlude in the Navy from 1950-52, he 
has been at Harvard ever since. He now holds the Hollis Professorship of 
Mathematicks and Natural Philosophy, a chair that was endowed in 1727. 
Although he has no doctor’s degree, he says that George Mackey was the 
equivalent of his dissertation supervisor. He has worked in several areas 
including topological groups, Banach algebras, finite geometries, and 
coding theory. He received the Newcomb Cleveland prize of the AAAS in 
1952. He is a member of the National Academy of Sciences and is a 
former president of the American Mathematical Society. 


To David Vernon Widder on his 90th birthday 


In 1796, Gauss discovered how to construct a regular 17-gon using only ruler and 
compass. Gauss also showed that regular polygons with 257 or 65537 sides can be 
constructed. He published these results in his famous Disquisitiones Arithmeticae [3, 
section VIII] in 1801. There he gave an analysis of the fields Q(), where € is a 
complex pth root of unity, p an odd prime, and from this analysis he deduced that 
a regular n-gon can be constructed if n has the form 2”p,p,--- p,, where 
Pi, Po»---> Py are distinct Fermat primes, that is, odd primes that are one greater 
than a power of two. He did not give explicit geometric constructions. Gauss also 
stated [3, p. 459] very emphatically that no other regular polygons are constructible, 
but he never published a proof of this fact. A proof was eventually published by 
Wantzel [10] in 1837. Since no Fermat primes larger than 65537 have been 
discovered, the list of constructible regular polygons remains as Gauss left it.” 

If we enlarge our kit of construction tools, other regular polygons may become 
constructible.? For example, given an Archimedean spiral, we can divide any angle 
into any number of equal parts, and hence draw any regular polygon. Suppose we 
allow ourselves to trisect angles in addition to the standard ruler and compass 
constructions, what do we gain? Obviously, we can construct regular polygons with 
9,27,81,... sides, but it is certainly not obvious that we can also make a regular 
polygon with seven sides. In what follows we shall explore the relation between 
angle trisections and solving cubic equations and then determine which regular 
polygons can be constructed with the aid of an angle-trisector. We begin by showing 
how to draw a regular heptagon. 


1. The author thanks Ethan Bolker and Persi Diaconis who made valuable suggestions and Lloyd 
Schoenbach who made the figures. 

2. Since 2° + 1, with r odd and greater than 1, has the nontrivial factor 2° + 1, it is clear that 27 + 1 
can be prime only if q itself is a power of 2. Thus all Fermat primes have the form 22° + 1. The known 
Fermat primes 3, 5,17,257,65537 correspond to k = 0,1,2,3,4. Euler observed that 2° + 1 = 641 x 
6700417, thus disproving Fermat’s conjecture that all values of k would yield primes. More recently, it 
has been shown that 2? + 1 is composite for 6 < k < 19 and many larger values. In many cases the 
complete factorization has not been found. See [2]. 

3. Both [1] and [6] contain much information about constructions using extraordinary tools. For an 
explanation of the standard “ruler and compass” constructions see [5]. 
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Fic. 1. Construction of a regular heptagon. 


Start with a circle @ of radius 6 centered at the origin of a Cartesian coordinate 
system. Mark A(6,0), Q(—3,0), and R(3,0). Locate K(0,/27) and L(0, — y27), 
the vertices of the equilateral triangles with base OR. With center P(—1, 0) draw the 
arc from K to L and trisect it at S and T. The points B and G at which the line ST 
meets circle @ are vertices of a regular heptagon ABCDEFG. The remaining 
vertices can be found by laying off the arc AB successively around the circle. 

Alternatively, we may draw the major arc from K to L and trisect it at S, and 
T,; then S,7, meets @ at D and E. Finally, if we think of arc KL as going all the 
way around P, back to K, and then on to L, and trisect this arc at S,; and 7;, we 
find C and F. 


Proof of the construction. Let U be the point where ST meets the x-axis. It is 
evident from the construction that PK = ¥28, cos ZAPK = 1 / ¥28, PU 
= ¥28 cos ZAPS = ¥28 cos(1/3)ZAPK. The construction is correct if OU = 
6cos27/7; or equivalently, if PU = 1 + 6cos27/7. Thus we need only establish 
the identity 28 cos((1/3)arc cos(1/ ¥28 )) = 1 + 6cos(27/7). To do so, let é = 
cos27/7 + isin27/7 be the principal seventh root of 1. Put n=€+€'= 
2cos27/7. Then w+ 7?-2n-1=8 +2 +€4+14+6'4+€74+E3 =0. 
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Thus 74 is a root of the equation 


X34+ X?-2xX-1=0. (1) 
Setting X = (Y — 1)/3, we find that 1 + 6cos27/7 (= 1 + 3m) is a root of 
Y?-21Y-—7=0. (2) 


The substitution Y = ¥28 cos @ reduces this to 


7¥28 (4cos*@ — 3cos@) = 7; 


whence 


1 
cos 38 = V28 


This leads to six determinations of @ modulo 27, which pair off to give three roots 
of (2). We leave it to the reader to check that 1 + 3n corresponds to the choice 
9 = 1/3arccos1/ ¥28. The other roots of (2) are, of course, 1 + 6cos47/7 and 
1 + 6cos67/7; these correspond to different determinations of 9 and lead to the 
alternative constructions of the points C, D, E, and F mentioned above. 


Plemelj [6] gave a different construction of the regular heptagon ABCDEFG, 
given A and the center O: 


Fic. 2. Plemelj’s construction. 


Draw the circle with center O passing through A and on it find M so that 
AM = OA. Bisect and trisect OM at N and P, respectively, and find T on NP so 
that 2NAT = {ZNAP. Then AT equals the side of the required heptagon, which 
can easily be completed by laying off this segment around the circle. 
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Plemelj goes on to note that because the angle to be trisected is so small, the 
simplest approximate trisection will introduce an error too small to be noticeable in 
any practical case. If we take T to be one-third of the way from N to P, then AT 
will be too long by less than one part in 20000. Even if we take T at N, AT will be 
too short by only about one part in 400. Since AN is half of AQ, the side of an 
inscribed equilateral triangle, we have the rule: The side of the inscribed regular 
heptagon is (approximately) half of the side of the inscribed equilateral triangle. 
According to Tropfke [8], this approximation was used by the tenth-century 
Arabian mathematician Abul Wafa Mohamed, and was also known to Heron of 
Alexandria, and perhaps even to earlier mathematicians. 

To validate Plemelj’s construction, we must prove that AT = OA(2sin7/7). 
Since 2cos27/7 = 2 — (2sin7/7)’, it follows from (1) that 2 sin z/7 is a root of 
the equation 


(2 -— x2)? + (2 — x?)’- 2(2 - x7) -1 =0. 


The other roots are —2sin 7/7, +2sin27/7, and +2 sin 37/7. 
This equation factors: 


(X3 + ¥7(X? - 1))( X? - ¥7(X?- 1)) = 0. 


The zeros of the first factor are 2sin7/7, —2sin27/7, and —2sin37/7. If we 
write the corresponding equation in the form 


a) da : 


and make the standard substitution 1/X = 21/3 cos y, then (3) becomes cos 3y = 
¥27/28 . The desired root corresponds to the choice 


~ = a = tarccos ¥27/28 = tarc tan 1/373 , 


and we have finally 
T 1 
2 sin cos a= 53. 


From FiGure 2 we see that ZNAP = arc tan t/3/3 , so ZNAT = a. Hence we have 
AT cos a = AN = (1/2)¥3 OA. Comparing these equations, we see that AT = 
OA(2 sin 7/7), as required. 


Note that ZOKP in FIGURE 1 is the same as ZNAP in FIGURE 2. Thus ZNAP 
(FIGURE 2) is the complement of ZOPK (FIGURE 1), so ZOPS = 7/6 — a. Thus the 
trisections required in the two constructions are equivalent in the sense that either 
trisector can readily be constructed from the other by ruler and compass. 
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It is interesting to pursue the other two roots of (3). They are given, of course, by 
changing the determination of y from a to a + 27/3 and to a — 27/3. We have 


27 27 32 27 1 
[-2sin = Joos « + =| = [—2sin* }oos{ « — =| = —/3. 


Absorbing the unwanted minus signs into the cosines, we find 


1 


27 7 3a (a 
2 sin sin( = + «| = 2 sin sin{ = — «| = 


6 7 


Fic. 3. Plemelj’s construction extended. O, A, M, Q, R, S are points of an equilateral triangular lattice. 
Line AV is chosen so that ZQAV = 1/3ZQAR. Then AT, AU, and AV equal, respectively, the side, 
short diagonal, and long diagonal of the regular heptagon inscribed in a circle of radius OA. 


In FIGURE 3 we have AU sin(7/6 + a) = AK = (1/2)V3 OA and AV sin(7/6 — a) 
= AL = (1/2)y¥3 OA. Hence AU = OA(2sin27/7) and AV = OA(2sin 37/7). This 
shows that AU and AV are equal, respectively, to the short and long diagonals of 
the heptagon. 

Now let us consider the theory underlying these constructions. We begin by 
reviewing the solution of the cubic equation with real coefficients. The term in X? 
can always be removed by translating the roots and, to avoid fractions later on, we 
write the equation in the form 


X? — 3pX + 2q=0. (4) 
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Since this equation has odd degree, it must have at least one real root. The nature of 
the other roots can be found by considering the quantity + 


D=q’-p’. 
If D > 0, there are two conjugate complex roots and one real root; if D < 0, there 
are three distinct real roots; and if D = 0, there is one double root, g/p, and a 
simple root, —2q/p (unless p = q = 0, in which case 0 is obviously a triple root). 
These facts are easily deduced by examining the critical points of the polynomial 
function in (4). Note that D cannot be negative unless p is positive. 
For positive D, Cardano’s formula gives the unique real root 


J—q + D +~—q-vD . 


This root could be constructed from p and q by ruler and compass and the 
extraction of one cube root. (The second cube root can be constructed from the first 
by ruler and compass, since the product of the two cube roots is p.) 

When D < 0, we have what is known as the casus irreducibilis and a seeming 
paradox: although all the roots are real, they cannot be found by radicals without 
leaving the real domain.°> Cardano’s formula remains valid, but it involves the cube 
roots of complex numbers. 

To find the cube root of a complex number c, we must in general find the cube 
root of the real number |c| and trisect the polar angle of c. In this special situation, 
however, only the trisection requires special tools since the absolute value of either 
cubic radicand is p?/*, for which the cube root can easily be constructed. 

When we know that all the roots are real, we can jump directly to the trisection 
problem by setting X = 2/p cos 9 in (4), which converts this equation into cos 38 = 
— gp >/?. Note that the hypothesis D < 0 guarantees that |qp~?/2| < 1, so there is a 
constructible angle to trisect. As before, the six determinations of 6 modulo 27 lead 
to the three desired roots. After one value of @ has been obtained, the others can 
easily be found by adding and subtracting 27/3; hence one angle-trisection will 
suffice to find all three roots. 

Conversely, any cubic equation that can be solved by an angle trisection must 
have all its roots real, since the method will produce three roots if it produces any. 
Thus we have the fundamental result: 


THEOREM 1. A real cubic equation can be solved geometrically using ruler, com- 
pass, and angle-trisector if and only if its roots are all real. 


In particular, an angle trisector will not help us to duplicate the cube, because the 
equation to be solved, X? = 2, has only one real root. 


4. This discussion is usually given in terms of the discriminant of the equation (i.e., the square of the 
product of all the differences of the roots). The discriminant of (4) is not D, but —108D; consequently 
positive D corresponds to negative discriminant and vice versa. 

5. See (9, p. 180] for a proof of this fact. See [4] for an extended discussion of the solution of 
equations by real radicals. 
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We are now in position to describe precisely what can be constructed with ruler, 
compass, and angle-trisector. 

Associated with any geometric figure (i.e., a finite collection of points, lines, and 
circles) in a Cartesian coordinate plane® is a certain subfield of the real numbers, 
namely, the field generated by the coordinates of all the points and the coefficients 
of the equations of all the lines and circles when written in the standard forms 
y = mx +b (or x =a) and x? + y* = ax + by + c. Suppose the data of a con- 
struction problem’ are associated with the field F, and the figure to be constructed 
is associated with the field G. The construction can be carried out with ruler and 
compass alone if and only if there is a tower of fields FK CPR,CRC::: CF, 
such that G C F, and each F, (i = 1,..., k) is obtained from F;_, by adjoining the 
square root of some positive element of F,_,. (See [9, p. 183ff.] for the proof of this 
standard result.) The intermediate fields F, correspond to the original figure 
augmented by the successive points, lines, and circies used in the construction. 
When we allow the use of the trisector, the result is the same except that we now 
also allow ourselves to build F, from F,_, by adjoining a root of a cubic polynomial 
having coefficients in F,_, and all real roots. The proof of this new theorem is 
virtually identical to the proof of the standard theorem, once Theorem 1 has been 
established. 

It is convenient to describe the above situation by saying that the field F, can be 
constructed from fF. 

For regular polygons we have the following theorem: 


THEOREM 2. A regular polygon of n sides can be constructed by ruler, compass, and 
angle-trisector if and only if the prime factorization of n is 2'3°p,p, +--+ p,, where 
Pi Pos+++5 Py are distinct primes (> 3) each of the form® 2'3" + 1. (We include the 
possibility k = 0; i.e., n = 273°.) 


The proof uses the following lemma. 


LEMMA. Suppose K is a real field and L is a normal extension of K of degree 3. 
Then L can be constructed from K by ruler, compass, and angle-trisector. 


Proof. We know that L = K[B], where B is a zero of some irreducible cubic 
polynomial p(X) with coefficients in K. Since L is a normal extension of K, all of 


6. We assume that the points (0,0) and (1,0) are points of the figure. 

7. If a coordinate system is not given in the plane of the data, first construct one, choosing two given 
points as (0,0) and (1,0). The data must include two points, or at least two points of intersection must be 
immediately at hand, else no constructions can be carried out. (We do not accept the instruction “choose 
a point at random” because that complicates the analysis of constructions considerably. See [5].) 

8. Altogether there are 41 primes of this form less than one million; namely, 2, 3, 5, 7, 13, 17, 19, 37, 
73, 97, 109, 163, 193, 257, 433, 487, 577, 769, 1153, 1297, 1459, 2593, 2917, 3457, 3889, 10369, 12289, 
17497, 18433, 39367, 52489, 65537, 139969, 147457, 209953, 331777, 472393, 629857, 746497, 839809, and 
995329. It is reasonable to conjecture that there are infinitely many; probably about 9¢ of them less than 
10. 
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the zeros of p(X) lie in L, and any one of them generates L. But one of the zeros, 
say y, is real and L = K[y]. Thus, L is a real field, and the zeros of p(X) are all 
real. Hence the lemma follows from Theorem 1. 


Proof of Theorem 2. Since the proof is very similar to the well known proof for 
ruler and compass alone, we give only the most important steps. 

Suppose n is an integer, at least 3. Let € = e?7'/" = cos2a7/n + isin2a/n, and 
n = & + &-! = 2cos2a/n. The Galois group over @ of the cyclotomic field Q(£) is 
abelian with y(n) elements, where is Euler’s phi function. Consequently, every 
field between @ and Q(€) is normal over Q@ with abelian Galois group. In 
particular, the real field Q(7) is normal over Q. Since ¢ has degree 2 over Q(7), the 
degree of Q(7) over Q is (1/2)q(n). 

Now suppose n has the form stated in Theorem 2. Then y(n) = 2°3” for some 
integers v and w, so the Galois group of Q(7) has 2” '3” elements. This group will, 
therefore, have a composition series of length v + w — 1 with all quotients isomor- 
phic either to Z, or Z,. Correspondingly, there is a tower 


R=QcCRC::: CF,, ,= Qn) 


of real fields, each normal over its predecessor of degree 2 or 3. Applying the 
lemma, we see that the field Q(7) can be constructed using ruler, compass, and 
angle-trisector. This means we can construct a segment of length cos2a7/n, and 
from this we can easily construct a regular n-gon. Looking back, we see that we will 
have to use the angle-trisector exactly w times. 

Conversely, suppose a regular n-gon can be constructed with ruler, compass and 
trisector. Then 7 can be constructed, so it must lie in some field of degree 2“3° over 
Q. Hence 7 itself has degree 2°3%, and p(n) = 2°+ 134. But this implies that n has 
the form given in the theorem. 


As an application of our theory, consider the next new prime, 13 = 27-3 + 1. 
Theorem 2 tells us that the regular triskaidecagon can be constructed using one 
angle trisection. There are many ways to proceed; none seem geometrically per- 
spicuous. 

The numbers 2 cos27k/13, k = 1,...,6 are the zeros of the polynomial 


X° + X° — 5X4 -— 4X3 + 6X7 + 3X —-1, 
which factors over the field Q(713 ) to 
(X?— X-1+A(X?-1))(X?- X¥-14+A(X?-1)), 


where A = (1 — y13)/2 and X=(1 + v13 )/2. The first factor has the zero 
2cos2%/13. Then 2 + 12cos27/13 is a zero of a cubic polynomial having no 
quadratic term which we can find as above. After considerable computation we 
obtain 


V3 (v13 + 1) 


27 1 
12 cos—— = y13 — 1+ 1/104 — 8y13 — tan ————__———- 
7 v¥13 / V13 cos-arc an 7s 
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which leads to the following construction, quite similar to the first one for the 
heptagon: 


Ayo 


Fic. 4. Construction of a regular triskaidecagon. 


Let @ be the circle of radius 12 with center at the origin. Mark A(12,0), 
P(v13 — 1,0), Q(5 — 13,0), and R(7 + 13,0). Locate K(6, ¥3(V13 + 1)) and 
L(6, — ¥3(V¥13 + 1)), the vertices of the equilateral triangles having base QR. With 
center P draw the arc KL and trisect it at S and T. The line ST meets @ at A, and 
A>, vertices of the regular triskaidecagon AA,A, --- Ajo. 

Note that a segment of length /13 is easily found as the hypotenuse of a right 
triangle with sides 2 and 3. 

The figure suggests two approximate constructions. The point R appears to be 
very near the line ST. Indeed, the line through R perpendicular to the x-axis meets 
Y ata point about twelve minutes of arc from A,. Even closer, the line PK meets 2 
at a point within three minutes of A,. 

The next new prime, 19, requires two trisections. The details are left to the 
reader! 
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Possible generalizations of Theorem 2 immediately suggest themselves. To con- 
struct a regular 11-gon, we must solve the fifth-degree equation having the root 
2 cos 27/11. Can this be done by quinsecting some angle? Gauss answered all such 
questions. Here is his final statement on the subject [3, p. 450]: 


As a result the division of the whole circle into n [a prime] parts requires, first, 
the division of the whole circle into n — 1 parts; second, the division into 
n —1 parts of another arc which can be constructed as soon as the first 
division is accomplished; third, the extraction of one square root, and it can 
be shown that this is always yn. 


It follows easily from this that a regular n-gon (n need no longer be prime) can be 
constructed if, in addition to ruler and compass, equipment is available to p-sect 
any angle for every prime p that divides p(n). Thus, ruler, compass, and angle 
quinsector will suffice to construct a regular 11-gon, 41-gon, or 101-gon. 
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I. Introduction 


In this article I will concentrate on a diagrammatic approach to invariants of 
knots. I will talk about connections with graph theory, physics, and other topics. In 
the process I shall construct the Jones polynomial and its associated algebra. I will 
also discuss generalizations of the Jones polynomial due to myself and others. [Jones 
introduced his polynomial in 1984. Almost immediately, Hoste, Ocneanu, Millett, 
Lickorish, Freyd, Yetter, Przytcki, and Traczwyk had a significant generalization. 
Shortly, yet another invariant was crafted by Brandt, Lickorish, Millett, and Ho. I 
generalized this one, and in the process found new approaches to the original Jones 
polynomial.] We'll also explain how proofs of some old conjectures about alternat- 
ing knots emerge from this work (due to myself, Murasugi, and Thistlethwaite). 
Many people have helped in this resurgence of the theory of knots. This article is 
dedicated to all of them. 

Let’s begin by thinking about how one might go about making a theory of knots 
and links in three-dimensional space. The typical example of a knot is illustrated in 
FIGURE 1. 


a (realistic) b (schematic) 
Fic. 1. 
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Actually, FIGURE 1 has two illustrations of a trefoil knot T. In the first (a), the 
trefoil is depicted realistically as a physical tube with thickness and shading in the 
three-dimensional space. This picture reminds you that the trefoil might be made of 
rope or rubber—and that such a model would exhibit thickness, tension, friction, 
and other physical properties. In the second illustration (b), there is a schematic 
representation consisting of three continuous planar segments meeting at crossings. 
The crossings have local forms as shown in FIGURE 2. 


Vertex 


PUN, 
| 


Fic. 2. Two forms of crossing associated to a vertex. 


A schematic diagram of this type is a sufficient pattern to allow reconstruction of 
the knot or link from rope or string. It also encodes key topological properties, and 
allows the construction of a diagrammatic theory. 

Thus, I shall regard a knot or link as extra structure (via crossing choices) on a 
(locally) 4-valent planar graph. Each vertex of such a graph has the form seen in 
FIGURE 2, and we shall call such a graph a universe. Thus in FIGURE 3 you see the 
trefoil and its universe. 


(unknot) 7 T (trefoil) 


VAX 


U (trefoil universe) 


Fic. 3. 
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As FIGURE 3 indicates, you may regard the universe as the shadow, under 
projection to the plane, of the overlying knot or link. In general a universe of n 
vertices can be the projection of 2” corresponding knots/links. Many of these will 
be unknotted or unlinked (see FIGURE 3). 

But we now need a definition of equivalence so that the words unknotted and 
unlinked make sense. This equivalence is essentially generated by three fundamental 
types of diagram moves (the Reidemeister moves). See FIGURE 4. I have designated 
the Reidemeister moves as type I (add or remove a curl), type II (remove or add two 
consecutive under (over) crossings), and type III (triangle move). Reidemeister 
proved in the 1920s that these three moves (in conjunction with planar topological 
equivalences of the underlying universes) are sufficient to generate spatial isotopy. 
In other words, Reidemeister proved that two knots (links) in space can be 
deformed into each other (ambient isotopy) if and only if their diagrams can be 
transformed into one another by planar isotopy and the three moves (see [R}). 


SOD =O 
oO Kee 


>a 


OOS 


Fic. 4. Reidemeister moves. 


By a planar isotopy I just mean a motion of the diagram in the plane that 
preserves the graphical structure of the underlying universe. See FIGURE 5. 


5 2 
1 
od 1 
3 
3 


Fic. 5. Planar isotopy. 
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E>? 
BY &) 


Fic. 6. Ambient isotopy. 


In FIGURE 6 an ambient isotopy to an unknotted circle is shown. FIGURE 7 
illustrates an ambient isotopy between the figure eight knot EF and its mirror image 
E*, A knot (link) is said to be ambient isotopic (equivalent) to another if there is a 
sequence of Reidemeister moves and planar equivalences between them. We write 
~ for equivalence. Thus FE ~ E*. (Note that the last two steps in the FIGURE 7 
deformation are planar equivalences.) 


- 
be 

DDQXYY 

DIY 


FIG. 7. 
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The next thing one wants in a theory of knots are methods for distinguishing 
inequivalent knots and links. For example, we know that there is no equivalence 
between () and ©) (©: for the number of components remains invariant under ~ . 
Note that you can determine the number of components even from a complicated 
diagram by choosing a point on some arc of the diagram and then taking a walk 
along the diagram—crossing crossings when you come to them. Each component is 
a complete cycle obtained in this way. (See FIGURE 8). 


3 components 


NL 
4 


Fic. 8. Counting components. 


Note indeed that it is a consequence of the Reidemeister moves that the number 
of components is unchanged by equivalence. Thus the component count is our first 
invariant of knots and links. By itself it is, however, not very strong. In particular, 
©) and CD each have two components but are, in fact, inequivalent. The next 
invariant is the linking number. It gives a measure of how two curves wrap around 
each other. To define it we need notions of orientation and sign. 

A link is said to be oriented if each of its components is assigned a direction 
indicated by arrow(s) on its arcs. The arrows are consistently arranged in the form 
—/——. Oriented crossings are given signs of + 1 as shown in FiGurE 9. 


FIG. 9. Crossing signs. 


Given a link of two components a and B, let a MN £ denote the set of crossings of 
the component a with the component £. (Thus al £ does not include self- 
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crossings of a or of 8.) Then the linking number of a and £ is defined by the 
formula: 


1 
Ik(a, B) = 2 » e( p). 
peallB 
In other words, the linking number is one-half of the sum of crossing signs of one 
curve with another. 


B 


Example 1. ) Ik(a, B) = 50 +1 = 1. 
a 
B’ 
Example 2. & Ik(a’, B) = 3(-1 -)= 1. 
a’ 


Once an orientation has been assigned to a link of two components, it is immediate 
from the Reidemeister moves that the linking number is an invariant. For type I 
moves do not contribute to the linking number, while type II moves add or remove 
both a +1 and a —1. And type III moves do not alter the summation. 

Thus the two examples above suffice to prove that the simplest link 


is indeed linked. For whatever orientation we assign to it, this link has a nonzero 


linking number. 


B 


+ —_ ~~ 
Example 3. DS “ Ik(a, B) = 50 +1-1-1)=0. 
+ — 


This is the Whitehead link (named after the topologist J. H. C. Whitehead). Even 
though it has linking number zero, it is linked. 


Example 4. \ O 
<a 


=| 
( 
q 


aa as «=» 
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Thus \ \ 


" 
' \ \ ‘ 
‘ 

' 

{ 


t 4 
\ 


keeping the endpoints fixed. This is a familiar phenomenon that you can illustrate 
with a belt (the two arcs forming the edges of the belt.) 


©) 4 ~ oa 


If the edges of the belt are oriented in the same direction then we can see what the 
linking number contribution will be from either of these forms. 


oN 1(-1 —-1)= -1 
(QO (-1-1)=-1 
” a <n 


SE (self-crossings are not counted) 


In the curl form -—Q it is worth noting that the linking contribution is the 
same as the contribution of the self-crossing to 


We can use these observations to find the linking number of a more complex link 


such as 
© 


a 


~\ B 


Cy each curl contributes +1 to the linking number 
>< occurs twice giving 4(1 + 1) = 1 
Ik(a, 8B) =14+14+1=3. 


In fact, we can always find the linking number of a link that is built from a knot 
diagram by adding a parallel strand: 


+ 
SD & 
+ 


K K 
Ik(a, 8B) = w(K) =3. 
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The resulting link has linking number equal to the sum of the crossing signs of K. We 
have denoted this sum by w(K ). It is called the writhe of K (or the twist number of 
K ). The writhe w(K ) is not necessarily an invariant of K since it changes by +1 
under the type I move. But the writhe is an invariant of the associated link K 
obtained by drawing parallel strands as above. 


Example 5. , ( a 


Ik(K) = w(K) = +1 


In the context of the associated parallel link K, it is appropriate to call w(K) the 
writhe of K and to reserve the word twist and a number 7(K ) for the twisting of the 
strands. Thus we shall call aoa one full positive twist. And we'll write 


T(€ES)) = +1. 


Then for links £ composed of parallel twisted strands we have the formula 
Ik(L) = w(L) + T(L). (See [Wh].) 

The linking number for parallel twisted strands is the sum of the writhing and 
twisting. Thus 


ff 
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Example 6. 


EP 


w(L) = +3 
T(L) = -3 
Ik(L) =3-3=0. 


This is another example of a link with zero linking number that is nevertheless 
linked. 


Remark. The formula [Wh] Ik(L) = w(L) + T(L) can be regarded as a kind of 
“conservation law” for links of closed parallel (twisted) strands. Neither w(L) nor 
T(L) are individually topological invariants. But since the sum of writhing and 
twisting is a topological invariant, this sum must remain a constant. You can 
observe this conservation by playing with a rubber band. And it has been used to 
help understand the geometry of closed double-stranded DNA. ([BCW], [F]) 


9 


There is much more to say about linking numbers—many other points of view. 
See Knots and Links by Dale Rolfsen or my books Formal Knot Theory; On Knots; 
Knots and Physics for other points of view. We now pass on to the next problem: to 


show that the trefoil KP is indeed knotted. The most elementary proof of this fact 
that is known to me runs as follows: Color the arcs of the trefoil diagram red (r), 


blue (b) and yellow (y). 
CP | 


Say that a knot diagram is tricolored if every arc is colored r, b, or y and at any 
given crossing either all three colors appear or only one color appears. Of course to 
be tricolored there must be arcs of each color in the diagram. Then prove (exercise!) 
that for knots (one component) tricoloration is preserved under the Reidemeister 
moves. 

Since the trefoil is tricolored and the unknot is not tricolored, this method 
articulates a topological property of the trefoil and shows that it can not be 
unknotted. In the next section we will show that the trefoil is chiral. That is, we will 
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prove that it is not equivalent to its mirror image. In the old days (before 1984) this 
was something that required a lot of mathematical background. Now we can prove 
it using only diagrams and a few definitions and calculations. That new invariants 
can be both simple and powerful makes the subject of knots very exciting. 

In the next section we construct the bracket invariant and show how it gives rise 
to the Jones polynomial and to chirality for the trefoil. Section 3 uses the bracket to 
get at subtle facts about alternating knots and links. Section 4 gives more discussion 
of the bracket and its relation to braids and the algebra of Jones’ original 
representation. Section 5 discusses 2-variable generalized polynomials and the 
historical background of Alexander and Conway polynomials. Section 6 shows how 
the bracket (hence, the Jones polynomial) is directly related to the Potts model in 
statistical physics. Sections 7 and 8 explain and generalize a relation with the Tutte 
polynomial in graph theory (see [T1] by M. Thistlethwaite). Section 9 discusses the 
knot theory of graphs embedded in three-dimensional space. Section 10 has specu- 
lations and problems. 


Il. The Bracket Polynomial 


I begin by defining a 3-variable polynomial on unoriented link diagrams. Given 
an unoriented link diagram K, [K] © Z[A, B, d] will denote the corresponding 
polynomial in commuting variables A, B, and d. The bracket polynomial satisfies 
the axioms: 


Bracket Axioms 
1. [><] =A[ ==] + BL) (] 
[~X<] = B=) + 4 )() 
2. [OK]=a|[K] 
[O] = d. 


Some explanation of these rules is in order. First note that an unoriented crossing 
discriminates two out of the four regions incident at its vertex. This can be done 
conventionally by rotating the over-crossing line counterclockwise and choosing the 
two regions swept out. Thus 


((o aff] 


ye 


By using this convention, we can label the regions A and B, respectively: 
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The formula in (1) then reads 
R “4 NE 
4 =A +B 
a 


and we see that A corresponds to a splice that “opens the A-channel” while B 
corresponds to a splice that opens the B-channel. By this convention, the second 
equation in (1) is correct, and a repetition of the first. 

The crossings in these equations stand for larger diagrams that contain them. 


Thus >< ——, } ( are assumed to be parts of otherwise identical diagrams. In this 
sense the expansion formula (1) stands for infinitely many particular formulas such 


KD 4} Phi 


The second equation asserts that an extra disjoint circle placed anywhere in the 
diagram multiplies the value of the bracket by d. In particular 


[any N disjoint simple closed curves] = d”. 


[OO] =a’. 


Clearly, these axioms lead to a recursive calculation of [K] by continued expansion 
to evaluations of collections of simple closed curves. To see that [ K ] is well-defined, 
it suffices to reformulate it as a sum over states S of the universe U underlying K. 

Let U be the universe for K. A state S of U is a choice of splitting for each vertex 
of U. I denote such a choice by a marker at the vertex (see FIGURE 10). FIGURE 11 
shows a state of the trefoil universe and its corresponding splitting. 

Given a state S, let |S| denote the number of components in its splitting. Let 
i,(S) denote the number of A-channels opened in S and j,(S) denote the number 
of B-channels in S. For example, 


Thus 


ix(S)=2,  jx(S) = 1. 


LEMMA 2.1. [K] = LAB _'5!_ This formula for the value of the bracket 
follows directly from the axioms (by expanding using (1) and (2)). It gives a unique 
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split NY 
/oN 


>< 


K = DC 


Fic. 11. A state of the trefoil universe. 


value for the bracket on diagrams (no Reidemeister moves yet) and can be taken as the 
definition of |K for a strictly logical development. 


We now ask: Under what restrictions on A, B, and d will [K] become a 
topological invariant of knots and links? 
This question is easy to answer via the next lemma. (See [K5].) 


LEMMA 2.2. [7] = AB[—> C] + (ABd + A? + B?)[ =]. 
Proof. 
[3a] = ALS] + BI ><] 
= A*[-2_ ] + AB[ =] + BA[> —C] + B?[ YY] 
= AB[> —] + (47+ B* + dA4B)[=—]. 


1988] NEW INVARIANTS IN THE THEORY OF KNOTS 207 


Thus with AB =1 and d= —A?* — B* we obtain invariance under the second 
Reidemeister move. 


LEMMA 2.3. If [_ 3 ~-] =[ <_], then[ _ ] is also invariant under the type LIT 
move. 


Proof. 
7 Su S=<i> _—\—(_— 
[XO] = AL S81 + BE 
Nw” 
= A[Sv] + BI] (by) 
. VAN _ / 
AXA LCI. 
So for the rest of this section we'll take B = A~!, d= —A* — A~? and also write 


(K) = d~'[K] so that (O) = 1. We then have 
L(x) HA) FA-K) (YD 
(=< )HA- =) +4 OL) 
2. (OK) =d(K) 
(OY =1. 


This special bracket is invariant under moves II and III. It behaves as follows under 
the type I move: 


LEMMA 2.4. Let a = —A?. Then 


(3) =a —) 
(S) = aK —) 


We can do two things at this point. We can understand that (K ) is a special kind 
of invariant and it is possible to create an ambient isotopy invariant from ¢ K ) for 
K oriented. First we call the equivalence generated by moves II and III regular 
isotopy. Thus ¢K ) is a regular isotopy invariant. 

Recall from section 1 that the twist number w(k) for an oriented link K is also a 
regular isotopy invariant. (Recall that w(K ) is the sum of all crossing signs.) Thus if 
K is oriented we define f, = a ”‘*)(K where ¢ ) forgets the particular orienta- 
tion. Then f, is an ambient isotopy invariant for oriented knots and links K. 

For mirror images we have: 


LEMMA 2.5. Let K* denote the mirror image of K obtained by reversing all the 
crossings. Then ( K*)(A) = (K )(A_‘) and f(A) = fx(A~'). 


We omit the (easy) proof. 

Now it turns out that f, is a version of Vaughan Jones’ original polynomial. To 
see this we need a definition of the Jones polynomial. Later we will have a deeper 
look at this. For now it suffices to say that Jones’ polynomial V,(t) is determined 
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by the axioms: (see [J1], [J2], [J3].) 
NDev>—-WXK = lv - 
2VOF=!I1 


3) V(t) is an invariant of ambient isotopy. 

LEMMA 2.6. fx(t”"/*) = Vx(?). 

Proof. 

SAS y+ Am 0 

(><) = AKT) + AC) ( 
WANK OX) AK OX) = “a - Ay) 

Aak SS parm 9 — Anta SX yar") = (A? — AY = MD) 

Aaf > -A taf >= (42 - A?) = (fe = a7 "OY KY) 

-A*f SJ +A“ SJ = (4? - A) f — 


Let A = t-!”4. Then 


a 


iy 


mY SX -f >= vi - <\— QED 


— > 


Having constructed the bracket, here are some sample computations: 


1. (Oy=KG) +A KO) 

= A(a) + A '(a“') 

_ — 44 _ Aw4. 
2. (ED Y= AC) + AW XO) 

= A(-A*-—A*) + A-(- A~3) 

= —~A°—-47-34+ 47! 

fr=a XT) = -A-XT) = A 44+ AP - A, 

Thus f,(A) # f;(A_') and, hence, the trefoil knot is chiral. There is no ambient 
isotopy of the trefoil to its mirror image. This is the simplest known proof of the 
chirality of the trefoil knot. Note that all the machinery was developed from scratch, 


and it is all elementary. 
By Lemma 2.6 we have the Jones polynomial for the trefoil as well: 


V(t) =t+e- 14 
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3. Exercise. Calculate (W) and show that it is a nontrivial link. 


<ap 


W 


Ill. Alternating Knots and Links 


The bracket polynomial can be used to get at some subtle properties of alterna- 
ting knots and links. This comes about because we can determine a specific formula 
for the terms of highest and lowest degree in A for such links. 

Recall that a link is said to be alternating if it has an alternating diagram. This is 
a diagram where the crossings alternate under-over-under-over-... as one travels 
along the link (crossing at the crossings). Now view FIGURE 12. It should be clear 
from this figure that in a checkerboard shading of the diagram for an alternating 
link, the shaded regions at each crossing are all of the same type (A or B where this 
is the same discrimination that we used to define the bracket). It is assumed that the 
underlying universe for the diagram is connected. 


We will use this observation to guess the highest degree term in (K), and then 
prove that our guess is correct. Recall that the bracket is given by a summation 


(K) = Yi Aix S04 “ie(SgISi-1, 
Ss 
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My guess is that the highest degree is contributed by that state S where all the markers 
open A-channels. Such a state will contribute a term of the form A’d!5!~! where v is 
the number of vertices (crossings) in the diagram K. And our checker board 
observation shows that in the case of alternating links this A-channel state S has W 
components (|S| = W) where W is the number of white (unshaded) regions, and all 
the A-channels are colored black. View FIGURE 13. 


Ss 
LER 
Ef V=17 
[E W=7, B=12 
ZE R=74+12=19=V+2 
IS|=7=W 


S (split) 


Fic. 13. The A-channel state. 


We see that |S| = W exactly because we have split all the shaded crossings, 
connecting the shaded part into one big shaded region, whose boundary components 


are boundaries of the white regions. 
Thus S contributes the term 
AtdW-) = 4-4? — 477)" ?, 


Hence we assert 
THEOREM 3.1. Let K be a reduced alternating diagram. Then the highest degree 

term in (K has degree given by the formula 

max deg(K) = V + 2(W — 1), 
where V is the number of vertices in the diagram, W is the number of unshaded regions 
(shading corresponding to type A crossings). This term has coefficient equal to +1 in 
(K). The term of minimal degree is also monic and has degree 

min deg(K) = —V — 2(B — 1), 


where B is the number of shaded regions. 
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Comment. A diagram is reduced if it has no crossing that is an isthmus. A 
crossing is said to be an isthmus if any two of the four local regions at the crossing 
are parts of the same region in the whole diagram. See FIGURE 14. 


ISA 


an isthmus 


Fic. 14. 


Proof of Theorem 3.1. Let S be the A-channel state. Note that any other state S’ 
can be obtained from S by flipping some subset of S’s markers. For any state S’, let 
(K|S’) = Aix SO BsxGq'51-? denote the contribution of this state to the bracket 
summation. (B = A~!, d= —A? — A~’). Thus (K) = 2K |S’). 

Now observe the following facts: 

(i) If S’ is obtained from S$” by flipping an A-channel marker to a B-channel 
marker, then max deg( K|S’) < max deg(K|S’”’). The inequality is strict exactly 
when S’ has fewer components than S’’. That is, when |S’| = |S’’| — 1. 

(ii) If S’ is obtained from the A-channel state by one flip, then |S’| = |S| — 1. 

Assertion (i) is obvious, for if (K|S”) = A*d'§"'"!, then (K|S’) = A*~7d/S1-1, 
Since S’ is obtained from S” by one flip, we know that |S’| = |S”|+4+ 1. If 
|S’| = |S”| + 1, then 

(K|S’) — Ax 7d'8"1, 
hence, max deg( K |S’) = max deg( K |S"). If |S’| = |S”| — 1, then 
(K|S’) _ A*~2qiS"1-1-1 
and (using d = —A? — A~’), 
max deg K |S’) = max deg( K|S”) — 4. 
This verifies assertion (i). 

Assertion (ii) is a consequence of our hypothesis of no isthmus. For suppose that 
|S’| = |S| + 1. Begin tracing along one of the components of S’ at the changed 
marker. Note that due to our construction of the state S, this tracing (when drawn 


parallel to the component in the white regions) will encircle all or part of the 
original knot diagram. If |S’| = |S| + 1 then the two cusps ()() at the site of the 
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changed marker will lie on separate components of S. Thus we will end up 
encircling a part of the diagram, showing that this site was an isthmus. This is a 
contradiction. Hence, |S’| = |S| — 1 and max deg K |S’) = max deg( K|S) — 4. 

It follows from (i) and (ii) that max deg( K |S”) < max deg(K|S) for all states 
S”. Thus 


max deg( K ) = max deg( K |S) 
=V+2(W -1). 
This completes the proof of the theorem. 
____»—__— 


We are now in a position to deduce the following (see [K5], [M2], [T1].) 


THEOREM (KAUFFMAN-MURASUGI-THISTLETHWAITE). The number of crossings in 
a reduced alternating projection of a link L is a topological invariant of L. 


Proof. Let span(L) denote the difference between the maximal and minimal 
degrees of (L). Since f, = a~”‘")(L) is an ambient isotopy invariant of L, we 
conclude that span(L) is also an ambient isotopy invariant. By 3.1 


max deg(L) = V + 2(W — 1) 
min deg(L) = —V — 2(B - 1), 


where V is the number of crossings in the diagram, W the number of white regions, 
B the number of black regions. (In a shading where all A-type crossings are 
shaded.) Thus 


span(L) = V+ 2(W —- 1) —- (-V — 2(B - 1)) 
=2V + 2(W+ B- 2). 


But W+ B=R, the total number of regions in the diagram, and R = V + 2. 
Hence, span(L) = 4V. This completes the proof. 


Discussion. This result is one of a number of classical conjectures about alterna- 
ting knots and links that go back to the original compilations of knot tables by Tait 
and Little at the end of the last century. They also conjectured that a reduced 
alternating projection is minimal in the sense that it has the least number of 
crossings of any projection of that link. This is also true, as we shall see. Beyond this 
however, is the Tait flyping conjecture. This states that any two reduced alternating 
projections of the same (up to ambient isotopy) link can be obtained from another 
by flyping. A flype is a move on a tangle (with two inputs and two outputs) obtained 
by rotating the tangle by 180°. See FiGuRE 15. Among other things, the flyping 
conjecture implies that the twist number, w(K ), of a reduced alternating projection 
is an ambient isotopy invariant of K. At this writing, the full conjecture remains 
open. Morwen Thistlethwaite has proved that w(K ) is an ambient isotopy invariant 
for reduced alternating diagrams. His proof uses my extension of the Brandt- 
Lickorish-Millett-Ho polynomial to two variables. (See section 5 for a discussion of 
this polynomial.) 
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D> Sd 


SY #- Wr 


Fic. 15. Flyping. 


The next lemma [K5] gives a quick proof of the general inequality span(K ) < 4V 
(first proved independently by Murasugi and Thistlethwaite). 


Lemma 3.3. Let S be any state of a universe U. Then |S| + |S| < R where R is the 
number of regions in U and § is the dual state for S obtained by reversing all the 
markers of S. 


I omit the proof of this lemma. See FIGURE 16 for an illustration. 


|S] = 2 


U 


4 ma 
PS - 


Fic. 16. |S} + |S} = 6 < 10. 
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We then use Lemma 3.3 to prove 


PROPOSITION 3.4. For any reduced diagram K, span(K ) < 4V, where V is the 
number of crossings in K. 


Proof. Let S be that state for K such that every crossing is split in the 
A-direction. Then the same argument as in the proof of 3.1 shows that 


max deg(K) < V + 2(|S| — 1) 
and 
min deg(K) > —V— 2(|S| —1). 
Therefore, span(K) < 2V + 2(|S| + |S| — 2) < 4V. 


A refinement of Lemma 3.3 by Wu [Wu] shows that span(K) < 4V when K is 
any reduced nonalternating diagram. This gives us a quick proof of this inequality, 
also due to Murasugi and Thistlethwaite. Mr. Wu’s very nice observation is that in a 
state S where every crossing is split in the A-direction for a nonalternating diagram 
there must appear splits in the pattern 


—_— — split ” 4 
——- 


corresponding to two consecutive over or under crossings. Call this a pair of parallel 
markers. Mr Wu notes that if S has at least one pair of parallel markers, then 
|S| + |S| < V. Repeating the argument of 3.4, we obtain the strong inequality 
span(K ) < 4V when K is reduced and nonalternating. Thus we now know that a 
reduced alternating projection has a minimal number of crossings among all diagrams 
for the link. 

This is a remarkable application of these techniques. It is the first result of this 
kind in knot theory, and has a number of ramifications. For example, D. W. 
Sumners has used it to show that the number of knots grows at least exponentially 
as a function of minimal crossing number. See also [Ki]. 


Mirror Images. We now turn to the consequences of Theorem 3.1 for chirality of 
alternating links. Let K be a reduced alternating projection as in 3.1. Then 
max deg(K) = V+ 2(W— 1) and mindeg(K) = —V —2(B — 1). Thus (using 
fr = a") K )) we have 


max deg f, = —3w(K) +V+2(W-1) 
min deg f, = —3w(K) —- V—2(B-1). 


If K is ambient isotopic to its mirror image K*, then f,«(A) = f,(A~') implies 
f(A) = fx(A~*). Hence —min deg f, = max deg fx, thus 


3w(K) + V+2(B-1) = -—3w(K) +V+2(W- 1). 
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Therefore, 

6w(K) = 2(W - B), 
or 

3w(K)=W-B. 


Thus we have a necessary condition for an alternating link to be achiral (equivalent 
to its mirror image). You can check that it follows from this equation that if the 
absolute value of the twist number, |w(K )|, is greater than or equal to one-third the 
number of crossings, then the link is chiral. This is a step in the direction of the 


TAIT CONJECTURE. K reduced, alternating, w(K ) # 0 implies K is chiral. (Now 
proved by Thistlethwaite.) 


Murasugi [M2] also proved the invariance of w(K). His method is to note the 
sum s(K) of the maximal and minimal degrees is an invariant of the ambient 
isotopy class of K. For K reduced alternating, we have (from the above) that 
s(K) = —6w(K) + 2(;W — B). He then uses another technique (the signature of 
knots and links) to show that (W — B) — w(K ) is an ambient isotopy invariant for 
the reduced alternating diagram K. Hence w(K ) must also be invariant. In section 
5 we'll give another proof of the invariance of w(K ). 

There exist many reduced alternating achiral knots with twist number zero. I 
conjecture that each such not only satisfies W = B, but that the graph associated to 
the white regions is isomorphic to the graph associated to the black regions. See 
FIGURE 17 for the example of the knot 8,,. 


817 
aN 
/ ! \\ 
fh 
I o> wa Isomorphic graph and 
VK ’ ; dual graph 
\ \. 7% } 
\ x / 
“oe N / 
\. y, 


Fic. 17. 


IV. Braids and Diagrams 


Let’s now consider the specialization of the bracket to the case of braids. The 
n-strand braid group B,, is generated by elements 0,, 0,,...,6,_, (and their inverses) 
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subject to the relations 
0,9; 49; = 9; 4 19j;9; + 


0,6,=060, for |i —j| > 2. 


The meaning of these generators and relations should become clear from FIGURE 
18. A braid is a collection of unknotted strands, proceeding downward from n 
points (top row) to n points (bottom row). The strands wind around one another 
throughout the descent. Given a braid b € B,, its closure b is the knot or link 
obtained by attaching the n points in the top row to their counterparts on the 
bottom row. (See FIGURE 18.) By definition, the value of the bracket on a braid is its 
value on the closure of the braid: (b) = (b). Braids b,, b, are multiplied by 
attaching the n points on the bottom row of the first to the n points on the top row 
of the second. (See FIGURE 18.) 


“ IX XI IX 


(braid generators for n = 3) 


oR RL 


/ 
y : 
K 


Fic. 18. 


Now consider the states of a braid-universe. It should be apparent from FIGURE 
19 that these can be constructed as diagrammatic products of the elementary 
diagrams h,,..., h,_, with relations 

h? = dh, 
hjh; hj = h; 


hjh,=hjh, for |i —j|>2. 
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ME 
~ Al A]-0 
Alf] 


| (\ 


hyhgh, =h, h; = dh, 


FIG. 19. 


Here we take d to represent the closed loop obtained by plugging h, into itself. As 
diagrams, the states can be multiplied just as we multiply braids. Since the result of 
such multiplications can produce extra closed loops, we need to impose a mixed 
topological and combinatorial equivalence relation to capture the resulting struc- 
ture. Since, for computing the bracket it is irrelevant where a closed component is 
(we only count them), I define two diagrams to be equivalent if one can be obtained 
from another by regular isotopy relative to the endpoints, with free regular isotopy 
for closed loops. Thus, 


VPN 
- Di \-c 
WTP 


/\ 


illustrates the equivalence behind the identity h3 = dh,. Since the h,’s involve 
paired maxima and minima the first definition of this semi-group as generated by 
products of the h,’s is a bit unsatisfactory. We would prefer a more intrinsic 
definition. This can be done, but we won’t go into it here (see [K7]). Also a mixture 
of braid generators and h,’s produces a more intricate structure. 
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Because the braid-states have a multiplicative structure, we see that the bracket 
expansion (>) = A{=~) + A~)() can be construed for braids as a mapping 
0: B, > D, where D, is the free additive algebra over Z[ A, A~*] with multiplicative 
generators h, and relations (*) above. That is, we define 


X = ) ( 


J t ” 
o(0,) = Ah, + A7*1 


p(o,'!)=A-'h, + A 


and take d = —A? — A~* in D.. Then the formalism we have used to prove that 
(K is an invariant of ambient isotopy also proves (via the braiding relations) that 
p iS a representation of the n-strand braid group to the algebra D,. Furthermore 
there is a function tr: D, > Z[A, A~'] that we may interpret as the linear extension 
of tr(h) where h is a product of h,’s. And tr(h) = d'"'~* where |h| is the number of 
disjoint circles in the state corresponding to h. Then tre p(b) = ¢(b) and this gives a 
diagrammatic interpretation to the original construction of the Jones polynomial via 
representations. 

This approach has been generalized (see [K6], [K7], [L]) but the algebra of the h,’s 
remains the most transparent structure in this context. And while it may seem 
transparent, it is in fact rather opaque: We do not yet know whether there is a 
non-trivial knot with trivial Jones polynomial. 


The Mixed Algebra. From our context, it is very natural to consider a mixture of 
products of braid generators o, and the state-elements 4, At this writing, an 
abstraction of this algebra has been used by Birman and Wenzel [BW], and one very 
beautiful systematization of it by Yetter [Y]. For our purposes we shall write such 
diagrams up to regular isotopy. Thus we do not have relations 0,h; = h, since (for 


example) o,h, has diagram 
moe 


a 


and it requires a type I move to obtain the cancellation. 

And since diagram multiplication does yield an extra loop d when squaring h,, 
we hope to retain the relation h? = dh,;. Let @, denote this (multiplicative) 
extension of braids via the h,’s. Obviously, we want a better formal definition of 
M ,, but first consider some examples: 


n? 


vy |) 
1°. Ma me K] ohh, = 05'h,. 


This is a fundamental type of mixed relation. Note how the pairing of 
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maxima/minima to produce the h, on the left-hand side comes from different arcs 
than on the right-hand side! 


: a 


0,0,0,0,h,h, = hyh3. 


We are allowing regular isotopy of the strands relative to the endpoints and to the 
(vertical) sides of the box in which the tangle sits. 


/ U) U/ 
3°. () = iV = 
Yah 78 
0,0,h, =h,h, = hj,o,0,. 


It may begin to look like there is a myriad of possible relations in .@,,. This is true, 
but the kind of relation illustrated in example 1° plus the usual braiding and h, 
relations is sufficient to generate the others. (For details see [Y] and compare with 


[K6] and [K7].) For example: 
[- a () 
Thus one can actually give generators and relations for .7%,, just as for the 


classical braid group. But in order to do so a decision must be made about handling 
appearances of closed loops. We take 


Al 
1 UA 


bringing closed loop forms outside the rectangle, then it is natural to move whole 
knots and links in and out as in: 


ane 


Thus in this formulation, we can write h,h,0,0; ‘0; 'o,h,h3 = Ahh, where A is the 
given link. By allowing multiplication by disjoint union with knots and links, we go 
beyond a simple set of generators and relations. However, the formalism is useful in 
some contents. For example, using [K] = d(K) we have [K LI K’] = [K][K’] so 
that the square bracket preserves this outer multiplicative structure. 

Finally, the outer form of multiplication then fits in with a generalized tensor 
formalism (see [P]) with two types of multiplication corresponding to ordinary 
tensor product and to different forms in index contraction such as matrix multipli- 


cation. Thus if 
B a 
. o AB and A eo BE 
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B 
© ABBS 


(meaning the sum over all occurrences of a by the summation convention). 
Connection by a single connecting line corresponds to ordinary matrix multi- 
plication. Connection using multiple connecting lines can correspond to multi- 
plication in a tensor product. 

We can regard “ and — as diagrams for matrices “,, and ™°* respectively. 


Then corresponds to the tensor product h = .M,,M@%*, and h? = 


M yp M PO MPM gign =(M yg M®)h = Ah. This formalism corresponds directly to 
the diagram for h? with ©) corresponding to the scalar A = “,,4. 

In this way our diagram algebra can be interpreted as the underlying structure 
for specific matrix representations of the multiplicative structure of the h,’s. See 
[K7] for a complete exposition of this. 

In this last comment we have informally presented two points of view about the 
extended braid-like multiplicative structures that appear so naturally from the 
braid-states. By restricting to internal multiplication (matching upper and lower 
strands) one obtains significant generalizations of the Artin braid group. Adding 
outer multiplication by closed forms (knots and links) creates close correspondences 
with representations and abstract tensor products. 


then 


V. Generalized Polynomials 


There are, at present, two 2-variable generalized polynomial invariants for knots 
and links, each a generalization of the Jones polynomial. These are the Homfly 
polynomial and the Kauffman polynomial. I will denote the Homfly polynomial by 
P,-(a, Z) and the Kauffman polynomial by F;(a, z). In this section we will touch 
on the formalisms of these polynomials. And I shall begin by recalling the Conway 
polynomial and telling a bit of the tale leading from Alexander to Conway to 
generalized polynomials. 

In the beginning [A] was Alexander and his invention /discovery of the Alexander 
polynomial A,(t). Alexander probably discovered this polynomial by thinking 
about covering spaces, but his paper was strictly combinatorial, using linear algebra, 
determinants, and the Reidemeister moves. He showed that if two oriented knots or 
links K, K’ are ambient isotopic, then A,(t) = A,.(t) where = means equal up 
to a multiple of +1” for some integer n. The polynomial was seen to be quite good 
at distinguishing knots and links, although it did not distinguish a knot or link from 
its mirror image. “ 

The Alexander polynomial has been an extraordinary and useful tool in knot 
theory since its inception. Attempts to model and reformulate it led to much new 
work and different points of view. One of the most notable of these approaches is 
R. H. Fox’s [CF], discovery of the free differential calculus, a technique for 
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extricating the Alexander polynomial from any presentation of the fundamental 
group of its complement. Then in 1970 John Horton Conway published a remar- 
kable paper [Con] in which he showed that the Alexander polynomial could be 
sharpened to an invariant with a simple recursive definition. The Conway 
polynomial, V;,(z), is determined by the conditions: 


1. VY -v >< = zV 


2.VC)=1 
3. Vx(Z) = Vx (z) whenever K and K’ are ambient isotopic. 


Conway explained that his polynomial was related to the Alexander polynomial by 
the formula A,(t) = Vx(Vt — 1/ Vt). 

Fight years later, Conway became enthusiastic once again about this polynomial 
and he lectured about it in a number of places. This time people heard him and their 
interest led to some papers about the polynomial (see e.g., [K1], [G], [Co]). The focus 
was primarily on how to use this recursive scheme, and on understanding the 
relation to the Alexander polynomial. Some use was made of the extra information 
in the Conway polynomial. (It can distinguish many links with even number of 
components from their mirror images.) This author wrote a monograph [K2] on 
combinatorial and diagrammatic work related to V,(z). In particular, I found a 
states model for V,(z) with a state-summation that is a bit more intricate than our 
model for the bracket. This model allowed a new proof of the theorems of Murasagi 
and Crowell ({M1], [Cr]) on the genus of alternating knots, and a generalization of 
these results to a category I called alternative knots. 

But curiously, no one tried to generalize Conway’s recursive scheme itself. No 
one asked what would happen if the first formula were modified to, say, 


Vos tv KX = 7? 
And then in 1984 Vaughan Jones lectured on his new invariant, derived from a 
representation of the Artin braid group into a von Neumann algebra [J2]. And 


Vaughan proved (among other things) that his (Laurent) polynomial satisfied an 
identity 


tv >S (t) - V >< (1) = [vi — 7 |¥ 0. 


With this formula standing in juxtaposition to the Conway formula, a number of 
people leapt at once to the generalization 
a” 'P x ~aP >= zP__., 

giving a two-variable polynomial Px(a, z) specializing both to the Conway (a = 1) 
and the Jones (a = t, z = vt — 1/ yt) polynomials. This is the Homfly polynomial 
[HOMFLY}]. 

Some time passed, and then Brandt, Lickorish, Millett, and (independently) Ho 
found ({[BLM], [H]) yet another new invariant polynomial Q,(z) satisfying 
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O> +2 ><_= 2(Q ——+@Q )() for unoriented links. This is a one-variable 
polynomial, distinct from the Homfly polynomial. It does not distinguish mirror 
images. 

I then had the good fortune to recognize how to put another variable into the 
context of the Q-polynomial ({K4], [K6}). The idea is to work in the regular isotopy 
category (as we explained for the bracket) and let a polynomial L be defined via: 


1l) LD OXK+lL os (LI 4+L)() 
LQ =al 

L~O = ao} 
3) LOC) = 1 


4) L, = Lz whenever K and K’ are regularly isotopic. 


Then L is normalized to form an invariant of ambient isotopy for oriented knots 
and links via the equation 


Fy = a "OL, 


where w(K ) is the twist number of the diagram K. The polynomial F, turns out to 
be quite good at distinguishing knots and links from their mirror images. It appears 
to be a proper companion to the Homfly polynomial P,(a, z). (The reader should 
note that our names of polynomials by letter and variable choice may differ from 
those given elsewhere in the literature. The translations are always straightforward.) 
Ocneanu and Jones discovered how to put a trace on the Hecke algebra generated 
by elements c, satisfying braiding relations and the Conway-type relation 


c,-c,'=2z 
to produce the Homfly polynomial in a fashion analogous to the representation for 
V(t). Hugh Morton worked extensively with this algebra, producing very good 
programs for computing the Homfly polynomial for braids. Morton continues doing 
deep theoretical work related to these polynomials. (See [M].) 

Birman and Wenzel [BW] have given a similar treatment for the Kauffman 
polynomial by using an algebra with relations c,+ c;! = z(1+ E,), where c, 
corresponds to a braid generator and E, shows the formal properties of our h,’s. 
More will come of this. David Yetter [Y] has given a good general context for 
diagram-related algebras. See also [K5], [K6], and [K7]. 

Both of the two-variable generalized polynomials have the Jones polynomial as a 
special case. For F, this is most easily seen via adding: 


(><) $ (><) = (44 ANVK) + 1) (—)) 
CO =a 
= — 43 
OS) <0 ( —) 


Thus (K) = L,(—A?, A +A7!) and fy = Fy(—A’, A +47). Since V(t) = 
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fie(t7 74) we conclude that V,(t) = F,(—t7 3/4, «71/4 + t'/*). This was observed 
by Lickorish [L] by a different route. 

The two 2-variable polynomials can be established via direct inductive definition. 
It is an open question whether there exist models for the workings of these 
polynomials that connect them directly with geometry beyond the geometry of 
diagrams. I believe that such connections will come about, and that they will be of 
great importance for topology as a whole. 


Remark. Thistlethwaite’s simple proof [T2] of the invariance of the writhe for 
reduced alternating diagrams uses the Kauffman polynomial: He observes that for 
Ly with K reduced alternating, the highest term in z has coefficient k(a + a7‘), 
k > 0 and power z"~' where n is the number of crossings in K. Thus L; = 
k(a + a~')z"~! + (other lower degrees in z) for K reduced alternating. Since 
F, =a “7, is an ambient isotopy invariant, it follows at once that w(K ) is also 
an ambient isotopy invariant. The proof of Morwen’s observation is a direct 
structural induction: 

(i) If >< is reduced alternating, then ——_ and ) ( are both alternating, and 

at least one is reduced. 

(ii) Use (i) and the recursion 


Lot hom= 2(L=Z+L) |). 
The separation of the a and z variables is the crucial ingredient in the proof. 


VI. Graphs and Statistical Physics 


Recall that in section III, I defined a generalized bracket polynomial for 
diagrams so that [K] € Z[A, B, d] and 


1. [><] = 4[] + BL) (1 
2. [0 K] = 41K} 
[0] = 


We then created an invariant of regular isotopy via B = A~!, d= —A* — A~? and 
(K) = d~‘[K] (so that (0) = 1). The square bracket can be specialized in other 
ways. In particular, it is (for the rmght choice of A and B) the dichromatic 
(Whitney-Tutte) polynomial for a planar graph. This in turn can be seen to be a way 
of expressing the partition function for the Potts model (a generalization of the Ising 
model) in statistical physics [B]. 

To understand this connection it is important to realize the following: 


THEOREM 6.1. Universes are in one-to-one correspondence with planar graphs. 


Proof. To each universe, shade it so that the unbounded region is unshaded (..¢., 
2-color the regions). Associate a graph to U, ['(U), so that the vertices of ['(U) 
correspond to the shaded regions of U and the edges correspond to crossings shared 
by shaded regions. Given a graph G, associate to it a universe V(G) by placing a 
crossing of the form ¢———_><__e on each edge of G and connecting these 
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crossings at each vertex as shown in FIGURE 20. It is easy to verify that [(V(G)) = G 
and V(I'(U )) = U. This completes the proof. 


The dichromatic polynomial Z,(q, v) © Z[q, v] 1s defined for graphs G by the 
recursive formulas: 


1) Z = Z—e e—++4+UZ 
2) Leg = 926 
Ze = 4 


The first formula asserts that the value of Z on a graph G 1s equal to the sum of the 
value of Z on a graph G’ obtained by deleting one edge from G plus v multiplied by 
the Z for G”, the graph obtained by collapsing this edge to a point. The second 
formula asserts that the addition of an extra vertex to a graph G multiplies the 
dichromatic polynomial by g, and that the value of Z for an isolated vertex is gq. 


Examples. 
Zeo—»e =Ze et+vZe=q’ + 0g 
ZV =Z.+vUZ.= 9g + 0g 
ZLZN =ZNHZO 
= Zo. tvZe__.t0(Ze_o + 0Z )) 
= q(q? + 0g) + 0(q? + 0g) + v(q? + 0g) + 0°(q + 09) 
= (q+ 2v)(q* + vg) + 0°(q + 09). 
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For v = —1, the dichromatic polynomial specializes to the chromatic polynomial. 
That is, Z.(q, —1) = K,(q) is the number of ways to vertex-color the graph G with 
qg colors so that no two adjacent vertices receive the same color. This is easily seen 
from the recursion formula since K -- « counts all possibilities for these two 
vertices, while K —— corresponds to those cases where the two vertices receive the 
same color. 

Now consider how the recursion formula (1) diagrams: 


Z =Zi.» «——_tUZ_ »,. 


Nw eta ODN (EEL rrr 
Z Se LES) Bee CL SSS 
_ _— _ eee” 
AT te a a [a 
a ae 


We see that deletion and contraction in the graphs become the two ways of splicing 
the crossing in the knot diagrams (universe). And the expansion for Z is formally a 
bracket expansion. 

Some further translation is then required to actually re-write Z, as a bracket. 
First let K(G) be the alternating link diagram associated with V(G) so that all 
shaded crossings are of type A. Then 


THEOREM 6.2. Z,(g, v) = g*/*[K(G)] where N is the number of vertices of G, 
and the bracket is expanded with A = q™+/*v, B = 1, d= q'” so that 


[<= I= 1+1) 0 
and [0] = q\”. 
Example. 
K(-—+) = CX 
(LCDI) = ag [OD ) + [OO)) 
= (qq? + (q'”7)’) 


=gu+q’ 
K(lox)] =Z 


——————_-~o---—-_—- 


Thus we see that the square bracket is fundamentally related both to knot theory 
and to graph theory. This connection raises many questions. We would like to know 
whether qualitative information can be transferred between these two subjects. The 
square bracket gives a picture of a parameter space A, B, d and subvarieties along 
which [K ] is topological or dichromatic. More work is needed here. 

Just to complete this picture I shall explain how the partition function for the 
Potts model in statistical physics is a dichromatic polynomial (see [B], [K7]). The 
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partition function has the form 
Lo = Ye Fle) 
0 


where o runs over all “states” of the lattice G (we will let G be a planar graph) and 
E(o) is the energy of the given state. In the Potts model the energy has the form 


1 
E(o) = a 2% (o 0;), 


where (i, 7) denotes an edge of G with vertices i, j and o, and o, are the state’s 
assignments to these vertices. We assume that each vertex can be freely assigned one 
of g values, and that a state o is such an assignment. In this formula 6 is the 
Kronecker delta 


_ fl ifa=b 
(a,b) = { j otherwise 


and T is the temperature of the system, while k is a constant (Boltzman’s constant). 
The partition function has many uses in this subject. For example, the proba- 
bility of being in a given energy state E is 


P(E) =e-"/Zg. 
PROPOSITION 6.3. For 


E(o) 


I 
M 
oO 
_~ 

a 

A 
— 


and q local states, let 
p=e C/K) — 1. 


Then the partition function is the dichromatic polynomial in q and v: 
ye = Z¢(q, v). 


Proof. 


Fen Hed = Lear taney) 


i, J) 
o 


0G 
= > al (e7 G/KT)8(o,, 9) ) 


ao «i, /) 
YeFO=)y YT] (1 + v8(o,, o,)). 
0 a <i, /) 


It is easy to see that the nght hand side of this equation satisfies the recursion 
relation for the dichromatic polynomial in g and v. 
(Z 
(Zeg = 9Zq) 


1988] NEW INVARIANTS IN THE THEORY OF KNOTS 927 


Fic. 21. Translating the square lattice. 


Combining 6.2 and 6.3 we see that the partition function is a bracket expansion. 
This gives a theoretical explanation for the appearance of the algebra of the h,’s (see 
section 4) in the structure of the Potts model for the square lattice. (See [B].) This 
lattice corresponds to the plat closure (see FIGURE 21) of a particular braid. Any 
bracket evaluation for a braid is expressed in terms of this operator algebra. It 
remains to be seen whether the bracket formulation for the Potts model will shed 
light on its physics. The relationship between the Potts model and the Jones 
polynomial via its operator algebra was first observed by Vaughan Jones. Our 
formulation shows the direct connection through translating graphs and link 
diagrams. 


VII. The Bracket and the Tutte Polynomial 


In the last section we showed how the bracket could be used to give the 
dichromatic polynomial for planar graphs. Here we shall reformulate the state 
expansion of the general bracket function, showing that it can be calculated solely 
from states with a single component. Our reformulation generalizes work of 
Morwen Thistlethwaite. He showed [T1] how to do this for the bracket and the 
Jones polynomial. The ideas go back to a generalization of the dichromatic 
polynomial known as the Tutte polynomial. (See [Tu]). The Tutte polynomial is 
defined recursively as follows. 

To each graph G is associated a polynomial T(x, y) © Z[x, y]. If G is compo- 
sed solely of isthmus and loops then T,, = x'y’ where i is the number of isthmuses 
and / is the number of loops. The polynomial satisfies the recursion T,, = T,, + Tov 
where G’ and G” are the graphs obtained by deleting and contracting (respectively) 
an edge that is neither a loop nor an isthmus. 


T e————e = x, TL =y 
TU. = xy 
TZN =T.NGTS 


-T.N+T..+Tg 
TLN =x? 4+x4+y 


Examples. 
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The dichromatic polynomial Z,(q,v) (see section 6) is related to the Tutte 
polynomial T(x, y) by the formula Z,(q, v) = qv’ 'T,(1 + qv-', 1 + v) where 
N is the number of vertices of G. For example, gu(1 + g/v) = qu + q* = Ze—e 


and ql t+vu)=q+q=Z LO show that the formula is correct for a single loop 
and isthmus. This formula shows that the dichromatic polynomial and the Tutte 
polynomial determine one another. Thus 


1 
T(x, y) = ——_—— Ze ((x — I)(y — 1), (9 - 1). 
(x - 1)(y- 1)" 
Tutte proved a remarkable theorem showing that his polynomial could be computed 
from weightings assigned to the maximal trees of the graph G. This weighting is 
dependent upon an ordering of the edges of G, but independent of the particular 
ordering. 


Definition 7.1. Let G be a graph whose edges have been labelled 1,2,3,..., n. 
Let H C G be a maximal tree in G. Let i € {1,2,..., 1} denote an edge of H. Let 
H, denote H — (the ith edge). Since H is a maximal tree, H,; has two components. 
One says that 7 is internally active if i < 7 for every edge 7 in G-H and endpoints in 
both components of H;. Let i © G-H be an external edge. One says that i is 
externally active if i < j for all edges 7 on the cycle in H extending from one end of 
i to the other. 


Example. 


H 
Here the edge labelled 1 is internally active. Edge number 2 is externally active. 


THEOREM (TUTTE). Let # denote the collection of maximal trees in a graph G. Let 
i(H) denote the number of internally active edges in G, and e(H) the number of 
externally active edges in G (with respect to a given tree H). Then the Tutte 
polynomial is given by the formula 


T(x, y) = Lp xy. 


HCo# 
Example. 
1 2 2 1 
1 2 
3 3 3 
G H, H, H, 


Tg =ytxtx’. 
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It may seem from the definitions of internal and external activity that they are 
somewhat different. Actually there is a symmetry of definition for planar graphs. It 
is through this symmetry that I like to see the relationship of the Tutte weightings 
with universes and with knot theory. To see this symmetry consider the universe 
V(G) (sometimes called the medial graph) associated with a planar graph G. Each 
maximal tree H C G determines a state S = S(H) of V(G) with one component. 


This state is obtained by splitting V(G) along the edges of H [ SS SS], 
and splitting all other crossings in the opposite fashion. 


Example. 


G H 
1 
2 S(H) 
V(G) 5 


The edge labelling of G becomes a vertex labelling of V(G). Call the vertices 
(crossings) of S() internal or external according as they are split with cusps 
pointing to the inside or to the outside of the Jordan trail S(H). Thus in the 
example above, 1 and 3 are internal and 2 is external. Given a Jordan trail S (a 
universe with |S| = 1) with vertices (sites x) labelled 1,2,..., n, call a site i active 
if i < 7 for all sites 7 with cusps in the two components resulting from splitting S at 


i. (Compare [K2].) 
1 1 2 
5 a 
-——» 
split at 
1 
3 

3 


Since 1 < 2, 1 is active. 


Example. 


A site on a trail is internally active if it is internal and active. A site is externally 
active if it is external and active. By replacing the trees in G by Jordan trails on 
V(G), we obtain a symmetrical definition of T(x, y). 

Incidentally, it is easy to see from this reformulation that T,(x, y) = Tey, x) 
where G is the planar dual graph to the planar graph G. Each Jordan trail gives a 
pair of maximal trees, one for G and one for G. 
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By now we are very close to the knot theory, and I can explain how to calculate 
the square bracket, | K | for link diagrams by using Tutte weightings. Recall that [K ] 
has variables A, B, d and that 

[><] = 4[==] + B[) (] 
[0K] =d[K] 
[Oo ]= (44+ BL —] 
[TO ] =(4 + Bal 
Let a = Ad + Banda=A4A + Bd. Now let K bea given diagram, and S bea state 
for K with one component (|S| = 1). Let denote the collection of all states § 
with one component. Let the crossings of K be labelled 1,2,..., n. Each crossing of 


K will determine a local contribution at the corresponding site of S. If the site is 
inactive we retain the usual bracket contribution: 


[><| XC] =4 


inactive site. 


[x<<|><]=B 

If the site is active then we take 
[><|x< ]=a , 
x<|S< _ active Site. 


Then [K|S] is the product of all of these local contributions, and we assert that [K ] 
is given by the formula [K] = L...[K|S]. 


SF CDQ OD) 


[L] = X,[L|S] = aB + aA (1 active in S,, 1 also active in S,). Doing this the long 


way we have, 
(O>1 =P) + >) 


= A(Ad+ B)+ B(A + Bd) 
= Aa+ Ba. 


1 l 
2 
2 
S: 
; 3 S> 3 3 


[K] = \[K|S] = a@BA + aA? + a@’B. 
S 


Example 1. 


Example 2. 
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Note that for (K), a= —A?, a= —A?,-B =A! so that 
(K) = -A3-A4+A77, 
our familiar value for the bracket of the trefoil. (See section 2.) 


Example 3. 
[SO] = [CO |CO) 
IO) =(O1O1= 


These correspond graphically to basic contributions for isthmus and loop. Note 
however, that if we switch the crossing then the contributions flip (since we are 
cataloging the type of curl in the knot diagram). 


l 
R RI 


Warning. It should be clear to both author and the reader by now that the square 
bracket function we are now using is normalized to 1 on the circle: [0] = 1. Let this 
cause no difficulty with regard to our earlier convention. 


Example 4. This is really a reformulation. Note that diagrams of knots and links 
are in 1-1 correspondence with signed planar graphs where the signs are placed on 
the edges of the graph so that + corresponds to an A-channel, — corresponds to a 
B-channel. 


Our Tutte-reformulation of the generalized bracket then gives a generalized Tutte 
polynomial for signed graphs satisfying 

1) If G has i, positive isthmuses, i_ negative isthmuses, /, positive loops, /_ 
negative loops, then 


T; — Xie +]_ yi- ty 


2) If the edge indicated below is not an isthmus or loop, then 

T * = BT _-. — +AT 

T = =AT_. ~—+BT 

Thus 7,(A, B, x, y) is a graph theoretic version of the square bracket. (x = a, 

y = @ recovers previous notation). It has a Tutte expansion in terms of spanning 
trees, and should be explored for its own sake. (N.B. Ax + B? = A? + By). 
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Example 5. If K is an alternating diagram then all crossings have the same 
internal type. Thus the contributions take the form 


ne 1 
[| X_1=4 
[>< |<] =B 
[<x =a 
| >< =0 


From this it is easy to see some specifics about the topological bracket where 
B=A™}, a= —A?, a= —A’. Note that a = —A*(A~'), @= —A~*(A). Thus 
for K alternating, we have 

(K) = APT GK) (-A~4, —A*), 


inactive site 


active site 


where T is the (standard) Tutte polynomial and J denotes the number of internal 
sites on a trail, E the number of external sites on a trail. We note J— E isa 
constant independent of the given choice of trail. 

Now recall that the ambient isotopy invariant f, is given by the formula 


fy = a "XK = (-A BY" R) 
and that the Jones polynomial V,(t) is given by the formula 


Vie(t) = f(t). 
Thus for K alternating we have: 


THEOREM (THISTLETHWAITE). Let K be an alternating projection, G(K) the 
corresponding planar graph. Then the Jones polynomial V,,(t) is equal to the Tutte 
polynomial Tg, x)(-1, —t~') up to a sign and factor a power of t. 

This is a remarkable observation from which it is now easy to deduce such facts 
as: the coefficients of the Jones polynomial of an alternating link alternate in sign 
(according to parity of degree). That is, the Jones polynomial of an alternating link is 
an alternating polynomial. The reader is referred to Thistlethwaite’s paper for more 
details ({T1)). 


VIII. From Graph Theory to Knot Theory 


It is interesting to speculate about alternate realities. How could the bracket 
polynomial, and hence the Jones polynomial have emerged from graph theory?! One 
possible reconstruction is to suppose that graph theory had had in its possession our 
generalized Tutte polynomial for signed graphs. Since knots and links can be 
encoded into signed graphs, it would then have been possible to look for a 
specialization of this Tutte polynomial that gives an invariant. 

To taste the flavor of this reconstruction we must first examine the graph 
theoretic versions of the Reidemeister moves. View FIGURE 22 for this. Because of 
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Reidemeister Move Graphical Move 


II 


Ill 


the translation to graphs via shaded regions, there are two versions of the type II 
move, and two versions of the type III move. Note that the two versions of the type 
II move are signed forms of deletion and contraction, while the type I move involves 
addition or removal of a branch or loop. These relationships could have been taken 
as a hint to try a Tutte polynomial for an invariant. 

In any case, let’s do this. We begin with the generalized Tutte polynomial for 
signed graphs, as explained in example 4 of the last section. This assigns a 
polynomial 7,(A, B, x, y) to any signed graph G. 

It is characterized by the rules: (x = Ad + B, y=A + Bad). 

1) If G has only isthmuses and loops, then T,, = x'+*!-y'-*'+ where i, is the 
number of positive isthmus, i_ is the number of negative isthmus, /, is the number 
of positive loops, /_ is the number of negative loops. 


2) T + = AT +BT —. . 


+AT 4. = e—__ 


(the + edge is not an isthmus.) 


Let’s investigate the behaviour of this polynomial under type II moves. 
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PROPOSITION 8.1. In order for T,(A, B, x, y) to be invariant under type II moves 
it is necessary and sufficient that 


B=A"™} 
x= -A? 
y= — A?, 
Proof. 
T = AT + BT ee oe 
—"y - 
= A( BT +AT_H, -—) + ByT_. ¥— 
= ABT +(A*+ By)T_. e—_— 


4 
T—<>— -AT_5— +8T__,— 
= AxT + B(BT 


+AT__. «—_) 


The rest of the proof follows from these identities. 

The rest of the story now proceeds just as for the bracket. With B = A7}, 
x = —A7~}, y = —A? the polynomial TJ, becomes an invariant of moves II and III 
for arbitrary graphs. 


Definition 8.2. For any connected graph G, let Q © Z[A, A~*] be the Laurent 
polynomial defined by 2, = T,(A, A~*, —A~?, —A?). By Proposition 8.1, Q, is an 
invariant of graphical move II. Type III invariance is free: 


PROPOSITION 8.2. 2, = 5, if G and G’ are related by a type III move. 


Proof. 
Q = AQ BQ 
\~ Nt y 
type II = AQ + BQ 
(type IT) VX yw 
(type II) = AQ ¥ + BQ ne 
+ ~_— 
i. 
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Remark. We want to be able to perform the type II move even when it 
disconnects the graph. Hence we need a formula for 75,,,- where LU denotes 


disjoint union. Note that T= Ax + By. Define Tg, G: = (Ax + By)T,Tg. It is 


then easy to see that {2 has the right invariance properties for this move. Note that 
for B=A7!, x = -—A73, y= —A? we have Ax + By = —A~*— A* =d, the 
corresponding bracket factor. 


This section is just intended as a sketch of the graph-theoretical formulation. Note 
that 2, is an invariant of type II and type III moves for arbitrary (not necessarily 
planar) signed graphs. This is likely to be a useful extension of knot theory to 
arbitrary networks involving those transformations. 

What about the type I move? 2G multiplies by x or y under a type I move. In 
the abstract graph theoretic setting we do not have a direct analog of the twist 
number. Thus it remains to be seen whether 2G can be normalized to form an 
invariant of all three move types. 

To finish the translation we state the now obvious: 


THEOREM 8.3. Let G be a planar signed graph. Let K(G) be the knot /link diagram 
corresponding to G. Then ( K(G)) = Q ,. The bracket polynomial for knots and links 
is a specialization of the generalized Tutte polynomial for signed graphs. 

___ ig 

Finally, we return to the generalized Tutte polynomial 7,(A, B, x, y) for signed 
graphs G, and note that it has a spanning tree expansion. Given an ordering of the 
edges of G and a maximal tree H C G, define contributions from the edges of G as 
follows: 


+ 


internally active, 
externally active, 
internally active, 
externally active, 


internally inactive, 
externally inactive, 
internally inactive, 
externally inactive, + : 


| +/+ 1 | 


x 
x 
y 
y 

7A 
A 
B 
B 


Let G(#) denote the product of the contributions of the edges of G relative to 
activities for H. Then 


T= LG(A), 


where this summation extends over all maximal trees in G. 


Technical Caveat. It is necessary and sufficient for 7, to be well-defined that 
Ax + B* = A* + By. (Note that we are in the category of signed graphs.) This 
certainly holds for the topological case. In the general case we can rephrase this 
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condition, as we did for the square bracket, by introducing a variable d and writing 
x =A+ Bd, y =Ad+ B. This shows that the square bracket is directly gener- 
alized to arbitrary signed graphs by 7,(A, B, x, y) = 7,(A, B, d). 


IX. The Knot Theory of Imbedded Graphs 


It will not do to mention graphs without pointing out the now active extension of 
knot theory considering embeddings of arbitrary graphs in Euclidean three space. 
Here one would like to answer the usual ambient isotopy questions of knot theory in 
this larger context. In particular, we want polynomial (or other simple) invariants of 
graphs in space. The most general notion of ambient isotopy for graphs in space 
allows topological vertices. Strands coming into a topological vertex behave inde- 
pendently. Diagrammatically this means that moves are allowed (see FIGURE 23) 
that create arbitrary braiding at a vertex. At the vertex, any two adjacent strands 
can be given a twist. In general this notion of ambient isotopy is both fundamental 
and difficult. Nevertheless, progress is being made (see [S] and [W]) on the general 
classification. 


Fic. 23. A topological vertex move. 


One may also consider a rigid vertex. Here the vertex is thought of as a rigid 
object with topological strands attached at specific sites. Then I note [K8] that it is 
possible to define some useful invariants in this case. A similar approach was seen 
independently by Ken Millett. Here is a sketch of my viewpoint about the rigid 
vertices. We shall restrict ourselves to 4-valent vertices as shown in FIGURE 24. 
FIGURE 24 indicates the extra move-types that must be added to the list of 
Reidemeister moves in order to have a theory of rigid vertex equivalence. Note that 
the rigidity of the vertex forces double braiding when it is turned by 180°. I have 
denoted by III’ the analog of the type III move. The second move under IV (with a 
three strand twist) can be accomplished up to ambient isotopy by the first type IV. 
(We will not consider regular isotopy of graphs here.) My method for obtaining 
invariants of RV4-graphs (4-valent mgid vertex graphs) is to associate to such a 
graph G a collection of knots and links #(G) obtained as described below. This can 
be done for both oriented and nonoriented graphs. Here we consider only nonori- 
ented graphs. 
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- fh 


Fic. 24. Rigid vertex moves. 


Definition 9.1. Let G be an RV4-graph. Let “(G) be the collection of knots and 
links obtained from G by choosing one replacement of each of the following types 
at each vertex of G: 


mS ROMS 
A VICSICN Ie’ 


O) 


In general, if G has n rigid vertices, then “(G) will contain 4” diagrams, some 
trivial, some ambient isotopic. 


L( 
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Definition 9.2. Let X be a collection of knots and links. Two such collections will 
be said to be ambient isotopic (X ~ X’) if every member of the first collection is 
ambient isotopic to some member of the second collection and vice versa. 

The notation ~ will be used both for RV4-equivalence of graphs and for 
ambient isotopy of knots and links. 


THEOREM 9.3. Let G and G’ be equivalent RV4-graphs in three-dimensional space. 
Then their associated link collections are ambient isotopic—L(G) ~ £(G’). 


Proof. Observe that the extra moves III’ and IV (FiGuRE 24) preserve the 
elements of “(G) up to ambient isotopy. For example: 


OE IOKY 
Can JY 
? g Bo 8 y 


This completes the proof. 


This is a very useful theorem for studying RV4-graphs. For example, we can 
immediately conclude that the graph G in the example above is not equivalent to its 
mirror image. For if this were so then the individual knots and links in #(G) (being 
distinct) would have to each be achiral. We can then check that this is not the case 
by using our results about the Jones polynomial for alternating knots. Recall that 
we have shown that if K is achiral then 3w(K ) = W — B (after Theorem 3.1) where 
w(K ) is the twist number of K and W and B are the numbers of white and black 
regions in a shading where the A-channels are shaded. Choosing K, € #(G) we 
find 


W=3 
B=5 
W-B=-2 
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Since —6 # —2 we conclude that Simon’s graph G is not equivalent (RV4) to its 
mirror image. (Wylbur Whitten has verified by other techniques that this graph is 
not topologically equivalent to its mirror image.) 


Example. 


G’ 


Examination will reveal that this graph is obtained from the graph G of the 
previous example by reversing two crossings. 


{QO QD Ve e 


L(G’) 


We see that to prove G is not equivalent to G’ it suffices to show that Kj is not 
ambient isotopic to K,. But we also know from section 3 that span( Kj) < 4V where 
V is the number of crossings in this diagram, since K; is a non-alternating diagram. 
On the other hand, span(K,) = 4V (V = 7) since K, is an alternating diagram. 
Therefore, without further calculation we know that K, and K» are different and 
hence that G and G’ are not RV4-equivalent graphs. 

To further verify that G’ is chiral requires a calculation of (K3). We omit this 
and assert that the calculation indeed shows that KJ is chiral, and hence that G’ is 
RV4-chiral. 

These examples and our theorem relating equivalences of graphs with ambient 
isotopy for collections of knots and links show how there can be a good collabora- 
tion between problems of graph-imbedding and new invariants of knots and 
links such as the Jones polynomial. Since graphs can be used to model the 
comigurations of molecules and other .naturally-occuring networks, it is to be 
expected that there will be many fruitful applications of these ideas. 

It is interesting to note that in the case of RV4-graphs there is a mixed-mechani- 
cal /topological model that is nevertheless susceptible to a topological analysis. This, 
in itself, is a good sign for applications where there will always be a mixture of 
topology and other structures. 
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X. Patterns and Speculations 
I always thought that the Conway identity 


oe ~V\hu =7\L. 


bore a striking resemblance to the exchange identities of quantum physics such as 
the Heisenberg form of the uncertainty principle PO — QP = hi. And that the 
crossing —><_" and its reverse << were something like a complex number and its 
conjugate. 

From the present vantage these speculations are not nonsense at all. They appear 
like hints about the remarkable connections that have subsequently appeared. The 
operator algebras that produced the Jones polynomial and generalizations were 
traditionally studied with quantum physics as the technical and inspirational source. 
The writing of the Conway identity into a Hecke algebra related to braids via 
o,— 6, =z is a direct algebraic version of this sort of identity. 

Another view of complex numbers lets us think of [A, B] and [B, A] as 
conjugates. If W=a-+ ib let A = (1/2)(a + b), B = (1/2)(a — b).) Then [A, B] 
— [B, A] =(A — BY, —1] = (A — B)V~-1 has a formal resemblance to both the 
Conway identity and the Heisenberg formula. And in the bracket expansion 


[=<] = A4[—=] + BLD 
[><] = B[ ==] + 4[) (J 


the reversal of a crossing appears as this form of conjugation. 
Simple aspects of formalism lie at the root of these similarities. Beyond all the 
conceptual apparatus, the fundamental point is that an unoriented crossing 


4 


discriminates two regions out of four. All the rest, whether in braids or in diagram 
form builds on this distinction. Formulas like PQ — QP = hi rely on the left-right 
distinction along a line. And a formula such as [><] = A[——] + Bl) ( ] relies on 
the discrimination of characters along a line that are identical after a rotation. 

Thus we are initially mapping one simple order into another. These are designer’s 
comments: for the possibility of distinguishing handedness of topological objects in 
three-dimensional space goes back through diagrams, symbols and distinctions to 
the possibility of finding handedness in the plane. 

And the simplest forms of left and right in the plane are Cand©  . Thus 


the detection and discrimination of OO and © (or © and @® )ina 
formalism may be fundamental to its sensitivity to handedness. This is my personal 
explanation for favoring regular isotopy and detecting the curls in the diagrams. 
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There are more questions than you can shake a stick at. What deeper insight will 
unlock the really hidden secrets of these diagrams? Does the Jones polynomial, or 
its generalizations detect knottedness? Is there a further relationship to physics 
based on these diagrams (a la Feynman diagrams or Penrose spin nets)? How can 
one understand the mirror image problem for knots and links completely? What is 
the relationship between these new techniques and the classical methods using 
homotopy theory, covering spaces and algebraic topology? What information about 
slice knots and knot concordance is in the new invariants? What is the next simple 
idea that will turn the subject upside down? How do you follow the hints? 
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If f(x) =a. t+ a,x + -:- +a,x" is a polynomial of degree n over the finite 
field F, of order g = p* where p is a prime and 5 >1, does the associated 
polynomial function f: c > f(c) from F, into F, permute the elements of F,, 1.€., is 
f a 1-1 map of F, onto itself? If f is a permutation of F, then the polynomial f(x) 
is called a permutation polynomial (PP) of F,. Very little is known concerning which 
polynomials are PPs, despite the attention of numerous authors (see the notes to 
Chapter 7 of [13]). Very few good algorithms exist to test whether a given 
polynomial is a PP. We describe several applications of permutations which indicate 
why a study of permutations is of interest, and we list, without proof, the known 
classes of PPs and the known criteria. We indicate a number of open problems and 
make some conjectures. 

Recently, permutations of finite fields have become of considerable interest in 
the construction of cryptographic systems for the secure transmission of data. See, 
for example [12, Chapter 9] and [10]~[11]. Let M be a message (an element of F;,) 
which is to be sent securely from A to B. If P(x) is a permutation of F,, then A 
sends to B the field element N = P(M). Since P(x) is a bijection, B can obtain the 
original message M by calculating P~'(N) = P~'(P(M)) = M. In order to be 
useful in a cryptographic system, P(x) must have several additional properties; see 
problem P3. Cryptographic systems of a different type, based on permutations of 
finite fields, were considered by Levine and Brawley in [10]. 


*Research was supported by the Australian Research Grants Committee, grant F8415183. 


**This author would like to thank the mathematics department at the University of Tasmania for its 
hospitality during a recent visit when discussions which led to this paper took place. 
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Permutations are also useful in several combinatorial applications. The reader is 
referred to Theorems 9.67 and 9.83 of [13] as well as [4] and [19]-[20] for 
illustrations thereof. 

We now list the major known classes of PPs. If f(x) is a PP of F,, then so is 
f(x) = af(x + b) + ¢ for all a # 0, b,c € F, and we say that f,(x) is in normal- 
ized form if a, b, c are chosen so that f,(x) is monic, f,(0) = 0, and (provided the 
characteristic p does not divide the degree n) the coefficient of x”~' is 0. 

(1) In [5S, p. 63] Dickson lists all normalized permutations of degree at most 5. 

(2) x* permutes F, if and only if (k, g — 1) =1. 

(3) If a € F,, then the Dickson polynomial 


[k/2] k k ~j 
a(x, a) = | 


Naya 


j=0 k-j J 


permutes F, if and only if (k, q* — 1) = 1 [13, Theorem 7.16]. 

(4) If F,, is an extension of F, of degree r, then L(x) = L'a ta,x? witha, € F, 
is a linear operator on F,, over F, and it permutes F,, if and only if det(a%” pro 
(i, 7 = 0,1,...,7—1). If each a, € F, then L(x) permutes F, if and only if 
(x’rha,x', x” — 1) = 1 [13, pp. 362, 390}. 

(5) If r > 1 is prime to g — 1, s divides g — 1, and g(x*) has no nonzero root in 
F,, where g(x) © F_[x], then x’(g(x°))9~?” permutes F, [13, Theorem 7.10]. 

(6) Several other specific classes of PPs have been characterized [13, Chapter 7], 
eg, x(atm-D/m™ +4 ay where m divides g—1 and x'(x4—a)?'~)/4, where 
d\( p" — 1). 

This list is not exhaustive, but it does contain all known major classes: so we 
don’t have very many types of PPs. 

Suppose we wish to determine whether f(x) = X7_,a,x' permutes F,. We may 
assume that n < g — 1 since every function from a finite field F, to itself can be 
represented by a polynomial of degree < gq [13, p. 369]. If g is small, one could 
proceed by calculating f(a) for all a © F, and checking whether the g values f(a) 
are indeed distinct. However since this involves O(qn) F, operations for a poly- 
nomial f of degree n, this becomes prohibitive if g is large. 

The first, and in some ways most useful, criterion was proved by Hermite [8] for 
g prime, and by Dickson [6] for general g. 


THEOREM. f(x) permutes F, if and only if 
(1) f has exactly one root in F, and 
(2) For each integer t with1 <t<q-—2 with t#0 (mod p), the reduction of 


[ f(x)]'mod(x? — x) has degree <q — 2. 
COROLLARY. If f(x) is a PP of F, of degree n > 1, thenn t (q — 1). 


London and Ziegler [14], Raussnitz [21], and Mollin and Small [16] investigated 
criteria for f(x) to be a permutation in terms of the coefficients of f(x). Two 
sufficient conditions can be found in Carlitz and Lutz [2]. 

We now list a number of open problems and conjectures involving PPs of finite 
fields. 
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Pl. Find an algorithm of lower complexity than O( gn) to test whether a given 
polynomial is a PP of F,. 

P2. Find new classes of PPs of F,. 

P3. Find new classes of permutations P(x) that are useful cryptographically. 
Hence P(x) should have a simple form so that if M is a message, then N = P(M) 
which is sent from A to B can be easily computed. Also P(x) must have the 
property that without some secret information (the key) that only A and B know, 
P~*(x) will be hard or impossible to get, so that an unauthorized receiver cannot 
calculate P~'(N). At the same time with knowledge of the key, P~'(x) is easily 
obtained by B so that P~'(N) = M can be recovered by B. 

P4. The Chebyshev polynomial of the second kind of degree k is defined by 


fx) = (F>A\(-ayxe 


i=0 


It can be shown that f,(x) = xf,_1(x) — f,_2(x) for k > 2. For odd q determine 
necessary and sufficient conditions on k and q so that f,(x) permutes F,. In [15] 
Matthews showed that for odd gq, if k + 1 = +2(mod m) for m = p, (q — 1)/2, 
and (q + 1)/2, then f,(x) permutes F,, and in fact then, f/,(—a) = —f,(a) and 
f,(a) = +a for all a € F,. Based upon computer evidence, we conjecture that if ¢ 
is prime, then the above conditions are also necessary. Moreover if the characteristic 
p > 5 it appears that the conditions are both necessary and sufficient for f,(x) to 
permute F. For p = 3 and 5, there are examples of q > p and of k such that /,(x) 
permutes F’, but k does not satisfy the above set of congruences. 

P5. Extend Dickson’s list [5, Table 7.1] of normalized PPs to higher degrees. 

P6. Let N, = N,(q) denote the number of PPs of degree d over F,. We have the 
trivial boundary conditions N, = g(¢g — 1), N,=0 if d|(q— 1), and XN, = gq! 
where the sum is over all 1 < d < g— 1 with dt (q — 1). Find N,,. Some partial 
results in this direction were given in Wells [24]. 

P7. Suppose f(x) is a PP of degree p over F,. Dickson [5] conjectured that f(x) 
is reducible to the normalized PP x(x¢ — a)(?~)/4 where d|(p — 1) and a # B4 
for any B & F,, having earlier proved this for p = 3, 5, and 7 in his thesis [6]. Settle 
this conjecture for all odd primes p. 

P8. Settle the conjecture of Chowla and Zassenhaus [3] that if p is a sufficiently 
large prime and f(x) of degree > 2 permutes F,, then f(x) + ax withO <a<p 
is not a PP of F,. For several partial results, see [20] and [17]. 

P9. In an invited address before the MAA in 1966, Carlitz conjectured that for 
each’even positive integer k, there is a coristant C, such that for each finite field of 
odd order q > C,, there does not exist a PP of degree k over F,. Lausch and 
N6Obauer [9, p. 202] proved the conjecture for k = 2” and for k = 6 and 10 the 
conjecture has been proven by Dickson [6] and Hayes [7], respectively. Recently 
Wan Daging [22] settled the cases k = 12 and 14. A major result would be a proof 
of the Carlitz conjecture for arbitrary even k. 

Many similar questions can be considered for polynomials in several variables 
over F.,, see e.g. [13, Sect. 7.5] and for residue class rings of integers, see for example 
[9] and [18], where there are numerous other references. 
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Homomorphisms from Z,,,[i] into Z, [i] and Z,,[p] into Z,[o], 
Where i? + 1 = Oand p? +p +1=0 


JOSEPH A. GALLIAN 
Department of Mathematics and Statistics, University of Minnesota, Duluth 55812 


DOUGLAS S. JUNGREIS 
Department of Mathematics, Harvard University, Cambridge, MA 02138 


In a recent note [1] it was shown that the number of group homomorphisms from 
Z into Z,, is gcd(m, n) and the number of ring homomorphisms from Z,, into Z,, 
is 2°(") — @(" /Ecd(m,")) Where w(u) denotes the number of distinct prime factors of the 
integer u. In this note we solve the analogous problems for Z[i] and Z[p], where 

*+1=Oand p?+pt+1=0. 

We begin with the groups Z[i] and Z[p]. The number of group homomorphisms 
from Z,,[i] into Z,[i] and from Z,,[p] into Z,[o] is (gcd(m, n))*. This follows 
directly from the solution in Z, since the groups Z,[i] and Z,[o] are each isomor- 
phic to Z, ® Z,. In the ring case, the solutions and their derivations are more 
complicated. 

For convenience, let G = Z[i] denote the ring of Gaussian integers and for any 
Gaussian integer a let G, denote the factor ring G/(a). We seek explicit formulae 
for the number of homomorphisms from G into G,, and from G,, into G,,. 


THEOREM 1. The number of ring homomorphisms from G into G,, is c,,- 3°”, 
where w(n) is the number of distinct prime factors of n in G and where c, = 1 if 
4tn,c, = 5/3 if 4|n but 8 tn, c, = 3 if 8In. 


Proof. First observe that a homomorphism from G into G, is completely de- 
termined by its action on 1 and i. Since 17 = 1,1-i =i, and i? = —1 in G, the 
images of 1 and i, call them a and b respectively, satisfy a* = a, a: b = b, and 
b? = —a inG,. Let n = p® --- p* be the prime-power decomposition of n in G. 
Since, by the Chinese Remainder Theorem, G,, is naturally isomorphic to G,, ® 
Gyr. ® +--+ ®G,«,, a homomorphism from G into G,, induces a homomorphism ‘from 
G into G, ky for each j. In the direct sum, let a correspond to (a;, d5,..., a,) and b 
correspond to (b,, b5,..., 5,). 

Then a; = = a,;mod ph , or equivalently p; ‘la, (a; — 1). Since G is a unique 
factorization domain and a, and a,;—1 are relatively prime either pila; or 
pi|(a; — 1). Thus either a; = = 0 or a,= 1. If a; = 0, then b; = a,b, = 0. If a; = 1, 
then b? = —a,= —1, so (b, + i)(b — i) = 0, and pii\(b, + i)(b, — i). Suppose 
p; #1 +i. Then, since the only prime divisor of 27 is 1 + i (up to associates), we 
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and be ¢ P; cannot divide both factors b, + i and 5, — i. Thus pj|b, + i or pj |b, —i 
an 

When >, = 1 + i, complications arise because (1 + 7)? = 2i, so that 1 + i can 
divide both factors. Since n is an integer and |1 + i] = V2, we see that k, j> the 
exponent of p, = 1 + i, must be even, say k, = 2t. We consider three cases. Since 
Gok, is isomorphic to G, or G, respectively ‘when k, = 2 or 4, direct calculations 
show that there are respectively 2 or 4 possibilities for b;. In the case k; > 6, we 
must observe that (1 + i)? divides one factor if and only if it divides the other, but 
(1 + i)? cannot divide both factors. Thus (1 + i)*~? must divide one of the factors. 
Because (1 + i)©~* = (1 + i)*)‘~! = 2°* (up to associates) and {2'~'(r + si)} 
has 4 elements in Gy ~ Gs, we see that (1 + i)~* divides 4 elements in G,«,. This 
gives 8 possible values for by 

Thus far we have determined necessary conditions for the existence of a homo- 
morphism from G into G,. Conversely, if (a), a5,..., a,) and (by, by,..., b,) are 
any elements of the direct sum which satisfy the above conditions, then there is a 
homomorphism from G into G,,. So, the number of homomorphisms is as claimed. @ 


To determine the number of homomorphisms from G,,, into G,, we simply observe 
that the same conditions as above are necessary, and, furthermore, when a; = 1 in 
Gos we must have ma, = m = 0 in Gx ky (since m- 1 = 0 in G,,). So, whenever P; k; 
does not divide m only the trivial homomorphism to that component of the direct 
product is possible. Thus, there is one homomorphism instead of 3, 5 or 9 as before 


and we must reduce the exponent of 3 or 5 accordingly. This proves the following. 


THEOREM 2. The number of ring homomorphisms from G,, into G, is C, 
Zo(m)— o(n/gcd(m,”)) where w(k) is the number of distinct prime factors of kin G and 
c, = 1 if either 4+ n or 2|(n/gcd(m, n)), c, = 5/3 if 2 + (n/gcd(m, n)) and 4|n 
but 8 tn and c,, = 3 if 8|n and 2+ (n/gcd(m, n)). 


We remark that for any integer k it is easy to determine w(k) in G from the 
integer prime-power decomposition of k. Indeed, w(k) in G is the number of 
distinct integer prime divisors of k plus the number of integer prime divisors of k 
that are congruent to 1 modulo 4. This follows from the fact that 2 and prime 
integers that are congruent to 3 modulo 4 have only one Gaussian prime divisor (up 
to associates) while prime integers that are congruent to 1 modulo 4 have two. 

Another familiar ring of algebraic integers is Z[p] where p* + p + 1 = 0 (see 
Chapter 12 of [2] for a thorough treatment of Z[p]). This ring was introduced by 
Eisenstein and Jacobi in their work on cubic reciprocity. The arguments used in 
Theorems 1 and 2 can be adapted to yield the following two results. 


THEOREM 3. The number of ring homomorphisms from Z{p] into Z,[p] is c,, - 3°™, 
where w(n) is the number of distinct prime factors of n in Z[p] and where c,, = 1 if 
3tn,c, = 4/3 if 3|n but 9+ n, andc, = 7/3 if 9|n. 


Proof. Let the images of 1 and p be a and b, G = Z[p] and G, = G/a). Then 
a*=a,a-b=b and b>+b+a=0inG.. Let a; and b, be as in the proof of 
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Theorem 1. Then, as before, a, = 0 or 1 and the former implies 5 = 0. If a, = 1, 
then 5; + b+ 1=0 so (b,- p)(b,- 9p?) = 0 and pi|(b; — p) 
(b;— p 2y, Suppose p; # 1 — p. Then since the only prime divisor of the difference 
of these factors 1 Is, 1 — p, (up to associates), we know p, cannot divide both factors. 
Thus, 5; = p or 0. 

When Pp; = 1 — p, complications arise because 1 — p can divide both oe 
Since n is an integer and |1 — p| = V3, we see that k ,, the exponent of p; = 1 — 
must be even. We consider two cases. Since Gp, 1S isomorphic to G, when k,= 9, 
direct calculations show that there are 3 possibilities for 5. In the case k, > 4, 
(1 — p)*~! divides one factor. Because Z[p]/{1 — p) has 3 elements, (1 — py “1 
divides 3 elements in GyX,. This gives 7 possible values for b,. The results now 
follows as in Theorem 1. 


THEOREM 4. The number of ring homomorphisms from Z,,|p| into Z,[p] is 
c,- 3e(m~ e(n/ecd(m, ™)) where w(k) is the number of distinct prime factors of k in Z[p| 
and c, = 1 if either 3 +n or 3|(n/gced(m, n)), c, = 4/3 if 3 t (n/gced(m, n)) and 
3|n but 9 +n, andc, = 1/3 if 3 + (n/gcd(m, n)) and 9{n. 


Proof. The analysis is identical to that given in the proof of Theorem 2. & 


The authors are indebted to I. H. Rose and David Witte for detecting errors in our original versions 
of Theorems 3 and 4. 


This work was done while the second author was visiting the University of Minnesota, Duluth. Both 
authors were supported by the NSF (Grant number DMS 8407498). 
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An Elementary Proof That a Bounded a.e. Continuous Function 
is Riemann Integrable 


MICHAEL W. BOTSKO 
Department of Mathematics, Saint Vincent College, Latrobe, PA 15650 


In [1, p. 142], the following Theorem is given. 
THEOREM 1. I 'f f is continuous on [a, bj, then it is Riemann integrable there. 


For the proof the function F: [a, b] — R is defined such that 
F(x) = fr) dt — fF) dt fora<x<b and F(a) =0. 
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It is then noted that F'’(x) = 0 for all x in [a, b], which implies that F is constant 
on [a, b]. From this fact it is easy to see that the upper and lower integrals are 
equal. 

It seems to be desirable to have a proof of this type, on the advanced calculus 
level, for the more general situation where the function f is bounded and continu- 
ous a.e. on [a, b]. It is the purpose of this paper to present such a proof. We first 
recall the definition of a set of measure zero. 


DEFINITION. A set £ of real numbers has measure zero if for each « > 0 there is 
a finite or infinite sequence {J,} of open intervals covering EF and satisfying 
~,l2,| < € where |J,| is the length of /,. 


We begin by giving an advanced calculus proof of the following well-known 
result. 


LEMMA 1. If f: [a, b] > R satisfies a Lipschitz condition and f'(x) = 0 except on 
a set of measure zero, then f is a constant function. 


Proof. Let « and 6 be arbitrary positive numbers. Since f satisfies a Lipschitz 
condition there exists M > 0 such that |f(x) — f(y)| < M|x — y| for all x, y in 
[a, b]. Let D = {x: x € [a, b] and f’(x) = 0} so that [a, b] \ D has measure zero. 
For 5/M > 0 there is a sequence { J,,} of open intervals such that [a, b]\\D ¢ UI, 
and »,,|I,,| < 6/M. Now for IJ = [c, d] any closed subinterval of [a, b], we will say 
that J is a type 1 interval if |f(d) — f(c)| < e(d — c) and a type 2 interval if there 
exists J © {J,,} such that J c J. We now define the set 


S = {x:x € (a, b] and there exists P, = {xo9, x1, X2,...,X,} a partition of 
[a, x] such that for each k, [x,_,, x,] is either a type 1 or type 2 interval}. 


We first note that S # ©. If a € D, then f’(a) = 0 and there exists x > a such 

that |f(x) — f(a)| < e(x — a), which says that x € S. (Note that a partition may 

contain only two points.) On the other hand, if a € D then a € J an interval in 

{I,,} and, since J is open, there exists x > a such that [a, x] C J and again x € S. 
Next we show that if x € S, then 


f(x) — f(a) < 8 + e(b - a). 
To see this take a partition P, = {xo, x1, Xz,...,xX,} of [a,x] such that each 
[x,—1,X,] is either a type 1 or a type 2 interval. Clearly 


f(x) — f(a) = Lf lr) — f(xx-1) 
so that 


f(x) — f(a) < Lif(%) — f(%-I 
= Life) — flav + LP Ca) — fn. 


(1) (2) 
(The first summation is taken over type-1 intervals of P,. and the second over type-2 
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intervals of P,.) 


< Ve(x, — X41) + M(x, — X4-1) 
(1) (2) 


< e(b-—a) + M) (x, — X,_4) 
(2) 


<e(b-—a)+6. 


The last inequality holds since &,, |J,,| < 6/M and the subintervals involved are 
nonoverlapping type-2 intervals. Thus x € S implies that 


f(x) — f(a)| <8 + e(b — a). 


Now let c = the least upper bound of S. It follows at once that a < c < b and 
since f is continuous, |f(c) — f(a)| < 6 + e(b — a). We need to show that c = b. 
Suppose c < b. If c © D then f’(c) = 0. Thus there exists r > 0 such that [c — r, 
c +r) C (a,b) and 


f(x) —f(c)| <elx —c| ifxele—rjetr]. 


Since c — r 1s not an upper bound of S, there exists x € S such that c —r<x<c. 
Thus there is P,, = {Xo, X1, X2,...,X,}, a partition of [a, x] such that each subin- 
terval is either type 1 or type 2. Clearly P = {xo, x1, X2,...,X,,c +r} Is a 
partition of [a, c + r] where each subinterval is of type 1 or type 2. Thusc +re S$ 
which is a contradiction. If c € D then c € J for some J in {J,,}, and there exists 
r > Osuch that [c — r,c + r] C J. Again there is an x € S suchthatc —~r<x<ce. 
By an argument similar to the previous case c + r € S, again a contradiction. Thus 
c = b and 


f(b) — f(a] < 8 + e(b — a). 


Since 6 and « are arbitrary positive numbers, f(b) = f(a) and it clearly follows that 
f(x) = f(a) for all x in [a, b], which completes the proof. 


We can now easily accomplish the main task of this paper. 


THEOREM 2. If f:[a,b] > R is bounded, and continuous except on a set of 
measure zero, then f is Riemann integrable on [a, b]. 


Proof. Let 
F(x) = f f(t) at - fr) dt ifa<x<b and F(a) =0. 


Since f is bounded, F clearly satisfies a Lipschitz condition on [a, b]. (To see this 
use Lemmas 5.2e and 5.2g in [1, pp. 137—138].) Also by Lemma 5.2h in [1, p. 139], 
F’(x) = f(x) — f(x) = 0 for each x wheré f is continuous. Thus F'’(x) = 0 except 
on a set of measure zero and by Lemma 1, F is a constant function. Thus 


0 = F(a) = F(b) = ['4() ae [7 dt. 
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Thus f is Riemann integrable on [a,b] since the upper and lower integrals are 
equal. 
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L’H6pital’s Rule in a Poisson Derivation 


WILLIAM P. COOKE 
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Elementary derivations of the Poisson distribution have two basic forms, which 
we might call direct and indirect. Given assumptions of independence and negligi- 
bility of probabilities for more than one arrival in a small time interval, the 
assumption that the probability of an arrival in a small time interval is proportional 
to the length of the interval implies a direct derivation (cf. [1]). When we just assume 
that the arrival probability depends on the length of the interval, then we must first 
develop the exponential density of interarrival times—the indirect approach (cf. [4]). 
Here L’H6pital’s rule (see [3, p. 121]) will be used in the indirect approach to yield a 
simple alternative to the usual derivation as shown in Wagner [4]. 

The direct derivation is more suitable for a first course in probability, where 
discrete distributions are usually discussed early. In a course on queuing theory or 
operations research, however, this alternate indirect derivation could simplify the 
teaching of the topic. 

From the assumptions in Wagner [4] the exponential density of interarrival times 
follows: 


r —At), A>O,t>O0 
f(t) = | exp(—A2) (1) 
, elsewhere. 


Define P,(t) as 
P(t) = Prob[n arrivals in the interval (0, t)]. 
Our independence and negligibility assumptions give 
P(tth) =P,(t)Po(h) + P,_()P, (A), n=1,2,..., (2) 


if h > 0 is regarded as being very small. Subtracting P(t) from both sides of (2), 
dividing by h, and taking the limit gives the derivative 


Pi (t) = lim [{ P,(#)[ PoC) —1 +P) P(A)}/A], 2 =1,2,.... (3) 
From (1) we obtain 
P,(h) = [ rexp(-a.) dt = exp(—Ah), (4) 
and our negligibility assumption gives 
P,(h) = 1— P)(h) = 1 — exp(—AA). 
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Then (3) becomes 


Py(1) = P,(t) lim {lexp(—Ah) — 1] /h} 


(5) 
+ P,-1(t) lim {[1 —exp(—Ah)|/h}, = n=1,2,.... 


Each of the two limits in (5) has the indeterminate form 0/0, but one application 
of L’H6pital’s rule to each limit produces 


P’(t) = -AP,(t) +AP,_,(t), n=1,2,.... 


Then we can use either mathematical induction or the probability generating 
function (cf. [2]), along with (4), to show that 


P,(t) = [(At)"exp(—Azr)| /n!, n= 0,1,2,..., 
the Poisson distribution with parameter Xt. 
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Problems that Teach the Obvious but Difficult 


NEAL KOBLITZ 
Department of Mathematics, University of Washington, Seattle, WA 98195 


Most of our students are supposed to be learning calculus and other subjects in 
order to be able to recognize and apply the concepts in contexts which may be far 
removed from the settings that typify textbook problems. The transition is likely to 
present obstacles for them. Some simple matters which we take for granted and 
generally ignore should, in my opinion, be a major consideration in our design of 
exercises. I would like to illustrate by discussing four of my favorite calculus word 
problems. 


Problem 1. Find the centroid of Nevada. 


Here we make a slight simplification, giving Nevada a small piece of Arizona, so 
that it becomes a perfect trapezoid. We also assume that 1° is a constant distance 
throughout Nevada. The student is given the latitude and longitude of the four 
corners of Nevada: 42°N, 120°W; 42°N, 114° W; 39°N, 120°W; 35°N, 114°W. This 
problem at first stumps many students because of the absence of any equations and 
the unfamiliar form in which the coordinates of the corners are given. The first step 
in solving the problem is to decide how to set up the xy-axes. One can choose the 
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conventional picture, with the point (0°, 0°) (the equatorial point on the Greenwich 
meridian) as origin and east as the positive x-direction. But in that case, the student 
is confronted with fairly large negative numbers for the coordinates and is almost 
guaranteed to make an error in computing the equation of the slanted side. The 
shrewder move is to choose the origin to be one of the two corners along the slanted 
side, and define x and y to be degrees longitude and latitude relative to that 
reference point. Then the student can easily find the equations of the top and 
bottom of Nevada, and the rest of the solution is routine even for the average 
student. 


Problem 2. When crude oil flows from a well, water is frequently mixed with it in an 
emulsion. To remove the water the crude is piped to a device called a heater-treater, 
which is simply a large tank in which the oil is warmed and the water is allowed to settle 
out. Operating experience in a particular oil field indicates that the concentration C of 
water in the treater’s output can be modeled by the following equation in a neighbor- 
hood of the usual operating point of 135°F and a 2-hour holding time: 


C = 0.04 — 0.0032h? — 10~°r? + 4.1 - 10~°hAr, 


where h is the holding time in hours and t is the operating temperature in degrees F. 
(a) Because of random fluctuations in the well’s flow rate, the holding time actually 
varies slightly around 2 hours. Suppose you are given a simple control device that can 
change the tank temperature proportionally to the measured change in holding time. 
What constant of proportionality would best compensate for small holding time fluctua- 
tions and keep the water concentration as constant as possible? (b) Now as field 
equipment ages, its maximum operating settings are generally decreased. Find the 
equation of the line that best approximates the way in which the holding time would 
have to be increased as the maximum temperature rating falls slightly below the usual 
operating temperature. 


All this problem asks for is the derivative (dt/dh in part (a) and dh/dt in part 
(b)), which is easily found by implicit differentiation. But most students find this 
exercise quite difficult, because they have trouble distilling the essence of the 
question from all the words. They are thrown off because the derivative and the 
tangent line are described using “practical” rather than mathematical terms. Unfor- 
tunately, most students are so locked into the textbook terminology that they do not 
necessarily recognize the concept of derivative when it occurs “in nature.” 

Problem 2 was given to me by an engineering student, who said it was a problem 
he‘had been asked to solve while working one summer for an oil company. 


Problem 3. Your friend is at the top of a building which you know to be exactly 100 
feet high. She throws a stone downward at UV, ft/sec, and you time how long it takes to 
hit the ground. Neglect air resistance, so that the distance (in feet) fallen by the stone 
in t seconds is given by the formula s = 16t? + uot. If you time 2 seconds for the stone 
to fall to the ground, and if your stopwatch is accurate only to within +0.1 sec, at 
what downward speed did your friend throw the stone? Include an error estimate in your 
answer. Use the tangent line approximation. 
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Here a student who has just mastered the derivation of the falling body formula 
is almost certain to attempt to treat ¢ and s as the variables and vu, as a constant. 
How can vy not be a constant? Once that psychological barrier is surmounted, and 
the student realizes that in this particular problem s is constant (= 100) and Uy is 
the dependent variable, the error in the value for vy is seen to be simply 
(+0.1)(dv,/dt). 

Here is a problem of a similar sort for more advanced students, who have studied 
functions of two variables. 


Problem 3A. If an object falls vertically from rest with air resistance proportional to 
velocity, then its velocity at time t is given by the formula v(t) = mo —e“') Here 


g = —9.8 m/sec? is known exactly, and k, the constant of proportionality between the 
air resistance and the velocity, is measured experimentally with some error Ak. 
(a) Derive a formula for the height s(t) in terms of the initial height s,. and g, k, t. 
(b) Suppose you measure that it takes 2.0 seconds for the object to hit the ground, with 
an error of At in this measurement. Your wind tunnel experiments give a value of 0.1 
for k, with an error of Ak. Find a simple formula for s, in terms of Ak and At. Use the 
tangent plane approximation. 


The formula for v(t) in this problem is obtained by solving the differential 
equation du/dt = g — kv with initial condition v(0) = 0; then one obtains s(t) = 


So + 72 + e~*'— 1) by integration. After working with ft, v, and s as the 


variables and g, k, s,. as the constants, the student must then shift gears in order to 
answer the question in part (b). That is, she or he must set s(t) equal to zero and 
regard s, as the dependent variable, which is a function of the two independent 
variables k and ¢t. The answer: 18.4 — 11.8Ak + 17.8At. 


Problem 4. You are standing on the ground at point B (see diagram), a distance of 
75 feet from the bottom of a ferris wheel that is 20 feet in radius. Your arm is at the 
same level as the bottom of the ferris wheel. Your friend is on the ferris wheel, which 
makes one revolution (counterclockwise) every 12 seconds. At the instant when she is 
at point A you throw a ball to her at 60 ft/sec at an angle of 60° above the horizontal. 
Take g = —32 ft/sec’, and neglect air resistance. Find the closest distance the ball 
gets to your friend, using Newton’s method to obtain an answer which is accurate to 
within 1/2 foot. 


20 40 60 80 
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In this problem the student must first organize all the information that is 
presented and divide the problem into manageable steps. These steps might be: 
(a) ignoring the ball for a minute, find the friend’s angular velocity in rad/sec, and 
find parametric equations for the friend’s position at time ¢; (b) leaving the friend 
for a minute to look at the ball, resolve the ball’s initial velocity into x- and 
y-components (noticing that the horizontal motion is in the negative x-direction), 
and find parametric equations for the ball’s position at time ¢; (c) express the 
distance between the ball and the friend in terms of ¢; (d) find the derivative of the 
expression in part (c) and set it equal to zero (a slightly simpler approach would be 
to minimize the square of the distance); (e) guess an approximate time ft, when the 
ball is fairly close to the friend (take t, = 2), and use Newton’s method with one 
iteration to find a better approximate value t, when the ball is closest to the friend; 
(f) use part (c) to find the distance at time ¢,; (g) check that taking a second 
iteration in Newton’s method does not appreciably change the value you get for the 
closest distance. The answer turns out to be: 1.9 ft when ¢ = 2.22 sec. 

Most textbook problems involve only one technique. Problem 4, on the other 
hand, employs almost all of the topics in a standard first-semester calculus course: 
use of trig functions, derivatives of trig functions, the chain rule, parametrized 
curves (falling body problems, circular motion), distance formula, max/min, and 
Newton’s method. In this way it demands that the student bring together a variety 
of techniques into a multi-step thought process. 

These four problems illustrate some aspects of the transition from mechanical 
“pure-math” problem solving to “real-world” problem solving: the value of choos- 
ing a convenient setup of the coordinate system; the need to recognize basic 
concepts, such as the derivative, when they are disguised in non-mathematical 
words; the importance of determining at the very beginning which letters stand for 
constants and which letters stand for variables; the usefulness of breaking up a 
problem into bite-sized chunks. This might all seem obvious to us; but unless 
students are drilled with problems that are designed to illustrate these features, they 
are likely to have trouble using the material from their math courses in their 
subsequent work in other fields. 
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E 3253. Proposed by Sydney Bulman-Fleming and Edward T. H. Wang, Wilfri. 
Laurier University, Waterloo, Canada. 


(a) Let n and ¢ be natural numbers with n odd and ¢ <n. Suppose x; is an 
integer not divisible by n for i=1,2,...,¢. Determine the maximum possible 
number of pairs (i, j) with 1 < i <j < ¢ such that x, + x, = 0 (mod 2). 

(b) Let k, n, and ¢ be natural numbers with n relatively prime to k and t < n. 
Suppose x, is an integer not divisible by n for i =1,2,...,¢. Determine the 
maximum possible number of k-tuples (i,, i,,...,i,) with 1<i,<i,< ++: <k 
< t such that x, + x;,+ ++: +x,, = 0 (mod n). 


E 3254. Proposed by David Singmaster, Polytechnic of the South Bank, London, 
England. 


If a » and ¢ are given positive integers, define a sequence {a,,} by letting a,,,, be 
the positive integer obtained by reversing the digits in the ordinary decimal 
expansion of a, +c (n=0,1,2,...). For example, if a) = 18 and c =2, the 
sequence begins 18, 2, 4, 6, 8, 1, 3, 5, 7, 9, 11, 31, 33, 53, 55, 75, 77, 97,.... 

(a) If c = 1, show that the sequence {a,,} is eventually periodic and that all such 
sequences have the same ultimate cycle. 

(b) If c = 2 and 1 < a, < 99, show that the sequence {a,,} is eventually periodic. 
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(c)* If c = 2, show that the sequence {a,,} is eventually periodic for any choice 
of ao. 


E 3255. Proposed by Paul Erdés, Hungarian Academy of Sciences. 


What is the minimum number of edges in a connected n-vertex graph such that 
every edge belongs to a triangle? 


E 3256. Proposed by John Isbell, SUNY at Buffalo. 


(a) Let T be the set of triangles in the plane whose vertices have integral 
coordinates and whose sides have integral lengths. Certain isosceles triangles in T 
can be constructed by fitting together two congruent right triangles in T, e.g., the 
isosceles triangle with vertices (—12, —9), (12, 9), (—12, 16) arises in this way. Are 
there any other isosceles triangles in T? 

(b) Consider the set- V of triangles in 3-space whose vertices have integral 
coordinates. Does V contain any equilateral triangles with integral side-length? 


E 3257. Proposed by I. A. Sakmar, Istanbul, Turkey. 


Let P, Q, R be the new vertices of equilateral triangles constructed outwardly on 
the edges of a given triangle ABC. (Cf. H. S. M. Coxeter, Introduction to Geometry, 
New York, 1961, p. 22.) 

(a) Show that any triangle POR which can be obtained in this way arises from a 
unique triangle ABC, and give a construction for recovering triangle ABC from 
triangle POR. 

(b) Show that not every triangle PQR can be so obtained. 


E 3258. Proposed by Nicolae Gonciulea, Traian College, Drobeta Turnu Severin, 
Romania. 


Prove that 
(jm iat}? Cte 


where |x| is the largest integer not exceeding x. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Quantifier Question 


E 2984 [1983, 54]. Proposed by Robert Maddux, Iowa State University. 


Characterize the logically valid sentences of the form 
Q1X, +°* Q,X,Rx, ++* xX, > Qix;, mT" Q x; Rx, Tt Xn, (*) 


where Q, is V or J, i,,..., i, iS a permutation of 1,...,, and R is a relation. 
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Editorial Comment. The third line of the statement of the problem should read: 
“where each Q, and each QO’ is V ord, i),..., 7, is a permutation of 1,...,n, and R 
is a relation symbol.” 


Solution by an anonymous referee, based on a partial solution by William M. 
Lambert, Jr. We show that the validity of (*) is equivalent to the conjunction of 
conditions (*), and (*),: 

If 4x, occurs on the left in (*), it also occurs on the right. (*), 
One cannot go from -:- Vx, ---dx,-:-: R to *), 


Ax, Wx, 00+ R. 


(*) = (*),: Proof by contraposition. Here (*) is of the form: 
Ax, 8 Rey X, WX, RE xy. 


Consider the interpretation given by the relation R C w”: R(m,,..., m,) iff m, = 0. 
Obviously the antecedent in (*) is true and the conclusion false, whence (*) is not 
valid. 

(*) = (*),: Proof by contraposition. Here (*) is of the form: 


Wx, Ex, es RO dx Wx, Rz 
Consider the interpretation given by the relation R C w: R(m,,...,m,,) iff m ja 
m,. Again, the antecedent in (*) is true under this interpretation while the succedent 
is false. 

(*), & (*). = (*): The passage from Q,x, --: Q,x,R to Qix;, +++ Q,x;R is the 
result of two types of operations: 

(a) change some V to J, 

(b) permute some (Qx) and (Q’x’). 
Clearly, we can perform the (a) operations one at a time. Moreover, since every 
permutation can be written as a product of transpositions, we can perform only one 
of these operations at a time. Accordingly, there is a sequence 


Q,x, °°: Q,x,R = Qo, 9),...,0,= Qix;, a Q,x;, RK, 


where each ®,,, arises from ®, by means of one of the following operations: 


(a) change V tod: -::Wx,--: R> :::dx,--:R 
(b') transpose V’s: +++ Wx,Wx, ++: R> +++ WxNx, ++: R 
(b*) transpose 3’s:_ +--+ 4x,4x,--: R> ++: 4xAx,--- R 


(b*) kfll uniformity: --- Axx, 0°: R v++ Wx Ax, +: Rz 


[N.B. (*), rules out changing 4 to V; (*), rules out introducing uniformity, i.e., 
changing V4 to 4V.] But now it is clear that each operation preserves truth: (b') & 
(b*) do not change meaning and (a) & (b*®) merely weaken the assertion. Hence 


®, > ®,, ®,- 9,,...,9_, - 9, 
are all valid, whence ®, > ®,, 1e., (*), is valid. Q.E.D. 
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Enveloping Circle 


E 3084 [1985, 287]. Proposed by Paris Pamfilos, University of Crete, Greece. 


Given a family of circles in the plane all of which pass through a common point 
and no two of which have the same diameter, show that there is another circle 
enveloping all the circles of the family if and only if there is a straight line 
containing all the intersection points of the common tangents of any two circles of 
the family. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Let the common point be denoted by P. We now apply inversion with respect to P 
and use the fact that such an inversion permutes the set of all circles and all lines in 
the plane conformally. 

The modified problem reads as follows: 

Given a point P and a family F of lines in the plane at distinct positive distances 
from P, show that the members of F have a common tangent circle, I’ say, if and 
only if there exists a circle, A say, passing through P such that the common tangent 
circles of any pair of lines /, m € F that pass through P share a second point 
OQeEA. 

Given the circle [ with centre M and two tangent lines / and m with intersection 
point S, say, symmetry arguments immediately reveal that the common tangent 
circles of / and m containing P have in common the point Q that is obtained by 
reflecting P with respect to MS. Because M is fixed the orbit of Q is a circle A that 
has M as its centre, too. 

Given the circle A with centre N and two lines / and m such that their common 
tangent circles containing P have point Q € A in common, again by symmetry 
arguments we see that / and m have the same distance from N, in other words, / 
and m are tangent lines of the same circle concentric with A. This circle is called I. 


Also solved by A. Bondesen (Denmark), J. Dou (Spain), J. Fukuta (Japan), L. Kuipers (Switzerland) 
and P. Sztisz, and the proposer. Both Bondesen and Fukuta established the (slightly) stronger fact that 
the assumption “all of which pass through a common point...” could be replaced by “all of which are 
tangent to a common circle, externally or internally... .” 


Symmetric Positive Definite Matrices 


E 3131 (1986, 132]. Proposed by William N. Anderson, Jr., Fairleigh Dickinson 
University, and George E. Trapp, West Virginia University. 


Let A be a partitioned real matrix with submatrices A, pl J =1,2,...,k. Let B 
be the k X k matrix with elements b,, given as follows: b,, is the algebraic sum of 
all of the elements in the A,, submatrix. Show that if A is symmetric positive 
definite, then B is symmetric positive definite. 


Editorial note: Many solvers noted that the problem was incorrect as stated, 
unless by a partitioned matrix one understands that the diagonal submatrices 4,, 
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are square. For example, 


is both symmetric and positive definite, but 
_{1 1 
a= E | 


Solution by A. A. Jagers, Technische Hogeschool Twente, Enschede, The Nether- 
lands. Assume that the rows and columns of A are partitioned the same way. Let 
C = [c;,,] be the point-block incidence matrix of this partition; that is, let c,, = 1 if i 
belongs to the jth block of the partition and let c,, = 0 otherwise. Then C has rank 
k and B= C'AC, where C! denotes the transpose of C. Hence B is symmetric 
positive definite. 


has neither property. 


Also solved by S. W. Bent, J. H. Bolstad, A. J. Bosch (The Netherlands), Chico Problem Group, T. 
Jager, O. Krafft (West Germany), O. P. Lossers (The Netherlands), J. M. Monier (France), D. 
Neuenschwander (Switzerland), B. Poonen, G. S. Rogers, A. Tissier (France), W. P. Wardlaw, P. Y. Wu 
(China), and the proposer. 


Arithmetic-Geometric Mean 


E 3142 [1986, 299]. Proposed by Zhang Zaiming, Yuxi Teachers College, China. 
Prove the following refinement of the arithmetic-geometric mean inequality: 
Vab = G(a,b) < L(A, b) < I(a,b) < A(a,b) =(a+b)/2 for0<a<b, 
where the logarithmic mean is 


_ {(b-a)/(log.b — log,a) if0<a<b 
L(a,b) = | if0<a=b 


and the identric mean is 
“(66/a*)VO" itd <a<b 
I(a,b) =< @ 
a if0<a=b 


(M. E. Mays, Functions which parametrize means, this MONTHLY, 90 (1983) 
677-683.) 


Editorial comment. There was a misprint in the original statement of the prob- 
lem, namely, (a“/b°) was erroneously given in place of (b°/a7) in the definition of 
I(a, b). 


Solution by Ricardo Perez Marco (student), Barcelona, Spain. When a= b we 
have a string of equalities. For 0 < a < b, let r= b/a> 1. Dividing the desired 
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inequalities by a, it suffices to prove the strict inequalities 


1 
vr <(r—-1)/Anr< -r/) < (r +:1)/2 forr>1. 
e 


1) For the first inequality, it suffices to show g(r) = nr — vr + 1/vr <0. 
Observe that g(r) = —(1 — vr)*/(2rvr) < 0 and g(1) = 0. 
2) For the second inequality, it suffices to show 


1 
— r/(r—1)] > e. 
h(r) ri nr>e 


Since lim, _,;+4(7) = e, we need only show h’ > 0 for r > 1. In this range h > 0, so 
we simply compute 


d , 1 1 Inr \? 
n/n= a rine - ("| 


By the first inequality, the bracketed quantity is positive, as desired. 
3) For the third inequality, it suffices to show 


f(r) =r /(r + 1) < e/2. 


Taking a similar approach, we note lim, _,,+f(7) = e/2 and want f’ < 0 for r > 1. 
In this range f > 0, so again we compute 


d 
f/f = 7B f. 
ir 
The result of this is k(r)/(r — 1)’, where 
k rt | 
(r) = yy er 
Now we have what we want, since k(1) = 0 and k’(r) = —(r — 1)?/{r(r + 1)?} 
<0 for r> 1. 


Editorial comment. The Chico Problem Group, I. Merényi, and R. E. Pfiefer 
employed the inequalities 


1 rax _ 1 1 d 1 r d 
~~ < ~ S . 
r—] 1 x exp aq fin xa < say fte 


D. M. Bloom gave bounds for the spread in the inequalities. Many solvers noted 
that solutions or generalizations have appeared in many places, including the 
following: 


H. Alzer, Uber einen Wert, der zwischen dem geometrischen und dem arithmetischen Mittel zweier 
Zahlen liegt, Elemente Math., 40 (1985) 22-24. 

H. Alzer, Ungleichungen fiir (e/a)“(b/e)’, Elemente Math., 40 (1985) 120-123. 

B. C. Carlson, A hypergeometric mean value, Proc. Am. Math. Soc., 16 (1965) 759-766. 

B. C. Carlson and M. D. Tobey, A property of the hypergeometric mean value, Proc. Am. Math. Soc., 
19 (1968) 255-262. 
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E. B. Leach and M. C. Sholander, Extended mean values, this MONTHLY, 85 (1978) 84—90. 
K. B. Stolarsky, Generalizations of the logarithmic mean, Mathematics Magazine, 48 (1975) 87-92. 


Also solved by S. Arslanajyié (Yugoslavia), D. M. Bloom, P. Bracken (Canada), J. L. Brenner, B. 
Brunson, Chico Problem Group, W. Herget, W. Janous (Austria), S. V. Keny, P. K. Khalili, L. Kuipers 
(Switzerland), I. Merényi (Romania), J. E. Peéarié (Yugoslavia), A. Pedersen (Denmark), R. E. Pfiefer, 
D. B. Tyler, M. Vowe (Switzerland), C. Wildhagen (Netherlands), and the proposer. 


ADVANCED PROBLEMS 


6568. Proposed by Itshak Borosh and Douglas Hensley, Texas A & M University 


A lattice point (s, t) € N X N is called primitive if the greatest common divisor 
of s and ¢ is 1. It is known that the density of primitive lattice points is 
6/a* = 0.6079... (Hardy and Wright, Theory of Numbers, Theorem 459). Show 
that the density of lattice points (a,b) € N X N for which each of the eight 
neighbors (a + 1, 5), (a,b + 1), (a + 1, b 1) 15 primitive is 


-Tt — 8/p?) = 0.012..., 


where the product is taken over all odd primes p. 


6569. Proposed by Wilson Castrellon and Samuel Camargo, Universidad de Los 
Andes, Bogota, Colombia 


Show that for every infinite cardinal ) there exists a cardinal k = A, a set A of 
cardinality x, and a family F of subsets of A such that card( F) = 2" and XM Y is 
finite for all X,YeE F, X# Y. 


SOLUTIONS OF ADVANCED PROBLEMS 
A Legendre Polynomial Identity 


6517 [1986, 305]. Proposed by Alexandru Lupas, Facultatea de mecanica, Sibiu, 
Romania. 


If P, is the sequence of Legendre polynomials, i.e., 


show that 


[PAC] — Peas) Pausls) = abe pO 


Editorial Note. The original statement of this problem contained the erroneous 
factor 2h, on the right side of the integral identity, where h,, is defined below. 
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Solution by the proposer. Let 


D,(x) = Pex) — Pyri(*) Paarl) 


be Turan’s Legendre polynomial determinant. By an equality attributed to L. Moser 
and M. Wyman (see D. S. Mitrinovic, Specijalne Funkcije, Beograd, 1978, formula 
(2.2.67)) we have 


ory L ger Lest ves) 


D, (x) = nin tl) X% 


Apply to this the summation by parts formula 


n—-l n-2 k n-1 
Y A,B, = YS (Ag = Agas) » B,+ A,_1 » B; 
k=0 k=0 j=0 j=0 


with B, = 1/(k + 1) to obtain 
x? 


wary F Qs + WPA) hy ~ hy) 


D, (x) = 


where h, = 0 and 


nj 
— rn>l. 

= Le 

Next, the product formula for Legendre polynomials (see A. Erdélyi, W. Magnus, F. 

Oberhettinger, and F. G. Tricomi, Higher Transcendental Functions, vol. 1, 

McGraw-Hill, New York, 1953, formula 3.15(19)) is 


P(x)P(y) = — f' alxy +tv¥l1—x’*yl =P ee. 


Therefore, 


dt 


Px)= =f" Pi(x? + (1 ~ x?)) F——. 


1-?? 


Now use the Christoffel-Darboux Summation formula (see M. Abramowitz and 
I. A. Stegun, Handbook of Mathematical Functions, Dover, New York, 1965, 
Chapter 22, formula 22.12.1) with y = 1 to obtain 


P,_y(x) — P(x nal 
Prats) = Pa) _ z (2; + 1)P(x). 


1-x 


On the left is a telescoping summand, so it is easy to see that a further summation 
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followed by a change in the order of summation on the right leads to 


1- P(x) "S! 


po tn = AAI + YP)(X), 


j=0 
Hence, 
1-—P(x*4+7¢(1 - x’) n~l . 
1a Fee) =(1- 6) de (n ~ h,)(2j + 1) P,(x? + ¢(1 - x?)). 


Upon multiplying both sides by (1 — ¢*)~'/?, integrating with respect to ¢, and 
using the formulas derived above for P?(x) and D,(x), we obtain 


1 f 1-— P(x? + t(1 — x*)) dt 


——— =n(n+1)D,(x). 
— fi — aay Mt DPA) 


This not only establishes the result, but also shows that Turan’s inequality 
D(x)20, -l<x<l, 


with equality only for x = +1) is a direct consequence of the fact that 


|P,(x)| <1, -1<x<l. 


Making Nothing from Something 


6520 [1986, 403]. Proposed by Robert B. Israel, University of British Columbia. 


Show there exists a rational function f such that, for every holomorphic function 
- in the unit disk D, g or g —f has a zero in D. (See Problem 6437 [1983, 485; 
985, 365].) 


Solution by the proposer. By Runge’s theorem, for every positive integer n there is 
a rational function f, such that 


1 1 
lf,(z) —z| < — for 5 < |z| <1 [1] 
n 


[2] 


1 F 1 
<—- =, 
n or 121 4 


1 
f(z) - ~ 


Note that by the Argument Principle, f, has a zero in |z| < 1/2 and a pole in 
|z| < 1/4. I claim that for sufficiently large n, f, satisfies the requirement of the 
problem. Suppose this were false. Then there would be a sequence of functions g, 
holomorphic in D, such that g, and g, — f, have no zeros in D. 

By Rouche’s Theorem, for large n there must be z, with |z,| = 3/4 and 
|z,(Z,)| > 1/4, since otherwise f, — g, and f, would have the same number of 
zeros in |z,| < 3/4 (of course they have the same number of poles). Similarly, for 
large n there is w, with |w,| = 1/4 and |g,(w,)| <5/n, since otherwise 1/f, and 
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1/f, — 1/g, would have the same number of zeros in |z| < 1/4. By the Maximum 
Modulus Theorem (applied to 1/g,) there must be p, with |p,| = 3/4 and 
I@n(Pn)| < S/n. 

On the annulus 4 = {z:1/2 < |z| <1} the holomorphic functions g,/f, do 
not take the values 0 and 1, and so by the Montel-Caratheodory Theorem some 
subsequence g, /f, converges uniformly on compact subsets of A, either to oo or to 
a holomorphic ‘function on A. Since Sn (Pn) / fn ( Pn,) > 9 it converges to a holo- 
morphic function, and in fact (by Hurwitz’s Theorem) to 0. But g, (z,, )/f, (z, ) 
does not converge to 0, which is a contradiction. re 


Editorial Comment. This result may be stated as follows. The three functions f, 0, 
and oo form an “unavoidable family” of meromorphic functions in the interior of 
the unit disc. For the above terminology, and some related results (but in the whole 
plane C rather than the disc) see Lee A. Rubel and Chung-Chun Yan, Interpolation 
and unavoidable families of meromorphic functions, Michigan Math. J., 20(1973) 
289-296. 
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1. Introduction. The problem of allocating scarce resources to achieve competing 
and insatiable ends lies at the core of economic reasoning. The analysis of the 
question “what’s best” by mathematical means is a focal point of economic theory. 
Economists use a variety of mathematical techniques to approach this fundamental 
problem. Shifts in techniques come about in order to expand the theory as well to 
take advantage of the discovery of new mathematical methods. Calculus, set theory, 
topology, real and convex analysis, as well as probability theory play important 
roles in the mathematical expression of economic theory. 

The classical Lagrange multiplier technique for examining local optima dominated 
the early development of the theory of consumer and producer behavior. This work 
dates to the 1930’s and 1940’s. In that period, Hicks and Samuelson advanced the 
frontier of the Walrasian general equilibrium framework by exploiting the calculus 
to explore both the stability of equilibrium and to perform the sensitivity analysis 
known as comparative statics. These two aspects of equilibrium theory form the 
basis for the predictive components of the Walras equilibrium theory. 

Convex analysis methods first appeared in the seminal contributions of von 
Neumann to game theory in 1928 and the existence of an equilibrium configuration 
in a model of an expanding economy in 1938. Von Neumann’s Minimax Theorem 
originates in those papers; it is one of the significant contributions of economic 
theory to mathematics. 

During the Second World War, the mathematical focus shifted from the calculus 
to game theory and linear programming with the efforts of von Neumann, 
Morgenstern, Koopmans, and Dantzig (the independent contribution of Kantoro- 
vich to linear programming was not recognized until much later). Set theoretic and 
topological tools dominated the writings of mathematical economists in the 1950’s 
and led to the fundamental properties of the Walrasian model of perfect competi- 
tion: the existence of equilibrium and the welfare properties of equilibrium accord- 
ing to the Pareto criterion. Nonlinear and dynamic programming were also stimu- 
lated in part by the needs of economic theory in this period. Differential methods 
could still be found in the study of the uniqueness, stability, and comparative statics 
properties of equilibrium. Sufficient conditions for the uniqueness of a Walras 
equilibrium were known to be special. 
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Differential topology techniques were introduced into economics by Debreu in 
1970. He showed how differentiability of demand could be used to show “most” 
(i.e, the regular) economies have only a finite number of equilibria. This new use of 
the calculus went beyond the methods of the 1930’s and 1940’s. Debreu’s research in 
the 1950’s influenced the shift away from the calculus as an important tool in favor 
of convex analysis. Ironically, Debreu was also the major influence in returning the 
calculus to favor in the 1970’s. 

Economic theory has taken an increasingly mathematical orientation since the 
Second World War. The variety of nonlinear problems found in economics has 
made mathematically inclined economists both end users of and suppliers to a 
variety of mathematical fields. It is now commonplace in the top economics journals 
to see papers employing differential topology, functional analysis (both linear and 
nonlinear theory), or stochastic processes. This raises the problem of how to bridge 
the communications gap between the two professions in an era of increasing 
specialization. 

The volume Studies in Mathematical Economics, edited by Stanley Reiter, is an 
attempt to provide a series of papers by distinguished economic theorists which 
illustrate the scope and power of mathematical methods in economics. The papers 
range over game theory, the existence, uniqueness, and computation of Walras 
equilibrium, the nature and meaning of a decentralized resource allocation process, 
and the notion of “satisficing” rather than optimizing behavior. The mathematical 
methods include a sophisticated “advanced calculus” survey of constrained optimi- 
zation problems defined on a finite dimensional vector space, the implicit function 
theorem, Sard’s Theorem, the development of numerical algorithms for the compu- 
tation of a fixed point of a nonlinear map (both a combinatorial approach keyed to 
Sperner’s Lemma and a path following approach tied to Sard’s Theorem are 
developed in separate chapters), the strong law of large numbers, and the theory of 
Markov chains with a countable state space. For the remainder of this essay, I will 
concentrate on the use of differential topology in economics and on issues in game 
theory in order to illustrate the variety of models, techniques, and results forming 
the subject of the Reiter volume. 


2. Differential topology and the calculus. A demand function is defined as a 
mapping from an n-vector of prices into an n-vector of commodities. It represents 
the optimization choice of a consumer subject to a budget constraint. The laws of 
change stating how consumer demand for various goods responds to a change in 
market conditions may be found by calculating the derivative of the demand 
function with respect to the price vector. 

The calculus approach to choice problems in economics is represented by Carl 
Simon’s paper in this volume. The role of the implicit function theorem and the 
presentation of criteria for determining the global optimum are presented in detail. 
The range of methods and examples found in this chapter could be used to show 
how economic theory passes from postulates on behavior to model formulation and 
analysis. It would be an excellent source of economics examples of the calculus in 
action for students in multivariate and advanced calculus courses. 
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The differential calculus has a deeper use in models of the workings of the 
economy as a whole. A central problem in the study of the Walras general 
equilibrium model is to demonstrate the existence of a price system which equates 
supply and demand in all markets simultaneously. The simplest realization of this 
model is an exchange economy where the demand functions are derived by adding 
the demand functions of the individual consumers at different prices and defining 
the supply function to be the constant function equal to the total amount of each 
good initially available in the economy. The existence of a Walras equilibrium 
amounts to proving that a system of nonlinear equations in the unknown price 
system has at least one nonnegative solution. The existence task usually relies on a 
fixed point argument. 

The next problem is to determine the number of possible solutions. Differential 
topology plays a decisive role in settling this question. An economy is defined for 
each vector of initial resources; the space of economies may be identified with the 
commodity space. An economy is regular if the demand function is differentiable 
and its Jacobian has maximal’rank at every equilibrium price. 

Let f be a smooth mapping defined on an open subset of R™” and taking values 
in R”. A point in the domain is a critical point if the Jacobian of f, evaluated at that 
point, has rank less than n. A point is regular if it is not a critical point. Sard’s 
Theorem states that the image of the set of critical points of f has Lebesgue 
measure Zero. 

Debreu’s fundamental result is based on Sard’s Theorem and states that the 
nonregular economies form a closed set of Lebesgue measure zero in the space of 
economies. Moreover, every regular economy has an odd number of equilibria. The 
proof of the latter fact is based on an Index Theorem given in Mas—Collel’s chapter, 
which also applies to some production economies. 

The Index Theorem is closely related to the degree of a mapping. The index of a 
Walras equilibrium is the sign of the determinant of a matrix based on the Jacobian 
of the demand function at the given equilibrium; a normalization imposed by 
economic theory implies that the Jacobian of demand is a singular matrix, hence it 
cannot be directly used in the index calculation. The Index Theorem states that for 
every regular economy, the sum of the indices over all equilibria equals (—1)"~1, 
where n is the number of commodities. The Index Theorem has two corollaries for 
regular economies: first, there is an odd number of Walras equilibria; in particular, 
a Walras equilibrium must exist. Second, if the index has a uniform sign across all 
equilibria, then there is only one equilibrium price system. 

The Index Theorem and its corollaries appear to offer the prospect of deriving 
comparative Statics results for the Walrasian economy. The implicit function theo- 
rem could be used for regular economies to compute the change in equilibrium 
prices in’ response to a parameter shift in the demand function for the economy. A 
series of results initiated by the work of Sonnenschein suggests that economic theory 
imposes very few restrictions on the form of the demand function in an exchange 
economy. The derivation of comparative statics results without further assumptions 
seems doubtful. One trend begun by Hildenbrand is to make stronger assumptions 
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on the distribution of preferences or income in order to obtain more characteristics 
of the economy-wide demand function. 

One of the most important influences of the regular economies approach has 
been to focus researchers’ attention on generic results. For example, the nonregular 
exchange economies are dismissed as unimportant examples. Regularity arguments 
can be found in the study of the determinateness of equilibrium in models 
motivated by fundamental problems in macroeconomic theory. Some other exam- 
ples of the impact of differentiable methods can be found in Hurwicz’s essay on 
decentralization. His and Reiter’s papers are also good sources of other applications 
of general topological methods in economics. 


3. Uncertainty and Information. Uncertainty is an important aspect of economic 
life. The nature and amount of information about the states of nature and the 
economy plays a fundamental role in understanding resource allocation processes. 
Information and uncertainty are dual concepts; to describe the information avail- 
able to an agent is to describe the degree of uncertainty that the agent faces in the 
economic sphere. Information can enter an economic model in a variety of ways. 
Game theory provides one example. 

The chapter by Roger Myerson on game theory attempts to illustrate the ways in 
which mathematical models can shed light on the possible equilibrium positions 
achievable under the hypothesis of rational agents pursuing their own ends within a 
world of like-minded individuals. The formulation of a game depends on a specifi- 
cation of the information available to each player as well as appropriate probabili- 
ties to be used in calculating expected utility payoffs. The expected utility theory 
forms the decision theoretic basis of the player’s choice problems. The most 
important characteristic of the expected utility framework is that a player’s payoff 
function is “linear in the probabilities” assigned to the various possible states of the 
world. One of the most interesting developments in economic theory in the 1980’s is 
the exploration of non-additive expected utility theories by Machina, Chew, and 
Yaari. The interest in an alternative to the traditional expected utility theory arises 
from a controversy over its ability to predict behavior. There are several paradoxes 
which arise in the additive expected utility model which have been traced to the 
Independence Axiom. This postulate imposes the structure that yields the “additive” 
form of the utility function. The development of the implications of non-additive 
expected utility for game theory appears to be an area for promising research. 

Given a finite set of players N = {1,2,...,n} a noncooperative game is a triple 
(N, u,, S(n)) where S(n) is the set of strategies that are open to player n and u, is 
a real-valued function on the cartesian product of the S(n) that defines the payoff 
functions for player n as a function of the strategies selected by all the players. A 
necessary condition for any equilibrium strategy or outcome of the game is that a 
player’s strategy has the property that he cannot increase his payoff by departing 
from his strategy as long as the other players maintain their strategies. A Nash 
equilibrium is a strategy for each player which is simultaneously chosen and 
self-enforcing in this sense. The fundamental result in game theory is the Nash 
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Theorem stating conditions for a noncooperative game to have an equilibrium 
solution. The existence theory for the Nash equilibrium concept applies in the 
additive expected utility framework; it should be applicable to the non-additive 
expected utility model as well. 

Nash equilibria are abundant in many models; a substantial part of Myerson’s 
paper is devoted to surveying the alternative ways of arriving at one equilibrium 
outcome. For example, information plays a decisive role in all of the noncooperative 
selection theories. This often changes the focus of the game form to an extensive or 
multistage form in contrast to the strategic form of the game postulated in the 
original Nash theory. Alternative equilibrium concepts depend on the specification 
of what each agent has in an information set and believes at each stage. For 
example, one could ask that the equilibrium outcome have the property that a 
player’s strategies maximize the agent’s conditionally expected payoff given his 
beliefs about the other players’ actions and random events in any information set. A 
consistency rule also governs the players’ beliefs at the various information states. 
The result of imposing these requirements is called a sequential equilibrium; it does 
give the researchers a nonempty subset of the Nash equilibrium set. Information 
holds the key to this advance. The additive expected utility hypothesis is also a 
critical component in the development of this theory. The impact of the non- 
additive theories on multistage games remains to be seen. 


5. Conclusion. Stanley Reiter has brought a sample of the ways in which 
economists use mathematics together in one volume. The papers are all devoted to 
problems at the heart of economics. The subjects treated are not exhaustive, but this 
volume does serve as an introduction to mathematical reasoning in economics. For 
the teacher of mathematics, this volume opens a door to using economic theory as 
an alternative to the physical and biological applications of nontrivial mathematical 
analysis. 
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General, S**, P, L**. Riddles of the Sphinz And 
Other Mathematical Puzzle Tales. Martin Gardner. 
New Math. Lib., V. 32. MAA, 1987, x + 164 pp, 
$14.50 (P). (ISBN: 0-88385-632-8] Thirty-six mathe- 
matical riddles embedded in imaginative stories first 
published in Isaac Asimov’s Science Fiction Maga- 
zine. Riddles are followed by four sections of “an- 
swers” that unwrap successive layers of the puz- 
zles, only to reveal hidden conundrums. Superb for 
browsing—one can plunge in anywhere and be in- 
stantly enthralled. Excellent supplement for mathe- 
matics clubs. LAS 


General, P. Fourteen Papers Translated from the 
Russian. <A. Ya. Aizenshtat. Transl: Ben Sil- 
ver. AMS Transl. Ser. 2, V. 134. AMS, 1987, v 
+ 153 pp, $49. [ISBN: 0-8218-3110-0] 


Mathematics Appreciation, S**(13-16), P, 
L**, Mathematics: Queen and Servant of Science. 
E.T. Bell. MAA Spectrum. MAA, 1987, xxiv + 437 
pp, $18.50 (P). (ISBN: 0-88385-447-3] First volume 
in MAA’s new “Spectrum” series of popular math- 
ematics books. This paperback is a reprint (with a 
Foreward by Martin Gardner) of Bell’s 1951 classic, 
itself a revised composite of two earlier works on pure 
mathematics (the “queen”) and applied mathemat- 
ics (the “handmaiden”) first published in 1931 and 
1937. A master expositor, Bell cajoles the novice on 
a meandering tour of traditional mathematics, re- 
vealing beauty, application, and wit on every page. 
Should be required reading for every mathematics 
student. LAS 

Mathematics Appreciation, T(13: 1). Math- 
ematics as a Second Language, Fourth Edition. 


Joseph Newmark, Frances Lake. Addison-Wesley, 
1987, xxi + 832 pp, $31.95. [ISBN: 0-201-05885-5] 


Well-written text introducing basic concepts. Ex- 
cellent applications, examples, graphs and newspa- 
per clippings from business, sociology, education, 
medicine. Historical notes and calculator exercises 
throughout. Topics include sets, logic, number sys- 
tems, geometry, matrices, probability, statistics, and 
an introduction to BASIC. Good problems, chap- 
ter objectives and mastery tests. (First Edition, 
TR, May 1974; Second Edition, TR, November 1977; 
Third Edition, TR, December 1982.) MS 


Logic, T(15-18: 1, 2), S, P. Logics of Time and 
Computation. Robert Goldblatt. CSLI Lect. Notes, 
No. 7. CSLI (Stanford U., Ventura Hall, Stan- 
ford, CA 94305), 1987, ix + 131 pp, $11.95 (P); 
$24.95. [ISBN: 0-937073-12-1; 0-937073-11-3] Dis- 
cusses modal logic, emphasizing temporal and dy- 
namic logics. Part one is on the basic theory of nor- 
mal modal and temporal propositional logics. Part 
two is an application of this theory to temporal and 
computational logics, while part three is devoted 
to first-order dynamic logic. Exercises throughout. 
Bibliography. Index. RJA 


Combinatorics, T(16-18: 1), S, P, L*. Simu- 
lated Annealing: Theory and Applications. P.J.M. 
van Laarhoven, E.H.L. Aarts. Math. & Its Applic. D 
Reidel (US Distr: Kluwer Academic), 1987, xi + 186 
pp, $49. (ISBN: 90-277-2513-6] Simulated anneal- 
ing is an efficient, iterative algorithm for minimizing 
the cost function in an NP-complete combinatorial 
problem (e.g., travelling salesman problem) by capi- 
talizing on a deep and close analogy with annealing, 
the slow cooling of a solid until it reaches its low 
energy ground state. This monograph discusses the 
theoretical basis of the algorithm (asymptotic con- 
vergence), implementation performance, and a few 
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of the many dozens of applications. LAS 
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ic Functions, Second Edition. Serge Lang. Grad. 
Texts in Math., V. 112. Springer-Verlag, 1987, xi 
+ 326 pp, $38. [ISBN: 0-387-96508-4] Elliptic func- 
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Slightly revised from the 1973 edition (TR, June-July 
1974). BC 
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ics-1254: Explicit Constructions of Automorphic L- 
Functions. Stephen Gelbart, Ilya Piatetski-Shapiro, 
Stephen Rallis. Springer-Verlag, 1987, vi + 152 
pp, $18 (P). [ISBN: 0-387-17848-1] Two papers on 
L-functions attached to classical groups. The first 
applies a formal identity to the simple classical 
groups. The second generalizes the Rankin-Selberg 
method to certain non-simple groups of the form 
G x GL(n). BC 


Number Theory, P. Lecture Notes in Mathemat- 
ics-1253: An Approach to the Selberg Trace For- 
mula via the Selberg Zeta-Function. Jurgen Fischer. 
Springer-Verlag, 1987, 184 pp, $18 (P). [ISBN: 0- 
387-15208-3] The Selberg zeta function is for Fuch- 
sian groups what the Riemann zeta function is for 
the integers. The trace formula is a variant of Pois- 
son summation, with miraculous implications for the 
zeta function (e.g., a “Riemann Hypothesis”). Fis- 
cher uses a “resolvent method” to clarify the mira- 
cle. BC 


Group Theory, S(18), P. Representations of Alge- 
braic Groups. Jens Carsten Jantzen. Pure & Appl. 
Math., V. 131. Academic Pr, 1987, xiii + 443 pp, 
$59.50. (ISBN: 0-12-380245-8] Part I, which requires 
knowledge of varieties and algebraic groups, develops 
the representation theory of algebraic group schemes. 
Part II covers representations of reductive groups, 
the prerequisite being the basics of reductive alge- 
braic groups over an algebraically closed field. GG 


Group Theory, T(18:1). Transformation Groups. 
Tammo tom Dieck. Stud. In Math., V. 8. Walter 
de Gruyter, 1987, x + 312 pp, DM 128. [ISBN: 0- 
89925-029-7] An introduction to the theory of com- 
pact transformation groups, concentrating on the al- 
gebraic topology of Lie transformation groups. The 
reader is expected to have some understanding of 
basic algebraic topology. References to the litera- 
ture provide background material. Many interesting 
exercises are included. RH 


Group Theory, P. Lecture Notes in Mathematics- 
1243: Non-Commutative Harmonic Analysis and Lie 
Groups. Ed: J. Carmona, P. Delorme, M. Vergne. 
Springer-Verlag, 1987, 309 pp, $31.50 (P). (ISBN: 
0-387-17701-9] Thirteen papers from the sixth collo- 
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quium on harmonic analysis and Lie groups, held at 
Marseille-Luminy in June 1985. GG 


Algebra, T(18: 1), S, P. Complez Semisimple Lie 
Algebras. Jean-Pierre Serre. Springer-Verlag, 1987, 
ix + 74 pp, $19.80 (P). [ISBN: 0-387-96569-6] Based 
on lecture notes from 1965, there is a brief survey 
of the general theory of Lie algebras followed by a 
detailed discussion of complex semisimple algebras. 
Chapters on SZ2, root-systems, and structure theo- 
rems. Concludes with a chapter on compact groups. 
Bibliography, index. JS 

Calculus, T(13), S, L. Applied Calculus: An Intu- 
itive Approach for Management, Life, and Social Sci- 
ences. Richard L. Faber, Marvin I. Freedman, James 
L. Kaplan. West, 1986, xiii + 607 pp, $34.95. [ISBN: 
0-314-85235-2] A text written for students preparing 
for careers in business, economics, or in the life or so- 
cial sciences. The book is replete with examples from 
all these fields. The title is very apt, since the authors 
illustrate almost every concept with several intuitive 
examples designed to help students understand the 
“why” of calculus. Contains sections on mathemati- 
cal modeling, numerical integration, price elasticity, 
least squares, Lagrange multipliers, power series so- 
lution of differential equations, Newton’s method and 
a chapter on probability. Trigonometric functions 
are introduced following differentiation and integra- 
tion in several variables; exponential and logarithmic 
functions are introduced much earlier. SM 


Calculus, P. Zahlen Kontinuum: Eine Einfuhrung 
in dite Infinitestmalmathematik. Detlef Laugwitz. 
Lehrbucher und Mono. zur Didaktik der Math., B. 
5. Bibliographisches Institut, 1986, 269 pp, 38 DM 
(P). [ISBN: 3-411-03128-X] An introduction to non- 
standard analysis, based on the adjunction of an ideal 
element to the reals and intended for teachers. Many 
historical and philosophical remarks. JD-B 


Differential Equations, P. Hydrodynamic Behav- 
tor and Interacting Particle Systems. Ed: George 
Papanicolaou. Inst. for Math. & Its Applic., V. 9. 
Springer-Verlag, 1987, xi+ 210 pp, $19.80. [ISBN: 0- 
387-96584-X] Proceedings of a March 1986 workshop 
conducted as part of the 1985-86 IMA program on 
“Stochastic Differential Equations and Their Appli- 
cations.” Contains fifteen papers presenting exper- 
imental results on suspensions and a discussion of 
analytical methods used to study these systems. AM 


Partial Differential Equations. Partial Differ- 
ential Equations. J. Wloka. Transl: C.B. & M.J. 
Thomas. Cambridge U Pr, 1987, xi + 518 pp, 
$79.50; $29.95 (P). (ISBN: 0-521-25914-2; 0-521- 
27759-0] Discusses theoretical and numerical aspects 
of solving boundary value problems. Assumes famil- 
iarity with basic functional analysis and begins with 
an introduction to distributions and Sobolev spaces. 
With this foundation, the book presents different 
methods for solving elliptic, parabolic, and hyper- 
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bolic equations, as well as difference methods for nu- 
merical solution of partial differential equations. AM 


Partial Differential Equations, P. Problémes de 
Dirichlet Variationnels non Linéaires. Jean Mawhin. 
Pr U Montreal, 1987, 168 pp, $22 (P). [ISBN: 2-7606- 
0799-2] 


Partial Differential Equations, P. Nonlinear 
Parabolic Equations: Qualitative Properties of So- 
lutions. Ed: L. Boccardo, A. Tesei. Res. Notes in 
Math. Ser., V. 149. Longman Scientific & Tech- 
nical (US Dist: Wiley), 1987, 232 pp, $49.95 (P). 
[ISBN: 0-470-20379-X] Proceedings of a conference 
held at the Second Rome University from April 1-5, 
1985. The book contains 31 papers which discuss the 
following topics: existence, uniqueness and regular- 
ity of solutions for parabolic equations and systems; 
special classes of solutions; free boundary problems; 
degenerate diffusion; asymptotical behaviour of so- 
lutions; and models from biology, physics, and engi- 
neering. AM 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1256: Pseudo-Differential Opera- 
tors. Ed: H.O. Cordes, B. Gramsch, H. Widom. 
Springer-Verlag, 1987, x + 479 pp, $41.70 (P). 
[ISBN: 0-387-17856-2] Twenty-two research papers 
on pseudodifferential operators, a spin-off from par- 
tial differential equations. Includes a lengthy contri- 
bution by Hormander on non-linear hyperbolic equa- 
tions. BC 


Partial Differential Equations, P. Oakland Con- 
ference on Partial Differential Equations and Ap- 
plied Mathematics. Ed: Louis R. Bragg, John W. 
Dettman. Res. Note in Math. Ser., V. 154. Long- 
man Scientific & Technical (US Distr: Wiley), 1987, 
117 pp, $38.95 (P). (ISBN: 0-470-20777-9] Five pa- 
pers from a conference at Oakland University, Michi- 
gan, focusing on transmutations and other methods 
in partial differential equations. Topics include in- 
verse scattering theory, ocean acoustics, and poten- 
tial theory. BC 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1260: Nonlinear Evolution Opera- 
tors and Semigroups. Nicolae H. Pavel. Springer- 
Verlag, 1987, vi + 285 pp, $24.30 (P). [ISBN: 0-387- 
17974-7] An exposition of fundamental results and 
recent research on nonlinear evolutionary operators 
and semigroups, with applications of the theory to 
several standard types of differential equations aris- 
ing in physics. The techniques allow partial differen- 
tial equations to be treated as ordinary differential 
equations in infinite dimensional Banach spaces, en- 
abling a unified approach. RB 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1236: Stochastic Partial Differential 
Equations and Applications. Ed: G. Da Prato, L. 
Tubaro. Springer-Verlag, 1987, 257 pp, $23.60 (P). 


TELEGRAPHIC REVIEWS 


275 


[ISBN: 0-387-17211-4] Proceedings of an October 
1985 conference in Trento, Italy. LAS 

Numerical Analysis, T(16-17: 1, 2), S*, L. Nu- 
merical Methods in Engineering and Applied Science: 
Numbers are Fun. B. Irons, N.G. Shrive. Math. 
& Its Applic. Halsted Pr, 1987, 248 pp, $32.95. 
[ISBN: 0-470-20803-1] A delightful, witty, back-of- 
the-envelope survey of how engineers think about 
mathematical methods intended to fill “regrettable 
gaps” in students’ education. Covers integration, in- 
terpolation, splines, differential equations, matrices, 
eigenvalues—not systematically but in the manner of 
a personalized idiosyncratic tour. Excellent source of 
intuition and synthetic ideas. LAS 


Numerical Analysis, P. Defect Minimization in 
Operator Equations: Theory and Applications. R. 
Reemtsen. Res. Notes in Math. Ser., V. 163. Long- 
man Scientific & Technical (US Distr: Wiley), 1987, 
106 pp, $34.95 (P). [ISBN: 0-470-20877-5] Defect 
minimization is a numerical technique for approxi- 
mating solutions to operator equations Ta = r by 
minimizing Ta — r (the defect) in finite-dimensional 
subspaces for a. This book develops the general the- 
ory (especially convergence), and applies it to or- 
dinary differential equations and the heat equation. 
Begins with a wonderful quote from A Hundred Years 
of Solitude. BC 


Numerical Analysis, T(16-17: 1), S, P, L. Sen- 
sitivity Analysis in Linear Systems. Assem Deif. 
Springer-Verlag, 1986, xii + 224 pp, $56. [ISBN: 0- 
387-16312-3] Five self-contained chapters (with thor- 
ough examples and exercises) on perturbation tech- 
niques and sensitivity analysis: perturbation of linear 
equations, interval analysis, iterative systems, least- 
squares, and sensitivity in linear programming. BC 

Functional Analysis, P. Lecture Notes in Mathe- 
matics-1235: Séminaire de Théorie du Potentiel, 
Paris, No. 8. Ed: F. Hirsch, G. Mokobodzki. 
Springer-Verlag, 1987, 209 pp, $19.40 (P). [ISBN: 
0-387-17210-6] 

Functional Analysis, T?(18: 2), P. Barrelled Lo- 
cally Convez Spaces. Pedro Pérez Carreras, José 
Bonet. Math. Stud., V. 131. Elsevier Science, 1987, 
xv + 512 pp, $80 (P). [ISBN: 0-444-70129-X] This 
systematic treatment of barrelled spaces begins with 
background material from various branches of anal- 
ysis and then focuses on Baire spaces. In addition to 
the abstract theory of barrelled spaces, the volume 
discusses local completeness, the inherited Mackey 
topology, bornological and ultrabornological spaces. 
Clearly written with extensive bibliography but no 
exercises. MU 


Functional Analysis, S(17-18), P. Inner Prod- 
uct Structures: Theory and Applications. Vasile Ion 
Istratescu. Math. & Its Applic. D Reidel (US 
Distr: Kluwer Academic), 1987, xv + 895 pp, $149. 
[ISBN: 90-277-2182-3] A thorough, self-contained de- 
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velopment of inner product structures, which under- 
lie many linear spaces. Applications to probability 
and physics interspersed. An extensive bibliography 
and a collection of open problems. BC 


Analysis, P. Fourteen Papers Translated from the 
Russian. L.A. Aksent’ev, et al. Transl: Ben Sil- 
ver. AMS Transl. Ser. 2, V. 136. AMS, 1987, v + 
154 pp, $47. (ISBN: 0-8218-3112-7] Thirteen papers 
on analysis and one on social choice theory. LAS 


Analysis, P. Lecture Notes in Mathematics-1262: 
Zahlentheoretische Analysis IT. E. Hlawka. Springer- 
Verlag, 1987, 158 pp, $16.30 (P). [ISBN: 0-387-18015- 
x] 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1266: Space Curves. Ed: F. Ghione, C. Pe- 
skine, E. Sernesi. Springer-Veralg, 1987, 272 pp, 
$24.30 (P). (ISBN: 0-387-18020-6] Proceedings of a 
conference in Rome, June 1985 on “Curves in Pro- 
jective Space.” Twelve papers: “good and bad fam- 
ilies of projective curves;” postulation of projective 
space curves; classical problems in enumerative ge- 
ometry. RB 


Differential Geometry, P. Lecture Notes in 
Mathematics-1263: Differential Geometry. Ed: V.L. 
Hansen. Springer-Verlag, 1987, xi + 288 pp, $24.30 
(P). (ISBN: 0-387-18012-5] Proceedings of the Nordic 
Summer School in mathematics held at the Tech- 
nical University of Denmark in Lyngby. Contains 
the manuscripts for the main lectures given at the 
school. These lectures discussed gauge theory and 
moduli; twistor methods; and global differential ge- 
ometry. AM 


Differential Geometry, P. Monopoles, Minimal 
Surfaces and Algebraic Curves. Nigel Hitchin. Pr 
U Montreal, 1987, 94 pp, $18 (P). [ISBN: 2-7606- 
0801-8] Monograph studies a collection of non-linear 
differential equations that derive from the self-dual 
Yang-Mills equations. The techniques of symplec- 
tic geometry and moment maps are used to study 
the structure of the moduli space of monopoles— 
a special class of solutions to the Yang-Mills equa- 
tions. AM 


Algebraic Topology, P. Homotopy Theory and 
Related Topics. Ed: H. Toda. Adv. Stud. in 
Pure Math., V. 9. Elsevier Science, 1987, xii + 344 
pp, Dfl. 250.00. [ISBN: 0-444-70201-6] Proceedings 
of a Japanese symposium (Kyoto, December 1984). 
Nineteen research papers and four expository lec- 
tures on simple homotopy theory and G-actions, clas- 
sifying spaces and characteristic classes, topology of 
manifolds, homotopy problems (unstable and stable 
cases). RB 


Dynamical Systems, P. Dynamical Systems and 
Bifurcation Theory. M.I. Camacho, M.J. Pacifico, 
F. Takens. Res. Notes in Math. Ser., V. 160. 
Longman Scientific & Technical (US Distr: Wiley), 
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1987, 421 pp, $62.95 (P). [ISBN: 0-470-20843-0] Pro- 
ceedings of a conference at Rio de Janeiro, Au- 
gust 1985. Thirteen papers, primarily on structural 
stability of vector fields and bifurcation theory, in- 
cluding Hopf bifurcation, transition from periodic to 
strange attractors and homoclinic tangencies. Other 
topics: polynomial foliations, complex vector fields 
and foliations, dynamical aspects of lines of curva- 
ture; geodesic flows. RB 


Dynamical Systems, S(18). Lecture Notes in 
Control and Information Sciences-77: Detection of 
Abrupt Changes in Signals and Dynamical Systems. 
Ed: M. Basseville, A. Benveniste. Springer-Verlag, 
1986, x + 373 pp, $28.50 (P). [ISBN: 0-387-16043-4] 
The detection of abrupt changes in dynamical sys- 
tems has applications which range from the detec- 
tion of sensor failure to the prediction of the catas- 
trophic failure of mechanical and biological systems. 
Certain meteorological phenomena, tsunamis for ex- 
ample, can also be considered abrupt changes in dy- 
namic systems. Hypothesis testing plays an impor- 
tant role in the detection of these changes. This 
book presents a series of twelve research papers which 
cover the topic from both a theoretical and applied 
perspective. SM 


Control Theory, P. Algebraic and Geometric 
Methods in Nonlinear Control Theory. Ed: M. 
Fliess, M. Hazewinkel. Math. & Its Applic. D 
Reidel (US Distr: Kluwer Academic), 1986, xii + 
642 pp, $99. (ISBN: 90-277-2286-2] The proceed- 
ings of a conference sponsored by the Centre Na- 
tional de la Recherche Scientifique and the Labora- 
torie des Signaux et Systems, held in Paris in June 
1985. Contains 31 papers on state-of-the-art non- 
linear control theory. Topics include controllability, 
observability, and realization; feedback synthesis and 
linearization; optimal control; discrete time systems; 
and applications. Survey papers and new results are 
presented. SM 


Control Theory, P. Lecture Notes in Control and 
Information Sciences-91: Stochastic Modelling and 
Filtering. Ed: Alfredo Germani. Springer-Verlag, 
1987, 218 pp, $35.60 (P). (ISBN: 0-387-17575-X] Pro- 
ceedings of a December 1984 IFIP workshop held in 
Rome: fourteen papers on recent advances in theory 
and applications of stochastic modelling. LAS 


Control Theory, P. Modelling, Robustness and 
Sensitivity Reduction in Control Systems. Ed: Ruth 
F. Curtain. NATO ASI Ser. F, V. 34. Springer- 
Verlag, 1987, ix + 492 pp, $89. [ISBN: 0-387-17845- 
7] Papers on the theory and practice of modelling of 
control systems. Mathematical foundations of H™- 
control theory, modelling from measurement data, 
approximation, model reduction, robustness and sen- 
sitivity reduction. RM 


Systems Theory, T(18: 1), P. Stability of Adap- 
tive Systems: Passivity and Averaging Analysis. 


1988] 


B.D.O. Anderson, et al. Ser. in Signal Proc., Op- 
timiz. & Control., V. 8. MIT Pr, 1986, xviii + 
326 pp, $35. [ISBN: 0-262-01090-9] This book, with 
eight authors, consists of six chapters on the stabil- 
ity of adaptive systems, systems which depend on a 
set of time varying parameters. The ultimate goal 
of this field is to develop algorithms which remain 
stable while the system evolves into the future. This 
book concerns itself more with methodology and ba- 
sic concepts rather than with the specific details of 
any algorithm. The final chapter includes a detailed 
example. SM 


Probability, T*(15-16: 1, 2), S, L. Probabil- 
ity and Stochastic Processes. Frederick Solomon. 
Prentice-Hall, 1987, xv + 426 pp. [ISBN: 0-13- 
711961-5] An introduction to probability at a level 
that takes the axiom of countable additivity as op- 
tional. Includes discussions of correlation coefh- 
cients, the central limit theorem, continuous-time 
birth and death processes, and discrete-time Markov 
chains. FLW 


Probability, S*, P, L**. Paradozes in Probabil- 
ity Theory and Mathematical Statistics. Gabor J. 
Székely. Math. & Its Applic. D Reidel (US Distr: 
Kluwer Academic), 1986, xii + 250 pp, $59. [ISBN: 
90-277-1899-7] A fascinating exposition of paradox- 
ical results in probability, statistics, random pro- 
cesses, and foundations of probability, complete with 
historical detail on the origin and resolution of each 
paradox. Topics range from old dilemmas about dice 
that led to the creation of probability theory to trap- 
door functions in cryptography. Historical context 
shows how often paradoxes led to new insights in the 
mathematics of chance. LAS 


Stochastic Processes, P. Random Media. Ed: 
George Papanicolaou. IMA, V. 7. Springer-Verlag, 
1987, xi + 321 pp, $34. [ISBN: 0-387-96524-6] 
Twenty-one papers from an IMA workshop, mostly 
on wave equations and random Schrodinger opera- 
tors. One-dimensional problems are emphasized (in 
part because they can be solved). BC 


Stochastic Processes, S(17). Statistical Esti- 
mation for Stochastic Processes. K. Nanthi, M.T. 
Wasan. Papers in Pure & Appl. Math., No. 78. 
Queen’s U, 1987, vi + 245 pp, (P). Provides math- 
ematical background, including estimation and test 
of hypothesis in discrete time Markov chains, max- 
imum likelihood estimators, etc., for such problems 
as birth and death models, immigration and emmi- 
gration, diffusion, continuous time processes, and fil- 
tering. Statistical estimations for these problems aid 
in work where actual parameters are unknown, such 
as life table analysis (actuarial) and network theory 
(computer science). LB-E 


Elementary Statistics, T*(13: 1, 2). Ezploring 
Statistics: A Modern Introduction to Data Analysis 
and Inference. Larry J. Kitchens. West, 1987, xii 
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+ 595 pp, $30.56 [ISBN: 0-314-28498-2]: Instructor’s 
Manual with Solutions and Test Bank, v + 333 pp, 
(P). [ISBN: 0-314-35115-9] Covers the usual topics, 
but in a more modern way than most introductory 
texts. Exploratory data analysis (EDA) techniques 
are introduced early and used throughout, robust 
and nonparametric procedures are integrated with 
standard methods, many examples and exercises use 
real data, and optional sections on using Minitab are 
included in each chapter. RSK 


Elementary Statistics, S*(14-18), P*, L*. A 
Handbook of Introductory Statistical Methods. C. 
Philip Cox. Ser. in Prob. & Math. Stat. Wiley, 
1987, xxi + 272 pp, $34.95. [ISBN: 0-471-81971-9] 
An “economical summary” of many statistical tech- 
niques for estimation and hypothesis testing. El- 
ementary procedures, contingency tables, bivariate 
and multiple regression, multiple comparisons, cor- 
relations, and randomized block and Latin square 
designs. FLW 


Statistics, S(15-18). The Statistical Consultant in 
Action. Ed: D.J. Hand, B.S. Everitt. Cambridge U 
Pr, 1987, xii + 189 pp, $34.50. [ISBN: 0-521-30717-1] 
Twelve articles, ranging from the entertaining to the 
technical, designed to give a realistic picture of sta- 
tistical consulting. Shows the diversity of problems 
and challenges with which statisticians must deal. 
Includes a bibliography. RSK 


Statistics, T(18: 1), P*. Statistical Analysis with 
Missing Data. Roderick J.A. Little, Donald B. Ru- 
bin. Prob. & Math. Stat. Wiley, 1987, xiv + 
278 pp, $34.95. [ISBN: 0-471-80254-9] Comprehen- 
sive treatment for the applied statistician, divided 
into two parts. The shorter Part I reviews histor- 
ical approaches to missing data problems in anal- 
ysis of variance, survey sampling, and multivariate 
analysis. Part II presents a systematic approach us- 
ing likelihood-based methods which take into account 
the mechanisms that lead to missing data. Includes 
applications to a wide variety of situations. RSK 


Statistics, T(15-18: 1), S, P. The General Linear 
Model: Data Analysis in the Social and Behavioral 
Sciences. Raymond L. Horton. Robert E Krieger, 
1986, xi + 274 pp, $29.95. [ISBN: 0-89874-906-9] A 
reprint of the 1978 edition. Develops the general lin- 
ear model in matrix form, assumptions needed for 
inference, and use of the model for factorial, Latin 
square, repeated measures, covariance, and regres- 
sion analysis designs. Written for the “applied re- 


~ searcher.” FLW \ 


Statistics, T(16-18: 1-3), S. Statistical Madels in 
Applied Science. Karl V. Bury. Robert E Krieger, 
1986, xvii + 625 pp, $49.50. [ISBN: 0-89874-747-3] 
A reprint of the 1975 edition (TR, October 1977). 
Statistical notions pertinent in engineering. Includes 
chapters on probabilistic design, component reliabil- 
ity, systems analysis, and strength of materials. FLW 
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Statistics, $(17-18), P. Goodness-Of-Fit. Ed: P. 
Révész, K. Sarkadi, P.K. Sen. Elsevier Science, 
1987, 624 pp, $120. [ISBN: 0-444-70087-0] 36 pa- 
pers presented at a colloquium held in Hungary in 
1984 on the theory and application of goodness-of-fit 
tests. FLW 


Statistics, T(15-16: 1), S. Multivariate Data 
Analysis. William W. Cooley, Paul R. Lohnes. 
Robert E Krieger, 1985, x + 364 pp, $30.50. [ISBN: 
0-89874-781-3] A reprint of the 1971 edition. A data 
analytic and non-mathematical approach to factor 
analysis, canonical correlation, partial correlation, 
multivariate analysis of variance, and discriminant 
analysis. FLW 


Statistics, S(17). Mizture Models: Inference and 
Applications to Clustering. Geoffrey J. McLachlan, 
Kaye E. Basford. Stat., V. 84. Dekker, 1988, xi + 
253 pp, $69.75. [ISBN: 0-8247-7691-7] Theoretical 
and practical issues of clustering. Numerous exam- 
ples of real data sets taken from agriculture, botany, 
education, medicine and zoology used to demon- 
strate applications of finite mixture models. Excel- 
lent source of references on work in area. Fortran 
listing of computer programs given. Good supple- 
ment for multivariate analysis texts. MS 


Statistics, T(17-18: 1), S, P*, L. Multiple Impu- 
tation for Nonresponse in Surveys. Donald B. Rubin. 
Ser. in Prob. & Math. Stat. Wiley, 1987, xxix + 258 
pp, $32.95. (ISBN: 0-471-08705-X] Gives a method 
for handling nonresponses in sample surveys by “re- 
placing each missing value with two or more imputa- 
tions, analyzing the resulting multiply-imputed data 
set using standard methods, and then combining the 
complete-data answers in simple ways.” FLW 


Elementary Computer Science, T(13-14). 
Structured Programming Using True BASIC: An In- 
troduction. Wade Ellis, Jr., Ed Lodi. Harcourt Brace 
Jovanovich, 1988, xviii + 356 pp, $19 (P). [ISBN: 
0-15-584076-2] Requires Macintosh, IBM (or com- 
patible) PC’s and True BASIC (a new BASIC in- 
corporating structured programming technique not 
present in BASIC). Many sample programs. Prob- 
lems at the end of each chapter. Appropriate for 
teaching structured programming on PC’s without 
Pascal or other language. LB-E 


Programming. Complete BASIC for the Short 
Course. James S. Quasney, John Maniotes. Boyd & 
Fraser, 1985, xi + 188 pp, $7.50 (P). (ISBN: 0-87835- 
151-5] Written to supplement a general textbook in 
an introductory data processing course with a BA- 
SIC programming component. Assumes no previous 
programming experience. Seven different dialects of 
the BASIC language are documented. Emphasizes 
structured programming techniques, minimizing use 
of GOTO statement. Includes many simple examples 
of applications. SM 
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Programming, T(13-18: 1), S. The SCHEME 
Programming Language. R. Kent Dybvig. Prentice- 
Hall, 1987, xi + 242 pp, $21.33 (P). [ISBN: 0-13- 
791864-X] Dialect of LISP supporting static scoping, 
first-class procedures, and continuations. Presup- 
poses familiarity with Scheme or another program- 
ming language. Topics include an introduction for 
the novice Scheme programmer, creation and assign- 
ment of identifier bindings, program control oper- 
ations, operations on object types, I/O operations, 
system operations, syntactic extension and structure 
definition. Ends with a chapter of longer, complete 
examples. Text uses the Chez Scheme implementa- 
tion. References. Summary of Forms. Index. RJA 


Programming, S(14-16), P*, L. Elements of C. 
Morton H. Lewin. Found. of Comput. Sci. Plenum 
Pr, 1986, xiv + 246 pp, $29.50. [ISBN: 0-306-42182- 
8] An exposition of the C programming language (as 
opposed to programming technique) which presents 
expressions (pointers, structures and all) as a whole 
before turning to program structure and complete 
programs. Expressly for the professional, experi- 
enced student, or hobbyist who already programs in 
assembly language and at least one high-level lan- 
guage. At last! RB 

Programming, T(14-16: 1, 2), S. A Structured 
Approach to Programming, Second Edition. Joan K. 
Hughes, Glen C. Michtom, Jay I. Michtom. Prentice- 
Hall, 1987, x + 322 pp, $26.67. (ISBN: 0-13-854159- 
0] Topics include program design, planning and im- 
plementation, structured programming, stepwise re- 
finement, structured walk-throughs and inspections, 
testing. Four language chapters, two of which are 
new on Pascal and on C. Also new material added 
on top-down design and testing. Chapter summaries, 
review questions and exercises. Glossary; bibliogra- 
phy; index. RJA 

Programming, T(13-18: 1), S. Advanced Turbo 
Prolog: Version 1.1. Herbert Schildt. Osborne 
McGraw-Hill, 1987, xiii + 323 pp, $21.95 (P). (ISBN: 
0-07-881285-2] Considers many typical artificial in- 
telligence problems and their solutions. Chapters in- 
clude an overview of artificial intelligence, search, ex- 
pert systems, natural languages, vision and pattern 
recognition, robotics, machine learning, logic. Ap- 
pendices on interfacing with other languages and on 
the Turbo Prolog Toolbox. Index. RJA 
Programming, T(13-18: 1), S. The T Program- 
ming Language: A Dialect of LISP. Stephen Slade. 
Prentice-Hall, 1987, xiv + 425 pp, $21.33 (P). (ISBN: 
0-13-881905-X] T is a version of the Scheme dialect 
of LISP. Assumes familiarity with programming in a 
language other than LISP. Suitable as a supplemental 
text for a course in artificial intelligence, data struc- 
tures, computer systems, compiler design. In addi- 
tion to regular LISP features, T includes procedural 
arguments, debugging and error checking, data ab- 
straction, object-oriented programming, locales, op- 
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timizing compiler, uniform semantics for interpreted 
and compiled code. Chapter summaries. Answers to 
selected exercises. References. Index. RJA 


Programming, T(13-18: 1), S. The XLISP 
Primer. Bonnie J. Fladung. Prentice-Hall, 1987, xix 
+ 172 pp, $25.67 (P). (ISBN: 0-13-972084-7] A tuto- 
rial and reference manual for the XLISP interpreter. 
Provides an introduction to LISP, emphasizing func- 
tions compatible with Common LISP. Chapter ex- 
ercises with answers. Summary of XLISP features; 
appendix of graphics and interface commands for the 
IBM PC and for the Macintosh. Index. RJA 


Languages, T(13-18: 1), S*, P, L. Productive 
Prolog Programming. Peter Schnupp, Lawrence W. 
Bernhard. Prentice-Hall, 1987, xiv + 296 pp, $26 
(P). [ISBN: 0-13-725110-6] Treats Prolog as a pro- 
duction language, as a tool for software engineers. 
Prolog is first viewed as a relational knowledge base, 
secondly as a procedural interpreter of its rules, 
thirdly as a pattern matcher in the form of the uni- 
fication process. Subsequent to these three views, 
Prolog is investigated as a system of programming 
in logic. Application and implementation details, in- 
dex. RJA 


Computer Systems. Lecture Notes in Computer 
Science-266: Advances in Petri Nets 1987. Ed: Grze- 
gorz Rozenberg. Springer-Verlag, 1987, vi + 451 pp, 
$34.60 (P). [ISBN: 0-387-18086-9] Papers from the 
seventh European workshop on applications and the- 
ory of Petri nets held in Oxford in June 1986. Con- 
cludes with a major comprehensive bibliography on 
Petri nets. LAS 


Computer Graphics, P. Techniques for Computer 
Graphics. Ed: David F. Rogers, Rae A. Earnshaw. 
Springer-Verlag, 1987, viii + 512 pp, $59.50. [ISBN: 
0-387-96492-4] Proceedings of the International Sum- 
mer Institute on the State-of-the-Art in Computer 
Graphics held at the University of Stirling. Con- 
tains nineteen papers covering: workstations, graph- 
ics standards, image generation, computer-aided de- 
sign, curves and surfaces, human-computer interface 
issues, electronic documents, integrated graphics and 
text, solid modeling, and VLSI. Many of the papers 
contain background discussions, so the book should 
be useful for non-specialists as well as for more ex- 
perienced researchers. AM 


Computer Graphics, P. Advances in Computer 
Graphics: Hardware I. Ed: W. Strafer. Eurograph- 
icSeminars. Springer-Verlag, 1987, x + 145 pp, $44. 
[ISBN: 0-387-18222-5] Proceedings of a 1986 work- 
shop. Contributions discuss workstation architec- 
tures, hardware support for geometric modelling, 3- 
D models, graphics for traffic and movement simula- 
tion, and ray tracing. RM 


Theory of Computation, T(18: 1, 2), S, P. Va- 
riettes of Formal Languages. J.E. Pin. Transl: A. 
Howie. Plenum Pr, 1986, x + 138 pp, $37.50. [ISBN: 
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0-306-42294-8] Presents basic results on theory of fi- 
nite automata and rational languages. The concept 
of variety of languages is used to formalize the re- 
lationship among finite automata, recognizable lan- 
guages, finite semigroups. Chapter problems. Bibli- 
ographic notes. Index. RJA 


Theory of Computation, P. Lecture Notes in 
Computer Science-271: From Logic Design to Logic 
Programming. Dominique Snyers, André Thayse. 
Springer-Verlag, 1987, iv + 125 pp, $15.40 (P). 
(ISBN: 0-387-18217-9] P-functions are described and 
related to the design of algorithms and to declara- 
tive programming. Topics include theorem proving, 
grammars, logics, semantics. Bibliography. RJA 


Theory of Computation, T(18: 1), P. The Com- 
plezity of Boolean Functions. Ingo Wegener. Ser. in 
Computat. Sci. Wiley, 1987, xi + 457 pp, $44.95. 
(ISBN: 0-471-91555-6] Covers models for computing 
Boolean functions, parameters for measuring com- 
plexity, bounds on complexity, relationship of circuit 
models and Turing machines. Assumes basic knowl- 
edge of linear algebra, combinatorics, algorithm anal- 
ysis, complexity theory. Exercises at end of each 
chapter. Extensive bibliography. KS 


Theory of Computation, §(17). Essays on Con- 
cepts, Formalisms, and Tools. Ed: P.R.J. Asveld, A. 
Nijholt. CWI Tract, V. 42. Math Centrum, 1987, v + 
278 pp, Dfi. 42.90 (P). (ISBN: 90-6196-326-5] This 
is a series of about a dozen papers addressing re- 
search issues in the field of theoretical computer sci- 
ence. These papers were presented at a symposium 
on this subject at Twente University, The Nether- 
lands, in October 1987. The papers address such is- 
sues as formal languages, logic programming, parses, 
and formal models of computation. GMS 

Theory of Computation, T(16-18: 1, 2), S, P, 
L. Computability Theory, Semantics, and Logic Pro- 
gramming. Melvin Fitting. Logic Guides, V. 13. 
Oxford U Pr, 1987, xi + 198 pp, $35. (ISBN: 0- 
19-503691-3] Early chapters investigate theoretical 
properties of the high-level language EFS. A model 
computer is subsequently introduced with a pro- 
cess for translating from EFS into its machine lan- 
guage. EFS is chosen to be a logic programming lan- 
guage, similar to Prolog. The model of a computer 
is based on register machines. Three major themes 
are developed: robustness, essential limitations, and 
program semantics. Exercises. Chapter background 
sections. Chapter references. Appendex on induc- 
tion. Index. RJA 


Artificial Intelligence, P. Artificial Intelligence 
and Instruction: Applications and Methods. Ed: 
Greg Kearsley. Addison-Wesley, 1987, xiv + 351 pp, 
$29.95. (ISBN: 0-201-11654-5] Individual chapters 
written by experts on different aspects of Intelligent 
Computer Assisted Instruction (ICAI). Begins with 
an overview of ICAI. Parts two and three concern ar- 
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tificial intelligence in education and in training. Part 
four is on building intelligent tutors, while the last 
part is on implementing ICAI systems. Chapter ref- 
erences. Index. RJA 

Artificial Intelligence, T*(15-18: 1, 2), S, 
P, L. Logical Foundations of Artificial Intelligence. 
Michael R. Genesereth, Nils J. Nilsson. Morgan 
Kaufmann, 1987, xviii + 405 pp, $36.95. (ISBN: 
0-934613-31-1] Text rests on two main assumptions: 
progress in a discipline requires development and use 
of appropriate mathematical apparatus to express 
ideas, and symbolic logic forms, one of the most 
important aspects of the mathematics for artificial 
intelligence. Topics include declarative knowledge, 
inference, resolution, nonmonotonic reasoning, in- 
duction, reasoning with uncertainty, knowledge and 
belief, planning, metaknowledge and metareason- 
ing, state and change, intelligent-agent architecture. 
Chapter exercises, bibliographical and historical re- 
marks. Answers to exercises. RJA 


Applications, T(15: 1), L. Introduction to An- 
alytical Dynamics. N.M.J. Woodhouse. Oxford U 
Pr, 1987, x + 169 pp, $39.95. [ISBN: 0-19-853198- 
2} A textbook for undergraduates. Slim, no-fat 
coordinate-based treatment of Newtonian mechan- 
ics. Requires background in linear algebra, multivari- 
able calculus and some three-dimensional mechanics. 
Builds a case for Newtonian mechanics. Many exer- 
cises and examples from recent Oxford examination 
papers. JK 


Applications (Electrical Engineering), T(14- 
16). Fundamentals of Modern Digital Systems, 
Second Edition. B.R. Bannister, D.G. Whitehead. 
Springer-Verlag, 1987, ix + 304 pp, $29.50. (ISBN: 
0-387-91311-4] Good introduction to digital systems 
with excellent introductory chapters on combina- 
tional logic, numbering systems, and coding. Prob- 
lems of varying levels with solutions to selected (not 
odd) problems. Fine illustrations well-spaced within 
text. Includes chapters on memory systems and pro- 
grammable devices as well as more fundamental top- 
ics: semiconductors, logic design techniques, logic 
components and state tables. LB-E 


Applications (Engineering), P, L. Formulas, 
Facts and Constants for Students and Profession- 
als in Engineering, Chemistry and Physics, Second, 
Revised and Enlarged Edition. H.J. and K.H. Fis- 
chbeck. Springer-Verlag, 1987, xv + 260 pp, $23 
(P). (ISBN: 0-387-11315-0] 


Applications (Engineering), P. Lecture Notes in 
Engineering-26: Shell and Spatial Structures: Com- 
putational Aspects. Ed: G. De Roeck, et al. 
Springer-Verlag, 1987, vii + 487 pp, $60 (P). [ISBN: 
0-387-17498-2] Proceedings of a July 1986 sympo- 
sium held in Lenven, Belgium on new techniques for 
design of structures using engineering workstations. 
LAS 


TELEGRAPHIC REVIEWS 


[March 


Applications (Physics). Analysis and Mathemat- 
ical Physics. Hans Triebel. Math. & Its Applic. 
D Reidel, 1986, xxviii + 456 pp, $89. [ISBN: 90- 
277-2077-0] Lists without proof many major theo- 
rems from a vast area of mathematics, beginning 
with elementary algebra and trigonometry and pro- 
gressing through differential and integral calculus of 
one and several variables, ordinary and partial differ- 
ential equations, calculus of variations, measure and 
integration theory, complex function theory, Hilbert 
spaces, and much more. Apart from the definitions 
and statements of theorems, the text consists primar- 
ily of numerous disconnected remarks. MU 


Applications (Physics), S(17-18), P. Obstacle 
Problems in Mathematical Physics. José-Francisco 
Rodrigues. Math. Stud., V. 134. Elsevier Science, 
1987, xv + 352 pp, $71 (P). [ISBN: 0-444-70187-7] 
Presents a general account of the applicability of el- 
liptic variational inequalities to obstacle type bound- 
ary problems. Beginning with some obstacle prob- 
lems which can be reduced to variational inequali- 
ties, the book then presents some of the main aspects 
of elliptical inequalities from the abstract Hilbertian 
framework to the smoothness of the variational so- 
lution. Clearly written and laced with informal dis- 
cussion. MU 


Applications (Physics). Nonlinear Diffusive 
Waves. P.L. Sachdev. Cambridge U Pr, 1987, vii + 
246 pp, $49.50. [ISBN: 0-521-26593-2] “This mono- 
graph deals with Bergers’ equation and its general- 
izations. Such equations describe a wide variety of 
nonlinear diffusive phenomena, for instance, in non- 
linear acoustics, laser physics, plasmas and atmo- 
spheric physics ....” Both analytical and numerical 
perspectives. Prerequisites: basic course in gas dy- 
namics; ordinary and partial differential equations; 
parabolic partial differential equations helpful. RB 


Reviewers 


RJA: Richard J. Allen, St. Olaf; DFA: David F. Appleyard, 
Carleton; RB: Richard Brown, Carleton; LB-E: Lisa Brown- 
Ewing, Macalester; JNC: Judith N. Cederberg, St. Olaf; LC: 
Laura Chihara, St. Olaf; BC: Barry Cipra, St. Olaf; CEC: 
Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Car- 
leton; RSF: Robert S. Fisk, St. Olaf; SG: Steven Galovich, 
Carleton; GG: George Gilbert, St. Olaf; BH: Bruce Hanson, 
St. Olaf; RH: Rhonda Hatcher, St. Olaf; RK: Roger Kirch- 
ner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: Joseph 
Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; AM: 
Alan Magnuson, St. Olaf; SM: Steve McKelvey, St. Olaf; RM: 
Richard Molnar, Macalester; RWN: Richard W. Nau, Car- 
leton; AO: Arnold Ostebee, St. Olaf; MR: Matthew Richey, 
St. Olaf; AWR: A. Wayne Roberts, Macalester; GMS: G. 
Michael Schneider, Macalester; PS: Paul Schoening, St. Olaf; 
JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; 
KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; 
MS: Myriam Steinback, Macalester; MU: Milton Ulmer, Car- 
leton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wal- 
lace, St. Olaf; LW: Lynne Walling, St. Olaf; FLW: Frank L. 
Wolf, Carleton; MZ: Michael Zielinski, St. Olaf. 


NCTM Yearbooks 


Each One a Treatise on a 
Timely Mathematical Topic 


All Are Valuable to Teachers of Future Teachers 


The Ideas of Algebra, K-12 (1988 Yearbook), edited by 
Arthur F. Coxford, emphasizes that algebra is an exciting, 
living, and growing school mathematical field. Because 
algebra is the area in which symbolic conventions needed 
for more advanced mathematics are first developed, 


understanding of its fundamental concepts is very impor- 


tant. This book will guide you in establishing K-12 alge- 
bra curricula and selecting specific approaches for its 
teaching. It will also help you diagnose children’s diffi- 
culties, gather ideas for using word problems, and encour- 
age you to use technology in the algebra classroom. 256 


pp., #380, $16.* 


GEOM 
ETRY 


BRA 


Learning and Teaching Geometry, K-12 (1987 
Yearbook), edited by Mary Montgomery Lind- 
quist, reminds the mathematics education commu- 
nity that there is a great body of knowledge that 
should be called on when decisions are being made 
about the geometry curriculum, especially research 
about how students learn and what they are not 
learning. Authors of this yearbook—basing their 
writing on this wealth of knowledge—will show you 
that geometry is for all (for kindergarten through 
teacher education) and that geometry is useful, in 
many new ways in today’s technological society, 
in problem-solving approaches to teaching mathe- 
matics, and in other branches of mathematics. 250 
pp., #359, $16.* 


Estimation and Mental Computation (1986 Yearbook), 
edited by Harold L. Schoen. Because of advances in tech- 
nology, a broader view of estimation and its companion, 
mental computation, is required in today’s society. This year- 
book’s major theme is the interdependency between the proc- 
ess of estimating in a particular area and the understanding 
of mathematical concepts in that same area. It stresses the 
point that an important by-product of learning to estimate 
is better conceptual understanding, and—conversely—con- 
cepts must be understood in order to provide the flexible 
set of processes and decision-making rules needed by the pro- 
are estimator. Covers K-12 grade levels. 256 pp., #347, 
16.* 


IX) fel] NATIONAL COUNCIL OF 
T TEACHERS OF MATHEMATICS 
1906 Association Drive, Reston, VA 22091 


ESTI 
TION 


*|Individual members, bookstores, or orders of 10 or more copies of the same publication shipped to one address earn a 
discount of 20% off the list price. VISA and MasterCard charges are acceptable; please specify the credit card number 


and expiration date. Virginia residents add 4.5% sales tax. 


TH 


EDITION 


CALCULUS 
AND 
ANALYTIC 
GEOMETRY 


Calculus and Analytic Geometry, 
Seventh Edition 


by George B. Thomas, Jr. and Ross L. Finney 
Discover the Thomas/Finney Advantage. 


In keeping with the dynamic nature of the subject their names have become 
synonymous with, George Thomas and Ross Finney have made some thoughtful 
changes in the newest edition of their best-selling text. They’ve combined new 
problems and applications with an overall presentation that’s lively and refreshingly 
accessible—without sacrificing precision. And hardworking supplements, such as 
AWTest—a computerized test generator—the Student Edition of MathCAD, and 
The Calculus Toolkit,” are also available. Calculus and Analytic Geometry has 
it all. It’s precise, applied, pedagogical, accurate, and proven. 


New for 1988 


Calculus: A Modeling Approach, Fourth Edition 
by Marvin L. Bittinger 


Applied Calculus, Second Edition 

by Marvin L. Bittinger and Bernard B. Morrel 

Essential Mathematics, Fifth Edition 

by Mervin L. Keedy, Marvin L. Bittinger, and William Rudolph 
Developmental Mathematics, Second Edition 

by Mervin L. Keedy and Marvin L. Bittinger 

Differential Equations: An Introduction, Second Edition 
by David Sanchez, Richard Allen, and Walter Kyner 
Introduction to Discrete Mathematics 

by Jim Bradley 

An Introduction to Linear Algebra 

by Carroll Wilde 

Technical Mathematics I 

by Thomas J. McHale and Paul T. Witzke 

Technical Mathematics II 

by Thomas J. McHale and Paul T. Witzke 

Elementary Number Theory with Applications, Second Edition 
by Ken Rosen 


Contact your Addison-Wesley sales representative for more details on any of the titles you see in 
this ad. 


wv Addison-Wesley Publishing Company 


Reading, MA 01867 
(617) 944-3700 


The leader in mathematics. 
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Let f(z)’s animated 
screens show you what 
a textbook cannot. 


The Complete 
Complex Variables 
Graphing Package 


Lascaux Graphics 
3220 Steuben Ave. 
Bronx, NY 10467 


(212) 654-7429 
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Explain this apparent convergence 


If you have a PC compatible, write fora FREE demo disc! 
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Professional/ Technical /Reference Publishing = 


Algorithms and Data Structures 
Niklaus Wirth 

1986, 288 pp., paper 013-022005-1, 
$41.00 


Algorithms and Complexity 
Herbert S. Wilf 

1986, 240 pp., cloth 013-021973-8, 
$38.00 


Algorithms + Data Structures = 
Programs 

Niklaus Wirth 

1986, 366 pp., cloth 013-022418-9, 
$45.00 


PRENTICE HALL 


Simon & Schuster 
Higher Education Group 
Englewood Cliffs, NJ 07632 
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implementing Mathematics with the 
NUPRL Proof Development System 
Robert Constable 

1987, 320 pp., cloth 013-451832-2, 
$27.00 


Solving Polynomial Systems Using 
Continuation for Engineering and 
Science Problems 

Alexander Morgan 

1987, 384 pp., cloth 013-822313-0, 
$54.00 


Available at better bookstores or direct from 
Prentice Hall at (201) 767-5937. For 
quantity orders call (201) 592-2498 


Prices subject to change without notice 


Order your copy of the Prentice Hall Professional/ 
Technical/Reference Catalog: Books for 
Computer Scientists, Computer/Electrical 
Engineers and Electronic Technicians for only 
$2 00 and receive $5 00 off your first purchase 
from the catalog (013-622804-6) 
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Setting new standards 
for mathematics 
instruction ... 


CALCULUS 


POR IEEE FOPPEEO'S 


/ ? 
ty i e 
Up Mlbate Pe aas 


STANLEY 1, GROGSMAWN 


CALCULUS WITH ANALYTIC 
GEOMETRY 

Alternate Edition 

ROBERT ELLIS and DENNY GULICK 


CALCULUS 
Fourth Edition 


STANLEY I. GROSSMAN 


CALCULUS FOR THE MANAGE- 
MENT, LIFE, AND SOCIAL 
SCIENCES 

Second Edition 

BERNARD KOLMAN and 
CHARLES DENLINGER 


APPLIED CALCULUS 
BERNARD KOLMAN and 
CHARLES DENLINGER 


COLLEGE ALGEBRA 


COLLEGE ALGEBRA AND 
TRIGONOMETRY 

Both books by ADELBERT HACKERT and 
GENE R. SELLERS 


STATISTICS 
An Introduction 
Second Edition 


ROBERT D. MASON, DOUGLAS A. 
LIND, and WILLIAM G. MARCHAL 


1988 textbooks LINEAR ALGEBRA AND ITS 


APPLICATIONS 
Third Edition 
GILBERT STRANG 


A COURSE IN LINEAR ALGEBRA 
DAVID DAMIANO and JOHN LITTLE 


AN INTRODUCTION TO LINEAR 


ALGEBRA WITH APPLICATIONS 
Second Edition 


STEVEN ROMAN 


APPLIED FINITE MATHEMATICS 
Fourth Edition 

HOWARD ANTON, BERNARD 
KOLMAN, and BONNIE AVERBACH 


MATHEMATICS WITH APPLICA- 
TIONS FOR THE MANAGEMENT, 


LIFE, AND SOCIAL SCIENCES 
Third Edition 


HOWARD ANTON, BERNARD 
KOLMAN, and BONNIE AVERBACH 


PREREQUISITE ALGEBRA 
CHARLES P McKEAGUE 


TRIGONOMETRY 
Second Edition 


CHARLES P McKEAGUE 


HARCOURT BRACE JOVANOVICH, Inc. 
College Sales Office 

7555 Caldwell Avenue 

Chicago, IL 60648 


The Journal of Undergraduate Mathematics and Its Applications 


Table of Contents for Volume 8.1, 1987 


Editorial: Solving Problems Graphically Using Microcomputers..Bert K. Waits and Franklin Demana 
Articles: Secret Encryptions with public Keys...Linda L. Deneen 
The Role of Mathematics in Atonal Music...Howard J. Wilcox 
UMAP Modules: Price Elasticity of Demand: Gambling, Herion, Marijuana, Whiskey, Prostitution, 
and Fish...Yves Nievergelt 
Funding Pension Benefits: The Individual Spread Gain Method...Ho Kuen Ng 
On Jargon: The Language of Growth, Enlargement, and Decrease...Paul Campbell and Gary Britton 


The UMAP Journal contains contemporary applications and intelligent articles at an affordable price.}} 
Subscribe now and reap the benefits. COMAP is also delighted to offer the 1987 annual collection,|} 
UMAP Modules, Tools for Teaching, as part of your 1988 Journal subscription. The annual collection] 
comprises all modules published in 1987 and is a solid compendium for classroom usage, as well as af 
must for your library. | 


Volume 9, 1988 discounted subscription rates for members of MAA 
Individual, four issues 


The Mathematics of Games and Gambling, 


by Edward Packel 
141 pp., 1981, Paper, ISBN-0-88385-628-X 
List: $11.00 MAA Member: $8.80 


“‘The whole book 1s written with great urbanity and clarity . . . 
it 1s hard to see how it could have been better or more readable. ’’ 
Stephen Ainley in The Mathematical Gazette 


You can’t lose with this MAA Book Prize winner, if you want to see how mathe- 
matics can be used to analyze games of chance and skill. Roulette, craps, black- 
jack, backgammon, poker, bridge, state lotteries, and horse races are considered 
here in a way that reveals their mathematical aspects. The tools used include 
probability, expectation, and game theory. No prerequisites are needed beyond 
high school algebra. 

No book can guarantee good luck, but this book will show you what deter- 
mines the best bet in a game of chance, or the optimal strategy in a strategic 
game. Besides being a good supplement in a course on probability and good bed- 
side reading, this book’s treatment of lotteries should save the reader some 
money. 


Order from: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


THE BRIDGE SOFTWARE MATH UTILITIES 


SURFS 


SURFS makes plotting functions z = f(x,y) in three 
dimensions easy and effective. You can opt for 
automatic scaling and positioning, but all aspects of 
the plot, including rotations through any angles, 
stretchings and translations are modifiable from 
simple menus. Hidden-line removal and correct axis 
placement are built in; cross hatching is another 
option. You can also set the number and the fineness 
of contours, and change the domain of the function. 


DIFFS 


DIFFS plots the direction fields and solution curves to 
ordinary first order differential equations y’ = f(x,y). 
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CURVES 

This is a very powerful yet easy-to-use curve sketcher. 
You can enter equations in Rectangular, Polar, or Par- 
ametric form and CURVES will plot them. Scaling is 
possible for special effects. CURVES also provides a 
unique movable ‘‘magnification window”’ permitting 
repeated ‘zooming in’’ on any section of the plot. You 
can use this to find the solution of simultaneous equa- 
tions to a high degree of accuracy. 


= 19co0(t) - Scoo(i04t/S) 
= 190in(t) + Soin (99t/5) 


FEATURES: 
® High Resolution output to a dot y = sinta) sin tx), 
matrix printer: 1000 x 1000 dot 

images are Near Plotter Quality 
e Easy Keyboard input of function 
@ Full 8087 support 
® Easy to understand documentation, X) + SIN (Xx) 
with examples and illustrations. No 
programming necessary. 


REQUIRES: 


e IBM PC, XT, AT or compatibles, “They h ; . 
with/without 8087 chip; 320K RAM; They have been a delight to use. 


PC-DOS, MS-DOS, versions 2.0 or “| just can't imagine being without it.” 


higher. . - Anthony Barcellos, PC Magazine 
© PRINTER: Standard version supports 
Epson, IBM or compatible dot-matrix 


printers. Special versions available CALL FOR OUR S$ 
. DNotten tie enn NEC ‘SURES EDUCATOR’S SPECIAL 
otter: HP plotter version 
CURVES) for slight extra charge. All 3 Programs Only... 


Af I-\|BRIDGE SOFTWARE epucational scientiric WARE 
P.0.118, New Town Branch, Boston, MA 02258 (617) 527-1585 


CUAVES CURVES 


$125 
Value 


THE UNIVERSITY OF PAPUA NEW GUINEA 
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Port Moresby 
PROFESSOR OF MATHEMATICS 


Mathematics Department 
W121001 


Applications are invited for the position of Professor of Math- 
ematics from distinguished academics having a Ph.D. or an equiv- 
alent degree in a mathematical science with research potential, 
inter-disciplinary outlook and dedication to teaching mathemat- 
ics at university level. Experience in a developing country would 
be an advantage. The appointee will be expected to provide ac- 
ademic leadership within the department, supervise higher de- 
gree students, and also participate in the staff development 
programme for national academic staff. Detailed information will 

e available from the Head of Mathematics. 
SALARY 
Professor -K28010p a plus 24% gratuity 

TEACHING BY EXTENSION AND DURING LAHARA (SUMMER 
SESSION) IS AN IMPORTANT PART OF THE UNIVERSITY’S ACTIV- 
ITIES. THE SUCCESSFUL APPOINTEES SHOULD BE PREPARED TO 
TAKE PART IN THESE OR OTHER OUT REACH ACTIVITIES. 

Applications will be treated as strictly confidential and should 
include three copies of a full curricufum vitae, a recent small 
photograph, the names and addresses of three referees and date 
of availability. In order to expedite the appointment procedure, 
applicants are advised to contact their referees to send confiden- 
tia reports directly to the University without waiting to be con- 
tacted. 

Applications should be forwarded to the Deputy Registrar 
(Staffing), University of Papua New Guinea, P.O. Box 320, Uni- 
versity Post Office, Papua New Guinea. 


Applications close on 30 April 1988. 


T IAMO (MR) 
ACTING REGISTRAR 


HRA 87439 


“When we read what their college will cost... 
we bought the maximum life insurance available 
in the MAA Members’ Insurance Program...’ 


“,..Oh, we expect to be there—proud as can jobs, your group insurance automatically 
be—when they enter college. But, just in goes with you, so there’s no lapse in your 
case one of us isn’t...the insurance cover- —_ insurance coverage. 
age willbe.” Check your insurance portfolio. Will it 

Ifa breadwinner is no longer aroundto meet your family’s future needs? If not, call 
provide for family shelter and the children’s — or write the Administrator. 
college education...life insurance had 
hetter be UP TO $225,000 IN 

te: TERM LIFE INSURANCE PROTECTION IS 

Our group ferm life INSUFANCE Can Sup- AVAILABLE TO MAA MEMBERS. 
plement your insurance portfolio so that Plus this other group insurance plan: 
your family has the protection it needs Excess Major Medical 

—and for the future In-Hospital Insurance 

now—an Disability Income 

As a member, you can fill your insurance High-Limit Accident Insurance 
needs at low group rates. You can get 
family coverage, too. And if you change 


The MAA Life Plan is underwritten by Connecticut General Contact Administrator, 

Life Insurance Company, a CIGNA Company, Hartford, MAA Group Insurance Program 

Connecticut 06152 on form number GM 3000. Smith-Sternau Organization, Inc 
1255 23rd Street, NW. 
Washington, D.C. 20037 


800 424-9883 Toll Free 


in Washington, D.C. area, 202 296-8030 


Special to readers of the Monthly: $9.95, postage included 
$11.95 for foreign orders 


CALCULUS 
MURDER 


Erik Rosenthal 
Department of Computer Science, Wellesley College 


",..1s the story of a mathematician who doubles as a ... private investigator. He does this 
outside work because he has not been able to latch onto a university teaching job. Perhaps 
his specialty — bounded linear operators on a separable Hilbert space —is toorarified ... His 
lawyer friend asks him to investigate the murder of a fashion manufacturer ... The Calculus 
of Murder turns out to be an orthodox and even classic mystery ... The writing is easy-going, 
the characters well etched ... — Newgate Callendar, New York Times Sunday Book Review 


"[Dan Brodsky] proves to be an engaging narrator, as he drives, in an ancient anthropomor- 
phised motor car, around the Bay Area, pondering mathematical problems..." — The Tablet 
(England) 


“For those who prefer sherry to beer ... there [ is a ] promising debut. Erik Rosenthal's The 
Calculus of Murder ... takes place in ... San Francisco. His victim is a malign industrialist, 
his detective is a Ph.D mathematician who works as a private eye while waiting for a 
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An Algebraic Approach to Some Number-Theoretic Problems 
Arising from Paper-Folding Regular Polygons 


JON FROEMKE AND JERROLD W. GROSSMAN 


JON FROEMKE: I received my Ph.D. from the University of California at 
Berkeley in 1967 in universal algebra under the direction of Alfred Foster. I 
have been at Oakland University ever since. This article represents a return 
to number theory, my first love, and in an unusual way, a continuation of my 
interest in the arithmetic of primal algebras. 


JERROLD W. GROSSMAN: I specialized in mathematical logic at Stanford 
University (B.S., M.S. in 1970), studied algebraic topology at M.LT. (Ph.D. 
in 1974 under Daniel Kan), and have been at Oakland University since 1974. 
My research articles have dealt with algebraic topology, algebra, graph 
theory, combinatorics, probability and statistics, computer science, and now, 
apparently, number theory. 


Introduction. Since 1983 Peter Hilton and Jean Pedersen have been studying the 
ramifications of an ingenious paper-folding construction they devised for approxi- 
mating angles and regular polygons [2], [3], [4], [5], [6], [7]. They are quickly led into 
number-theoretic questions. In this article we will see that with the use of some 
additional tools of graph theory, number theory, and algebra, we can more fully 
explain the phenomena they are studying. We also pose some new questions they 
have not raised and answer many of them. Perhaps not surprisingly, our investiga- 
tions touch on several aspects of elementary number theory and lead quickly to 
some of its famous unsolved problems. On the other hand, Hilton and Pedersen 
pursue other interesting number-theoretic aspects of the construction, which we do 
not consider, so in no sense does our paper supersede their work. 

This article is somewhat self-contained, in the sense that most of the number 
theory we need is reviewed herein. The underlying problems are extremely easy to 
state, and thus the article should be accessible to a wide audience. Indeed, the basic 
numerical construction proved fascinating to the first author’s ten-year-old son, who 
could appreciate the questions being raised, but not, of course, the reasons behind 
their answers. 

In what follows, we will let Z denote the set of integers, and for integers x and y 
we will let (x, y) and [x, y] denote their greatest common divisor and least common 
multiple, respectively (with the obvious extensions to more than two variables). 

Since the statement of the problems we are considering cannot be given until 
Section 2, it would be meaningless to give an outline of the paper at this point. The 
usual preview, section by section, is instead given at the end of Section 2. 
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1. Folding angles: the geometric motivation. Hilton and Pedersen have a clever 
method for approximating rational angles, to any desired degree of accuracy, using 
only an elementary paper-folding operation: bisecting an angle. (By rational angle 
we mean an angle whose measure is a rational number of degrees.) We summarize 
their procedure briefly. Although the geometric problem has no direct bearing on 
the rest of this paper, it serves as the motivation for considering the numerical 
construction of Section 2. 

Let r, = am/b be a given acute rational angle, i.e., assume that 0 < a < b/2, 
where a and b are integers with (a, b) = 1. Call b the denominator of r). For their 
purposes, Hilton and Pedersen assume initially that b is odd, and we will also (for 
now) make this assumption. Suppose angle r, is formed between the bottom edge of 
a long strip of paper and a fold in the paper (see FIGURE 1). 


Fic. 1. Angle 7% on a strip of paper. 


If a is even, say a = 2a’, then by folding the original crease onto the bottom 
edge of the strip, we can construct angle r, = a’1/b (see FIGURE 2). 


LX 


FIG. 2. Bisecting aa/b when a is even. 


On the other hand, if a is odd, then the supplementary adjacent interior angle at 
the top of the strip has measure (b — a)7/b, with (b — a) even, say equal to 2a”. 
We can, therefore, bisect it by folding the original crease onto the top edge of the 
strip and, thereby, form r, = a’’7/b along the top of the strip (see FIGURE 3). 


FIG. 3. Bisecting (b — a)a7/b when a is odd. 


Iterating this procedure, we obtain a sequence of acute rational angles 7, 7,, %,... 
with denominator b. It turns out (as a consequence of some observations in Section 
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2) that the sequence is cyclic, and hence that rp = r, for some k (to be called the 
quasiorder of 2 mod b). See FIGURE 4. 


FIG. 4. % = 1%. 


Now if in fact 7p 1s only an approximation to a7/b, say with a small error e, then 
it is easy to See that the r, constructed by this procedure is again an approximation 
to a/b, but with error ¢/2*. Thus, by iterating the process, we can construct 
arbitrarily good approximations to az /b. 

Using this paper-folding procedure, Hilton and Pedersen devised a systematic 
method for constructing arbitrarily good approximations to any regular convex 
polygon or regular star polygon. 


2. Charm bracelets: the numerical construction. Isolating the numerical ingredi- 
ents of the foregoing geometric construction, we have simply the following oper- 
ation on the set of positive integers less than b and relatively prime to b, where b is 
a fixed odd integer greater than 2: 


if a is even, then halve it, 1.e., let a’ = a/2; 
if a is odd, then subtract it from b,1.e., let a’ = b — a. 


(The reader may have noticed the similarity of this operation to the one that appears 
in the Collatz problem [9], although we have found nothing here that seems relevant 
to that notorious problem.) We will find it easier (but equivalent) in what follows to 
work instead with the opposite operation, which we now define. 


DEFINITION. Let V(b) = {a € Z|0 < a < b and (a, b) = 1}. The function f,: 
V(b) — V(b) is given by the rule 


(a) = 2a ifa<b/2 
fa) = b-a ifa>b/2 


(The subscript is 2 because of the multiplication by 2; we generalize this later.) To 
get a picture of this operation, we construct, for each odd b > 1, a graph whose 
vertices are the elements of V(b) and whose edges are all the unordered pairs 
(a, f,(a)). (If b = 3, then we put two edges between 1 and 2, since f,(1) = 2 and 
f,(2) = 1.) The graph for b = 17 is shown in FIGURE 5. 
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Fic. 5. The graph for 6 = 17. 


The following observations about these graphs follow immediately from the 
definitions. 


1. Eath vertex a for which a is odd and greater than 5/2 has degree 1, being 
adjacent only to b — a. Such values of a we will call charms. 

2. Each vertex a for which a is even and less than b/2 has degree 3, being 
adjacent to 2a, a/2, and the charm b — a. 

3. Each vertex a for which a is even and greater than 5/2 has degree 2, being 
adjacent to b — a and a/2. 

4. Each vertex a for which a is odd and less than b/2 has degree 2, being 
adjacent to b — a and 2a. 


Now if we momentarily discard the charms, then every vertex has degree 2, and 
hence (by a trivial result of graph theory) the graph consists of one or more disjoint 
cycles (polygons). We call such a cycle, together with all the charms connected to it, 
a bracelet. We call the number of vertices in a bracelet its weight, and (continuing 
the jewelry metaphor) we call the number of noncharms in a bracelet its size. 
Finally we let B(2, b) denote the number of bracelets in the graph. For b = 17, we 
see from FIGURE 5 that B(2,17) = 2, that the weight of each bracelet is 8, and that 
the sizes of the two bracelets are 5 and 7, respectively. 

Our bracelet graph replaces Hilton and Pedersen’s “symbol” for representing the 
same information. They would display the second bracelet in FIGURE 5, for 
example, as 


This symbol is interpreted as follows: beginning with a = 3, we subtract a from 17 
and divide the answer (14) as many times as we can by the number 2 (in this case, 
we divide once, obtaining the number 7 as the next value of a). The number of 
factors of 2 that were divided out appears in the second row, and the next value of a 
appears in the first row of the next column. The process is repeated until the 
original value of a is obtained. Two differences between our approaches are worth 
noting. First, Hilton and Pedersen list only the odd values of a less than 5/2; the 
missing intermediate even values are implied and the second row in each column 
shows the number of missing even values. (Hilton and Pedersen use their symbol, 
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among other things, to succinctly encode the instructions for paper-folding ap- 
proximations to star polygons.) Second, the symbol, as read from left to right, 
represents divisions by 2, rather than multiplications by 2, as in our approach, 
although Hilton and Pedersen also present a multiplication-based, as well as a 
division-based, “quasi-order algorithm.” At this point, the differences are minor, 
but we will see that the bracelet symbolism clarifies the situation when we gener- 
alize, in Section 3, from multiplication (or division) by 2 to multiplication (or 
division) by an arbitrary number ¢. 

We will sometimes find it convenient to modify our way of looking at things by 
identifying each number a in V(b) with b — a. This is more in the spirit of the 
geometric construction, where we considered only acute angles. 


DEFINITION. Let D(b) = {a © Z|O0 <a < b/2 and (a,b) =1)}. The function 
g,: D(b) > D(b) is given by the rule 


(a) = (24 if2a<b/2 
S2\0) ~~ \b-2a if2a>b/2’ 


To obtain a visual model of the structure induced by g,, we need only draw a box 
around each pair (a, b — a) in the graphs obtained above (see FIGURE 6 for the 
situation when b = 17). 


CP ES 


FIG. 6. Dominoes for 5b = 17. 


We will call each such pair a domino, so that each bracelet can now be thought of 
simply as a cycle of dominoes. The cyclic geometric construction of Section 1 is 
represented by this graph: the domino containing a © D(b) corresponds to the 
acute angle az/b. Note that as a consequence of this view, we obtain our first easy 
fact about bracelets. 


PROPOSITION 1. The weight of a bracelet is always even. 


If one computes (either by hand or, more easily, with a computer) the bracelet 
structures for some small values of b, especially for b prime, then one is led to an 
almost inexhaustible set of questions and conjectures. (To provide the reader with 
some examples for our discussion, we display a brief set of data in the Appendix.) 
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For example, do all the bracelets for a given b have the same weight? Do they all 
have sizes of the same parity? Is there any way to “compute” the number of 
bracelets, B(2, b)? 

In the remainder of this article, then, we will explore such questions, with an 
emphasis on what can be said about the function B. In Section 3, we lay out the 
basic number-theoretic and group-theoretic way of looking at this problem, gener- 
alizing from 2 as the generator to an arbitrary ¢ relatively prime to an arbitrary 
b > 2. In Section 4 we give a formula for computing the number of bracelets, their 
weights, and the parity of their sizes. Restricting ourselves to prime b in Section 5, 
we look more closely at how ¢ and b determine the bracelet structure and 
specifically the number of bracelets. In Section 6 we extend the investigation to 
powers of primes and in Section 7 to arbitrary composite b; we will see that the set 
of prime factors of 5 tells essentially the whole story. Finally we close, in Section 8, 
with a few open-ended musings. 

In addition to the work of Hilton and Pedersen, we mention that of J. B. Roberts 
[13], whose work anticipates some of ours, and H. P. Lawther, Jr. [10], who applied 
some related ideas to the splicing of telephone cables. 


3. Some number theory and a slight generalization. In order to continue our 
investigation, it will be helpful to recall some elementary terminology and results 
from number theory and group theory. 

Fix a (not necessarily odd) number b > 2. First note that we need not restrict 
ourselves to working with the set V(b) of numbers relatively prime to b and less than 
b, because we are really working modulo b. Indeed, V(b) is simply a collection of 
representatives for the set Z}* of reduced residue classes modulo b, which forms a 
finite abelian group under multiplication. Recall that the order of Z}*, 1e., the 
cardinality of V(b), is denoted by $(b). 

For ¢ relatively prime to b, we consider the sequence ¢, ft’, t’,... . For some 
n > 0 we must have t” = 1 (mod 5); indeed, 1?) = 1 (mod b). The least such n is 
called the order of t mod b, and we shall denote it by o(t, 5). In fact, ¢? = 1 
(mod b) if and only if o(t, b) is a divisor of n. Now it might happen that in the 
sequence ft, ¢7, t?,..., the number —1 (mod b) occurs before 1 does; this possibility 
has fundamental significance for the problems we are studying. Thus we make the 
following definition (which appears in [6] and, under different names, in [10] and 
[13], as well). 


DEFINITION. Let ¢ and b be relatively prime integers, b > 2. The quasi-order of t 
mod b, denoted q(t, b), is the least positive integer k such that :* = +1 (mod b). If 
r7 >) = —] (mod 5), then we call t basic (mod b), and if t%*) = 1 (mod b), then 
we call t nonbasic (mod b). 


Note that g(t, b) must equal either o(t, b) or o(t, b)/2, since if +* = +1 
(mod b), then t7* = 1 (mod b). The following fact about quasi-order can be proved 
in a manner similar to the corresponding fact about order. 
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PROPOSITION 2. Let t and b be relatively prime integers, b > 2. Then t? = +1 
(mod Db) if and only if q(t, b) is a divisor of n. 


If o(t, b) = o(b), then the set {7, ¢*, 2°,...,¢°)} represents all the reduced 
residue classes mod b, and hence ¢ generates Z} under multiplication. In other 
words, Z/* is cyclic in this case. A well-known result of number theory, which we 
exploit in Section 5, states that Z* is cyclic if and only if b equals 2, 4, p” or 2p” 
for p an odd prime and n a positive integer. In any case, the powers of ¢t (mod 5) 
form a subgroup <t) of Z;}*. 

In what follows we need to look at the subgroup (t, —1) of Z* generated by t 
and —1. If ¢ is basic, then —1 € (ft), so in this case (t, —1) = (t) (hence our 
choice of the term basic). If t is nonbasic, then (t, — 1) contains all the elements of 
(t) together with their negatives. Thus in either case (but for different reasons), we 
obtain the following simple result. 


PROPOSITION 3. Let t and b be relatively prime integers, b> 2. Then 
{+1, +17, +27,..., 41%) is a set of representatives for (t, —1), and the cardinal- 
ity of (t, —1) is 2q(t, b). 


Returning to the terminology of Section 2, we see that the bracelet containing 1 is 
exactly a collection of representatives for (2, —1). Furthermore, we can construct 
bracelets for values of t other than 2 simply by generalizing the definition of f, (and 
that of g,), reducing all calculations modulo b. (Unfortunately, it no longer seems 
to be easy to characterize the charms.) 


DEFINITION. Let ¢ and b be relatively prime integers, b > 2. The function /: 
V(b) — V(b) and the function g,: D(b) > D(b) are given by the following rules: 


(a) = tamodb ifa<b/2 
Wa), ifa>b/2 


and 


(a) = (i mod b if ta mod b < b/2 
§i\ 4) = b-—tamodb if tamodb>b/2- 


As in the case t = 2, we let B(t, b) be the number of bracelets formed in the 
construction. Note that Proposition 1 remains valid. As an example, we have the 
bracelet in FIGURE 7 for b = 17 and t = 7; thus B(7,17) = 1 and q(7,17) = 8. 

Again, our function f, (or g,) and the bracelet graph replace the “quasi-order 
algorithm” and “symbol” of Hilton and Pedersen. Since all that is involved 1s 
multiplication modulo b, we avoid the cumbersome calculations and bookkeeping 
that they encounter when using the division-based approach when ¢ > 2. On the 
other hand, their symbol does immediately determine the quasi-order and a criterion 
for whether ¢ is basic (mod b). 
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Fic. 7. Bracelet (with domino structure) for b = 17, t = 7. 


4. A little group theory sheds some light. In Section 3 we saw that the bracelet 
containing 1 is just (a collection of representatives for) the subgroup (7, — 1). In this 
section we apply some rudimentary group theory to discover what the remaining 
bracelets are, what the number of bracelets signifies, and how the domino structure 
can be interpreted. To avoid the awkward construction in parentheses in the first 
sentence of this paragraph, we will identify elements of Z;* with their representa- 
tives and think of the bracelets as actually containing elements of Z;. 


THEOREM 1. The bracelets for a given b > 2 and t relatively prime to b are precisely 
the cosets of <t, —1) in Z*. They all have weight 2q(t, b), and the number of 
bracelets is given by 


o(b) 


Bled) = 2q(t, b) 


= Index of (t, —1) in Z#. 


Proof. For each a € Z*, the bracelet containing a consists of all numbers 
(mod b) of the form at’, i =1,2,.... Thus it consists of precisely 
{+at, tat?,..., tat”), which is the coset a(t, —1). The remaining statements 
follow from Proposition 3 and the definitions. 


We next turn to an analysis of the sizes of the bracelets, 1.e., the number of 
vertices in each bracelet which are not charms. We saw in FIGURE 5 that the sizes 
need not all be the same for a given b and ¢. On the other hand, the following 
theorem allows us to calculate their parity. 


THEOREM 2. The sizes of all the bracelets for a given b > 2 and t relatively prime to 
b have the same parity. This parity is even if t is basic (mod b) and q(t, b) is odd, or if 
t is nonbasic (modb) and q(t, b) is even; and otherwise this parity is odd. 
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Proof. Each step in a traversal of the cycle of a bracelet (i.e., the vertices which 
are not charms) corresponds either to multiplication by ¢ (with a move to the next 
domino) or to multiplication by —1 (staying within the same domino). If s is the 
size of the bracelet, then after s steps (and not before), we will have returned to the 
starting point. If we performed k multiplications by ¢ and s — k multiplications by 
—1, then we must have (mod 5) 


at*(-1)° “=a 


or simply 


tk=(-1)°“. 


Therefore, k must be g(t, b) and (—1)*°~* must be —1 or 1 according as whether 
t is basic or nonbasic (mod 5), independent of the particular bracelet, i.e., indepen- 
dent of s. Thus the parity of s — k, and, hence, of s, depends only on 6 and f, in 
the manner stated. (It is also possible to give a proof based on a generalization of 
Gauss’s Lemma due to Emma Lehmer [11].) 


Since the size of a bracelet plus the number of its charms is even by Proposition 
1, a statement similar to Theorem 2 holds as well for the number of charms. 

Finally, we note that D(b) can be viewed as a collection of representatives for 
the factor group Z}*/{ — 1), which is essentially just the set of dominoes. 


5. Some more number theory sheds some more light. In Section 4 we reduced 
questions about the sizes, weights, and numbers of bracelets to questions about 
g(t, b) and about ¢%"”) mod b, i.e., whether ¢ is basic or nonbasic (mod b). We 
now continue this study by classifying the bracelet structure into one of eight types, 
defined in terms of the sign of t%") mod b, the parity of g(t, b), and the parity of 
B(t, b). The following table shows the types, together with the smallest example in 
which ¢t = 2 and b is prime (the dashes in the last column indicate that—as 
Theorem 3 will state—these types are impossible for prime 5). 


TABLE 1. TYPES OF BRACELET STRUCTURES. 


Type 1. t basic q(t, b) odd B(t, b) odd (b = 3) 
Type 2. t nonbasic q(t, b) odd B(t, b) odd (b= 7) 
Type 3. t basic q(t, b) even Bct, b) odd (b = 5) 
Type 4. t nonbasic q(t, b) even Bit, b) odd — 
Type Sa. t basic q(t, b) odd Bt, b) even (b = 281) 
Type Sb. t nonbasic q(t, b) odd Bt, b) even (b = 73) 
Type Se. t basic q(t, b) even Bt, b) even (b = 17) 
Type 5d. t nonbasic q(t, b) even B(t, b) even — 


By using quadratic residues and the fact that Z}* 1s cyclic for certain values of b 


(including all odd prime numbers), we will now determine, for such values of 5, 
when each of these cases occurs. Recall that an integer x is a quadratic residue 
mod b if x = y” (mod b) for some integer y. 
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THEOREM 3. Suppose that Z}* is cyclic, b # 2,4 (i.e., b = p” or 2p", for p an odd 
prime and n a positive integer), and suppose that (t, b) = 1. Then 


n—-1 —] 
B(1,b) <2 (p—1) 
2q(t, b) 
Furthermore, 
a) if neither t nor —1 are quadratic residues mod b, then q(t, b) is odd, t%") = —1 


(mod b), (t) = (t, -1) # ( — t), and B(t, b) is odd (Type 1); 

b) if t is a quadratic residue modb but —1 is not, then q(t, b) is odd, t%) = 1 
(mod b), (t) # (t, -1) =« — t), and B(t, b) is odd (Type 2); 

c) if —1 is a quadratic residue mod b but t is not, then q(t, b) is even, t%@°) = —1 
(mod b), (t) = ¢t, -1) =¢( — t), and B(t, b) is odd (Type 3); 

d) if both t and —1 are quadratic residues modb, then B(t, b) is even (Type 5); if 
Type Sa occurs, then (t) = (t,-1) #¢ —¢t); if Type 5b occurs, then (t) # 
(t, -1) = ( — t); and if Type Sc occurs, then (t) = (t, -1) = « — 1); 

e) Type 4 and Type Sd cannot occur. 


Proof. Since b = p” or 2p”, it is easy to compute the order of Z*, namely 
o(b) = p"~'(p — 1). Thus the displayed equation follows from Theorem 1. 
a) Let the integer g be a generator of Z*, and write t = g” (mod db). Since ¢ is 
assumed not to be a quadratic residue mod b, r must be odd. Thus 19/2 = (—1)’ 
= —1 (mod b). We claim that, therefore, q(t, b), which must be a divisor of 
o(b)/2 by Proposition 2, is odd. Indeed, since —1, which is congruent to g?(’/?, is 
assumed not to be a quadratic residue mod b, it must be that $(b)/2 is odd. It 
follows, then, that 17°) = —1 (mod b), ie., ¢ is basic. Combining all this we see 
that B(t, b) is odd. Finally, since ¢ is basic, it 1s clear that ¢ generates (t, —1); on 
the other hand, since g(t, b) is odd, (—t)%"”) =1 (mod b), so —t does not 
generate (t, —1). 
b) The proof is similar to part (a), except that we use —?¢ in place of t. This works 
because, given that ¢ is a quadratic residue and —1 is not, —?¢ is not a quadratic 
residue. 
c) The proof is similar to part (a), except that since $(b)/2 is even (by the 
assumption that —1 is a quadratic residue), g(t, b) must absorb all the factors of 2 
in @(b)/2 in order for t?/* to be congruent to —1. Thus g(t, b) is even, and, 
again, B(t, b) is odd. Since ¢ is basic, (t) = ¢t, —1); and since, in addition, g(t, b) 
is even, ( — t) = (t, —1), as well. 
d) If g(t, b) is odd, then since $(b)/2 is even, B(t, b) is even, and the other 
statements follow as above. If g(t, b) is even, then 17%") = —1 (mod b), since 
otherwise 17°°")/2 would also be congruent to +1 (mod b), contradicting the 
definition of g(t, b). (This last statement uses the fact that the number 1 has exactly 
two square roots in Z* [14, p. 247].) Thus the order of ¢ 1s 2q(¢, 5). It follows that 
(t) = (t,-1) = ( — 1). Now since ¢ is a quadratic residue, ¢*°)/? = 1 (mod b). 
Therefore 2q(t, b) divides ¢(b)/2, and so 2 divides $(b)/(2q(t, b)) = B(t, b). 
e) If g(t, b) were even and ¢%*°) = 1 (mod b), then 1%" ")/2 = +1 (mod 5), as in 
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the proof of (d). This contradicts the definition of g(t, b), so Type 4 and Type 5d 
cannot occur. (We will see later that they can occur when Z} is not cyclic.) 


We note the following corollary of Theorems 2 and 3. 


COROLLARY 1. Suppose that Z} is cyclic, b # 2,4 (i.e., b = p" or 2p", for p an 
odd prime and n a positive integer), and suppose that (t, b) = 1. Then there are an 
even number of bracelets if and only if both t and —1 are quadratic residues mod b. 
Furthermore, the sizes of the bracelets are even if neither t nor —1 are quadratic 
residues, and odd if exactly one of t and —1 is a quadratic residue (the sizes may be 
either even or odd if both are quadratic residues ). 


The determination of the quadratic residue status of 2 and —1 mod 5 is always 
very simple in the cases we are considering. The following proposition is a standard 
exercise in number theory (see [14, pp. 254 and 256] for the flavor of the arguments 
involved). 


PROPOSITION 4. If Z* is cyclic, b # 2 or 4, then —1 is a quadratic residue mod b 
if and only if b =1 or 2 (mod 4). If b = p” for p an odd prime and n a positive 
integer, then 2 is a quadratic residue mod b if and only if p = 1 or 7 (mod 8). 


Combining Proposition 4 with Theorem 3, we see that, for t = 2 and b prime, 
types 1, 2, 3, and 5 occur when 5b is congruent to 3, 7, 5, and 1 (mod 8), respectively. 
Since by Dirichlet’s Theorem [14, p. 375] there are an infinite number of primes in 
each of these congruence classes, we see that each of these types occurs infinitely 
often. In particular, for t = 2 there are at least two bracelets for infinitely many values 
of b. In case t is some number other than 2, results similar to Proposition 4 can be 
obtained using the Law of Quadratic Reciprocity, and again we will have an infinite 
number of occurrences of each of types 1, 2, 3, and 5. We leave it as an exercise to 
show that the residue class of b mod12 determines the type when b is prime and 
t = 3. 

Two obvious questions about the number of bracelets arise when one looks at the 
data in the Appendix: is there only one bracelet infinitely often, and can there be an 
arbitrarily large number of bracelets? In the remainder of this section, we will 
consider these questions for prime b. 

The answer to both questions is an easy “yes” if we are willing to think of B asa 
function with two arguments, rather than thinking of ¢ as a fixed parameter. 


THEOREM 4. For infinitely many pairs (t,b) with b prime, B(t,b)= 1. As a 
function of two variables, B(t, b) is unbounded, even with b restricted to being prime. 


Proof. For the first claim, we need only observe that for prime b, we can choose 
the integer ¢ to be a generator of the cyclic group Z?. For the second, we can make the 
number of bracelets arbitrarily large simply by taking ¢ to be b — 1, since then the 
quasi-order of ¢ will be 1, and so the number of bracelets will be (b — 1)/2 by 
Theorem 1. 
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The questions become much harder if we fix ¢ (still insisting that b be prime). By 
Theorem 3, B(2, b) = 1 if and only if the integer 2 (or in some cases —2) is a 
generator of Z*. Number theorists call 2 (or —2) a primitive root in this case. It is 
conjectured, but not known, that 2 is a primitive root for infinitely many primes. If 
this conjecture were true (in a somewhat stronger form, since we would need to 
restrict ourselves to Type 1 or Type 3), then we could conclude that there is only 
one bracelet infinitely often. Similar statements could be made for other values of ¢. 

Another long-standing conjecture is that there are an infinite number of primes p 
for which b = 2p + 1 is also prime [15, p. 129]. If p and b are such primes, then it 
is easy to show that g(t, b) = p, and therefore that B(t,b) = 1, for all ¢ not 
congruent to +1 (mod b). In particular, if the conjecture is true, then for each t > 1 
we would have B(t, b) = 1 infinitely often. 

At the other end of the spectrum, B(?, b) is large when q(t, 5) is small compared 
to b. The smallest g(t, b) occurs when ¢%”) = b + 1, ie, when b = t%%) + 1 or 
b = ¢%5) — 1. (There is currently a great deal of interest in the prime factorization 
of ¢* +1 for small ¢ and large k; see [1].) In the particular case of t = 2, the 
questions of whether 2 + 1 are prime are well known. Prime numbers of the form 
2* + 1 are the Fermat primes, and only five of them are known: 3, 5, 17, 257, and 
65537. Thus, for example, B(2,65537) = 65536/32 = 2048. Prime numbers of the 
form 2* — 1 are the Mersenne primes, and only about 30 are known. (The largest 
Mersenne prime currently known is 27!°°! — 1.) Thus, for example, there are 315 
bracelets when b is the Mersenne prime 8191. It is not known whether there are an 
infinite number of Fermat or Mersenne primes. If there are, then clearly B(2, b) is 
unbounded for prime b. We can still derive this conclusion, however, by looking at 
the prime factors of the Fermat numbers. 


THEOREM 5. Let t be a fixed integer greater than 1. Then B(t, b) is unbounded for 
prime numbers b. 


Proof. We use the following well-known lemma (whose proof is not hard—see 
[15, p. 343], for example): If p is an odd prime divisor of t? + 1, then p = 2"*1i + 1 
for some positive integer i. Now for any positive integer M we can guarantee that 
2"*17 + 1 is not prime for 1 <i <M by taking n = []”,$(2i + 1). Indeed, since 
2?@:+)) = 1 (mod 2i + 1), we have 2” = 1 (mod 2i + 1), and, therefore, 2"*1i + 1 
= 2i+ 1=0 (mod2i + 1). Thus we can take b to be an odd prime divisor of 
t*" + 1, which must exist since even if ¢t* + 1 is not odd, it is congruent to 2 
(mod 4). By the lemma and the choice of n, we know that b = 2”*'i + 1 for some 
i > M. On the other hand, the quasi-order of ¢ is at most 2”, since 17 = —1 
(mod b), so by Theorem 3, B(t, b) is at least (b — 1)/2"*' =i > M. 


6. Powers of primes. In the last section we looked at the quasi-order g(t, b) and 
the number of bracelets B(t, b), especially for odd prime b. We now make some 
further progress in the case in which b is a prime power. Of course Theorem 3 
applies in the case of powers of odd primes. Here we want to relate g(t, b) to 
q(t, p) and B(t, b) to B(t, p) when b = p” where p:p is an odd prime, and also to 
compute g(t, b) and B(t, 5) when b = 2”. 
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We first state without proof a fundamental result relating the order of t mod b to 
the order of ¢ mod pb [12, p. 364]. We need to assume at least that t # +1 here, but 
for simplicity we will assume that ¢ > 1. 


PROPOSITION 5. Let b = p" and b’ = p"*", where p is an odd prime and n > 1. 
Assume t > 1 and (t, b) = 1. Then o(t, b’) is equal to either o(t, b) or po(t, b), and 
the second possibility holds precisely on the set of alln > N(t, p) for some N(t, p) > 1 
depending on t and p. 


In other words, as n increases, the order of t mod p” eventually increases by the 
same factor as p” increases, i.e., eventually ¢(b)/o(t, 5) becomes constant as n 
increases. In fact, it seems to be only rarely that ¢(b)/o(t, b) increases at all, even 
as b goes from p to p’. Primes p for which this does occur seem not to have been 
given a name in the number theory literature, so we will call them Wieferich primes, 
after an early twentieth-century mathematician who studied them in connection 
with Fermat’s Last Theorem [16]. 


DEFINITION. A prime p is called Wieferich with respect to ¢ if the order of 
t mod p” is the same as the order of ¢ mod p. 


The only Wieferich primes with respect to ¢t = 2, less than 31 million, are 1093 
and 3511 ([{8] gives the complete—and very small—table of primes known (in 1965) 
to be Wieferich with respect to prime ¢ < 43). 

We now show that the behavior of the order of t mod p” as n increases extends 
also to the behavior of the quasi-order, and hence that the number of bracelets for 
b = p” becomes constant for large enough n. 


THEOREM 6. Let b = p” and b’ = p"*!, where p is an odd prime and n> 1. 
Assume t > 1 and (t, b) = 1. Then 


g(t, b’) ~~ o(t, b’) _ ifn < N(t, p) 
q(t,b) — o(t, b) 


- p ifn>N(t, p) 
where N(t, p) is the number guaranteed by Proposition 5. 


Proof. The final equality follows from Proposition 5. As for the first, because of 
the remark above Proposition 2, there are two cases to consider. 


i) Suppose q(t, b) = o(t, b) = k, so t* = 1 (mod b). Hence k must be odd (other- 
wise t*/2 = +1 (mod b), as in the proof of Theorem 3.d). By Proposition 5, o(t, b’) 
is also odd. Therefore, since g(t, b’) cannot be o(t, b’)/2, it must equal o(t, b’), 
and the desired equality follows. 

ii) Suppose g(t, b) = o(t, b)/2 = k. Thus o(t, b) = 2k, and so o(t, b’) is also even 
by Proposition 5. Therefore by the same reasoning as in (i), g(t, b’) must be 
o(t, b’)/2 and not o(t, b’), and the desired equality follows. 


COROLLARY 2. Let p be an odd prime and t > 1 a number not divisible by p. If p is 
not Wieferich with respect to t, then 


B(t, p") = B(t, p). 
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In any case, 


B(t, p") = pmin(n NC PY-IB(z, DP); 
where N(t, p) is the number guaranteed by Proposition 5. 


In other words, the number of bracelets for a prime power is eventually (and 
nearly always, it seems) independent of the power. 

We now turn briefly to the case b = 2", with n > 2. Thus Z} is not cyclic (unless 
b = 4), but it turns out [14, p. 205] that Z*/( — 1) is. In contrast to Theorem 3, 
here usually neither ¢ nor —? generates the bracelet containing 1 (i.e., the subgroup 
(t, —1)). We summarize what happens in this case but omit the proof. In terms of 
the classification given in TABLE 1, all types are possible here except Type 3 and 
Type 5c; almost always it is Type 4 or Type 5d. 


THEOREM 7. Let b = 2”, n > 2, and let t be an odd integer. Then 


B(t, b) = [= 


tt1b 
4 a) 


where the sign is chosen so that (t + 1)/4 is an integer. Also, 


qn-2 
t,b) = ; 
Furthermore, t is nonbasic mod b unless t = —1 (modb),. 


7. The general composite case. We look finally at what can be said about g(t, b) 
and B(t, b) for b composite. 

For simplicity, we will carry out the analysis in the case b = p, p,, where p, and 
Pp, are distinct odd primes, but the argument will generalize to yield Theorems 8 and 9 
below. Let (¢, 5) = 1, and let r, and r, be the quasi-orders of ¢ mod p, and 
mod p,, respectively. We will need to compare the highest powers of 2 dividing r, 
and r,, sO we write r, = 2" rj and r, = 2*2rj, where rj and ry are odd. Our goal is 
to determine g(t, b) and B(t, b) in terms of q(t, p;) and B(t, p;), i = 1,2. 

We first recall what is essentially the uniqueness part of the Chinese Remainder 
Theorem. 


PROPOSITION 6. If a = b (modm,) fori = 1,2,...,n, and if the m, are pairwise 
relatively prime, then a = b (modm,m, -:- m,). 


Since ¢*" = 1 (mod p,), clearly ¢7!’» 2) = 1 (mod p,). Hence 7!» "2] = 1 (mod b) 
by Proposition 6. Therefore, g(t, 5) is a divisor of 2[7,, r,]. On the other hand, since 
a ©) = +] (mod b), we have 17”) = +1 (mod p,) for i = 1,2. Thus by Proposi- 
tion 2, g(t, b) is a multiple of both r, and r, and hence of their least common 
multiple. Combining these two statements, we see that either g(t, b) = [r,, r,] or 
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q(t, 6) = 2[r,, r]. To determine which formula applies, we need to look at the four 
cases. 


Case 1. t" = 1 (mod p,) for i = 1,2. Then ¢!»"2! = 1 (mod b) by Proposition 6, 
so g(t, b) =[n, 1]. 


Case 2. t" = —1 (mod p,) for i = 1,2. If r, and r, have the same highest power 
of 2 as a factor, 1e., x; = x, in the notation established above, then [7,, 7]/r, is 
odd, so t!»"2! = —1 (mod p,), for i = 1,2. Thus ¢!» "2! = —1 (mod 5) by Proposi- 
tion 6, and hence q(t, b) = [r,, 1]. If, however, x, # x,, then one of [7,, 7,]/r, and 
[r1, %]/r is even and the other is odd, so ¢!"»") has different values reduced 
modulo p, and modulo p). Therefore ¢!""’2/ cannot be congruent to either 1 or —1 
(mod bd). It follows that g(t, b) = 2[r,, r,] in this subcase. 


Case 3. t". = —1 (mod p,) and t” = 1 (mod p,). Using the same reasoning as 
in Case 2, we see that g(t, b) = [r7,, m] if x, < x, and q(t, 6) = 2[r,, r,] if x, > xp. 

Case 4. t" = 1 (mod p,) and t” = —1 (mod p,). The situation is similar to 
Case 3. 


As a corollary of this analysis, we see that, except for a factor of (r,, 7,) and a 
possible factor of 2, the “number of bracelets” function is multiplicative. Indeed, 
since B(t, b) = $(b)/(2q(t, b)), we see that if g(t, b) = [r,, %], then 


_o(b) | _ (ri — Yr— 1) = 2(r,,%)B(t, p,) B(t, po), 


B(t,b) = 2[ 7,4] 2ryh/(1; 1%) 


and if g(t, 5) = 2[7,, 7], then 


Bt, b) — Faia ~ Pet _ (r,, r,) Bt, p,) Bt, P2). 


As one example, let ¢ = 2, p,; = 5, and p, = 7. Then r, = 2, 7, =3, 1" = -1 
(mod p,), and t’? = 1 (mod p,). Case 3 applies, with x, > x,, so g(2, 35) = 2[2, 3] 
= 12, and B(2,35) = B(2,5)B(Q,7) = 1. For another example, suppose ¢ = 2, 
Pp, = 3, and p, = 11. Then we are in Case 2 with 2! = —1 (mod 3) and 2° = —1 
(mod 11), so g(2, 33) = [1,5] = 5 and hence B(2, 33) = 2 B(2, 3) B(2, 11) = 2. 

Nothing in the foregoing discussion except the calculations of B(t, b) required 
that p, and p, were odd primes, only that each was greater than 2 and ( p,, p,) = 1. 
Thus, generalizing in a straightforward way to arbitrary b, we obtain the following 
theorems, which give formulas for computing quasi-orders and numbers of bracelets 
for composite numbers in terms of these invariants for their prime-power factors. 
(In Corollary 3 we see that the number of bracelets ultimately does not even depend 
on the powers.) All eight types (see TABLE 1) are possible. We leave verification of 
the details to the reader. 


THEOREM 8. Let b = [1?_, py, where the p, are distinct primes dividing b and 
b # 2 (mod 4); or let b = 2[17_, p?, where the p, are distinct odd primes dividing b. 
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Suppose t > 1 and (t,b) = 1. Let r, be the quasi-order of t mod p;', and for each i 
write r, = 2™r/, where r/ is odd. Then 


a) if t is nonbasic (mod p;') for all i, then t is nonbasic (modb), and q(t, b) = 


(11, To5- 0-9 Nal 
b) if t is basic (mod p?) for alli, then 
1) ifx,; =X, = ++: =x,, thent is basic (modb) and q(t, b) = [r, %),---, Tal, 
and 
li) otherwise, t is nonbasic (mod b) and q(t, b) = 2[1, %,..-, Ml; 


c) if tis basic (mod p;') for some i, and t is nonbasic (mod p’:) for some j, then t is 
nonbasic (mod b), and 


1) if max(X,, X5,.-.,X,) = x; for some i for which t is basic (mod p;'), then 
g(t, b) = 2[r,, 45,.--,7,], and 
ll) if max(X1, X,.-.,X,) > x, for every i for which t is basic (mod p;'), then 


q(t, b) — [n1, Vyy 0205 Vy) 


THEOREM 9. Let b =[I?_, p*', where the p, are distinct primes dividing b and 
b # 2 (mod 4); or let b = 21 1}?_, pj, where the p, are distinct odd primes dividing b. 
Suppose t > 1 and (t, b) = 1. Let r, be the quasi-order of t mod p*. Then 


aL) eee Yr, 


B(t, b) = 2”-17-* ———__. 
( ) [ris Moye Mel 


"Bt, ps), 


where e is either 0 or 1, depending on which of the cases in Theorem 8 applies: « = 0 
in cases (a), (b.i), and (c.ii), and ¢ = 1 in the cases (b.ii) and (c.1). 


Note that Theorem 9 also provides a simpler proof that B(t, b) is unbounded as 
a function of (not necessarily prime) b, since we need only take n large to make 
B(t,b) large. 

Finally suppose b = []?_, p/, where the p, are distinct primes dividing b, with 
t > 1 and (t, b) = 1. If the exponents e, are large enough, then it is not hard to 
show by Theorems 5 and 6 and Corollary 2 that all the terms on the right-hand side 
of the displayed equation in Theorem 9 are independent of the exponents. Thus we 
have the following Corollary—which we find most remarkable—that the number of 
bracelets for b is independent of the exponents in the prime factorization of b once the 
exponents get large enough. 


COROLLARY 3. Let Pj, P>,---, P, be distinct primes, none of which divides t > 1. 
Then there exist constants E,, E,,..., E,, and C (depending on p,, Py,-.-, Py, and t) 
such that if b = \'_, p?' withe,; > E; fori =1,2,...,n, then B(t, b) = C. 


As a comprehensive example, let us study b = 720, t = 7. Since 720 factors as 
24375, we first apply Theorem 7 to obtain B(7,16) = 2, g(7,16) = 2, and 77 =1 
(mod 16). Then we apply Corollary 2 (since 3 is not Wieferich with respect to 7) to 
obtain B(7,9) = B(7,3) = BY,3) =1 and g(7,9) = $(9)/2 = 3; and we apply 
Theorem 3b (or calculate directly) to obtain 7? = 1 (mod 9). Finally we compute 
that B(7,5) = B(2,5) = 1, 9(7,5) = (2,5) = 2, and 7* = —1 (mod5). If we com- 
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bine all of these by Theorem 8.c.i, then we get g(7,720) = 2[2,3,2] = 12 and 
7'* = 1 (mod 720); and applying Theorem 9, we see that B(7,720) = 2?-1-1(2- 3 - 
2/{2,3,2])- 2-1-1 = 8. On the other hand, to see the force of Corollary 3, we 
have that if e, > 5, e, > 1, and e, > 2, then B(7,2°3°25%) = 80, since Theorem 
8.c.11 applies and 5 is Wieferich with respect to 7 (N(in fact 7,5) = 2). 


Added in proof: We note that Theorem 8 answers a question raised by Man 
Keung Siu [17], who obtained for the case ¢ = 2 a sharpening of a part of Theo- 
rem 3. 


8. Afterword. We leave the reader with a comment, a generalization, and an 
Open question. 

In some sense our investigation (except for Section 7 and the end of Section 6) 
has focused on a structure which is extremely well understood: the cyclic group. 
Nevertheless, interesting questions emerged, and the progress was somewhat im- 
peded by famous unsolved problems in elementary number theory, such as whether 
2 is a primitive root for infinitely many primes. We found it fascinating that this 
problem touched so many corners of elementary number theory. It could serve as a 
fruitful area of “research” for an undergraduate following an introductory number 
theory course. 

The quasi-order of t mod b was defined to be the least positive integer k such 
that ¢* represents an element of the subgroup {—1,1} of Z*. More generally, for 
other subgroups S of Z*, we could define the quasiorder of ¢ mod b relative to S. 
Does anything interesting emerge? 

Looking at the Appendix, we note that the sizes of the bracelets for a given b 
tend to be roughly equal, and that the bracelet containing 1 is usually the bracelet 
with the smallest size. Are any theorems lurking here? 


Appendix: Some data. Listed here are the number of bracelets and the weights 
and sizes of the bracelets, for ¢ = 2, b odd, 3 < b < 99. The size of the bracelet 
containing | is marked with * when there is more than one bracelet. 


b B(2, b) weights SiZeS 
3 1 2 2 
5 1 4 3 
7 1 6 5 
9 1 6 4 
11 1 10 8 
13 1 12 9 
15 1 8 6 
17 2 8,8 5*,7 
19 1 18 14 
21 1 12 8 
23 1 22 17 
25 1 20 15 
27 1 18 14 
29 1 28 21 
31 3 10, 10, 10 7*,7,9 
33 2 10,10 6*,8 
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b B(2, b) weights sIZes 
35 1 24 18 

37 1 36 27 

39 1 24 18 

41 2 20, 20 13*,17 

43 3 14,14,14 10*, 10,12 
45 1 24 18 

47 1 46 35 

49 1 42 31 

51 2 16, 16 12*,12 

53 1 52 39 

55 1 40 30 

57 2 18,18 12*,14 

59 1 58 44 

61 1 60 45 

63 3 12,12,12 8*, 10, 10 
65 4 12,12,12,12 7*,9,9,11 
67 1 66 51 

69 1 44 32 

71 1 70 53 

73 4 18, 18,18, 18 11*, 13, 15,15 
75 1 40 30 

77 1 60 44 

79 1 78 55 

81 1 54 40 

83 1 82 62 

85 4 16, 16, 16, 16 10*,12,12,14 
87 1 56 42 

89 4 22, 22, 22, 22 15*,15,17,19 
91 3 24, 24, 24 16, 18*, 20 
93 3 20, 20, 20 14*, 14,16 
95 1 72 54 

97 2 48, 48 35*, 37 

99 2 30, 30 22, 24* 
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Integrals, an Introduction to Analytic Number Theory 


ILAN VARDI, Stanford University 


ILAN VARDI: I got my Ph.D. in Number Theory from M.I.T. in 1982, as a 
student of Dorian Goldfeld. I then spent a year at the Institute for Advanced 
Study. I was an acting assistant professor at Stanford from 1983 to 1985. 
After realizing that not everybody cared about Kloosterman Sums, I learned 
how to use a computer and tried out some applied math. I’m now interested 
in special functions related to determinants of Laplacians. 


1. Introduction. An examination of Gradzhteyn and Ryzhyk’s book of integral 
tables reveals a large number of difficult and obscure integral formulas. In my 
opinion one of the most remarkable is given on p. 532 


(al 
a] 


I(s) = fre dt, s>0O 


7 mi 
J og log tan x dx = 5 108 v2a |, (1) 
4 


a/ 


where 


is the classical [’-function. The reference given is to Bierens de Haan [2]. Failing to 
locate the proof of this formula, I decided to study equation (1) in some depth. It 
turns out that this formula requires some fairly involved analysis to prove, and also 
serves as a good example of how nontrivial number theory can be embedded in an 
integral formula. 
The key to equation (1) is the Dirichlet L-function 
1 1 1 1 1 
= —_—_ — + — — — eee 
(s) 35 5° 7° 
This is a well-known function; for example every calculus student knows the 
formula 


LU 1 _— 
= ——-4ftr-ee- = -, 
(1) 3. 5 4 


Also, by the alternating series test L(s) converges for 0 < s < 1. However, much 
more is known and Hurwitz proved that L(s) can be analytically continued to an 
entire function in the whole complex plane. He did this by proving the functional 
equation 


L(1—s) = [=) sin r(s) Ls) (2) 
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What we will, in fact, show is that 


nm /2 d 
f log log tan x dx = ZEW) 


gm /4 s=l 


Invoking the well-known formulas 
rq) =1 
I'’(1) = ~7, 
where y is Euler’s constant, 
1 


] | 
l+24+254+-:-4+- 
2 3 n 


n—- oo 


y= lim | 
equation (3) becomes 


a/2 w , 
f loglogtan xdx = ~y— + L’(1). 
m/4 4 


309 


(3) 


— log | = .577215664901532860606512..., 


So the proof of equation (1) will consist of 2 parts: a) establishing (3) b) 


expressing L’(1) in terms of logarithms of I’-functions. 


2. Proof of equation (3). We begin with a general Dirichlet series 


rs) =  L, 


n=1 n 


which, if f is of polynomial growth, will converge absolutely in a half-plane 
Re(s) > c. We now use the technique first developed by Riemann to study the 


Riemann §-function 


C(s) = et ldt= e-™ (nt)? d(nt , 
SO 


I'(s oe) 
ts) =| ets! dt. 
0 


Hence, by absolute convergence, one gets that for Re(s) > c 


2 f(n) 


AY 
=) 0 


n 


T'(s)F(s) =T(s) = E s(n) [ene dt 


| 
© 
8 
a 
Ms 
—s 
on 
x 
eee” 
av) 
= 
ae 
1 
2. 


Now let z = e‘, this gives 


r(s)F(s) = f [E rim)z")[toe) 


1 
0 
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Now we add the restriction that f(n) be a periodic function. That is, there is a 
positive integer g such that f(n + q) = f(n) for all n (for technical reasons also 
assume that f(q) = 0). With these assumptions we have that for |z| < 1 


oe oe q-1 
Lsf(njz"= Led f(mg + near 
n=] m=0 n=1 
q-1 
LMM? aC, f) 
a ee ee eee 


where 
q-1 
P(z,f)= di f(n)z”. 
n=] 
We have thus obtained the formula: 


PeAlloes] ds 
F(s)T(s) = f er ee (4) 


Z 


This formula was first obtained by Dirichlet (see [3]) to derive his class number 
formula of which L(1) = 7/4 is the simplest case. Differentiating equation (4) by 


Leibniz’s rule gives 
1 \s7 1 
[tog — | 


“ F(s)P(s) = [Pf — > logloe| ~| = 


Z 
Now if F(s) converges absolutely at s = 1 this will yield 
1 1 \ dz 
F’(1) — yF(1) = { P(z, flogloe| =) S. (5) 
0 z} Zz 
To prove equation (1) we let g = 4 and pick f(n) to be the quadratic character 
(mod 4) that is 
0 if n =0(mod4) 
1 ifn =1(mod4) 
0 ifn =2(mod4) 
—1 if n= 3(mod4) 


X4(n) = 


x4 1s called the quadratic character (mod 4) because for (1, 4) = 1 


x4(”) _ | ] if dx st. x? =n (mod 4) 
—1 otherwise, 


while x,(”) = Oif (”, qg) > 1. 
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So we have 
P(z,x)=z-2° 
and equation (5) becomes 


1 
La) 7 ; (z— z* )log log — ds 
(I) yz =| — 


1 —z* Z 
1 1 dz ian | du 
= [toe os| = ar in I og og us a) 
n /2 
= f log log tan x dx. 
a /4 


3. Evaluating L’(1). It turns out that it 1s much easier, first, to evaluate L’(0), 
then use the functional equation L(s) — L(1 — s) to obtain the value for L’(1). 
To compute L’(0) we follow a method due to André Weil [7]. Let 


((s,a)= 


n=l 


———3 > O<a<l, 
(n+ a) ¢ 


be the Hurwitz §-function. It is easily shown to converge for Re(s) > 1. Using the 
integral formula [8] 


e 4 


P(s)s(a,s) = fo 


one can show that §{(a, s) can be analytically continued to the whole complex plane 
with only a simple pole at s = 1. The relevance of {(a, 5) is due to the formula 


uae) “He 


thus evaluating ¢’(0, a) will yield the value of L’(0) (for ease of notation we have 


te! dt 


written ¢’(s, a) to denote 3. §(s, a)). Weil’s observation is the following: note that 
s 


fors>1 


1 
f(s,a+1)=f(s,a)- — 


as’ 
thus 
{'(s,a+1) = &(s,a) + a~‘loga, 
and at s=0 
§(0,a + 1) = £(0, a) + loga. 
Letting 


G(a) = O49, 
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we See that G(a) satisfies the functional equation 
G(a+1)=aG(a). 


Further, one has that 


W ogG(a)= Si — = +0, fora>0 
—~ log G(a) = ——, > 0, fora> 0, 
da’ nai (n+a) 


and that 


G(a) is analytic for a > 0. 


These however are the exact conditions for the Bohr-Mollerup Theorem for the 
uniqueness of the Gamma function [1]. Thus one has that 


G(a) = G(1)T(a). 


One sees that G(1) = £’(0,1), and on noting that {(s,1) = §(s), where {(s) is the 
Riemann ¢-function, one has 


G(1) = ¢’(0). 
It is well known that §’(0) = —(1/2)log27 (e.g., [6], [8]), and so 
l(a) 


§’(0, a) = log aa 


Substituting this in the formula for L(s) one derives 


L’(0) = log — L(0)log 4. 


By the functional equation and L(1) = 7/4 one gets that 


1 
L(0) = * 
And once again by the functional equation 
3 
2d 1 {3 loa 
— GUE) _ = 5 los4 + ane + 5 loss» 
4 


and thus 


7 
L'(1) = y> + Flog ( 
rl — 

4 


This concludes the proof of equation (1). 
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4. More formulas! There are actually quite a number of identities in Gradzhteyn 
and Ryzhyk similar to (1). For example, there are 


2 
1 1 dx T r(5) 
freetos| —) = ——log 
0 l 


x/ltx4+x? J3 


1 l a on > 2 | r{2 2 
[ este) ae g log2m — log (|). page 572. (7) 


One sees that in equation (6) 3 plays the “key role” and in equation (7) 6 is the 
“magic number.” To explain this one introduces Dirichlet characters (mod q) 


x is a Dirichlet character (mod q) if 


x(1) = 1 
x(n +4) = x(n) Vn 
x(n) =0 if(n,g)>1 


x(mn) = x(m)x(n) Ym, n. 


The corresponding Dirichlet Z-function is 


L(s,x) = x xu , Re(s)>0 


n=] 


and can be continued to an entire function if x is not the trivial character 
Xo(n) =1 if(n,q)=1 


Now the analogous character to y, in equation (6) is the quadratic character 
(mod 3) 


0 if n = 0(mod3) 
xX3(n) = 1 if n= 1 (mod3) 
-1 ifn =2(mod3), 


and in equation (7) the corresponding character is the quadratic character (mod 6) 
xX .(”). Hence we have the Z-functions 


L( 1 1 1 1 
; = — — + — — —..- 
s x3) 2s 4s 55 


1 are 
= — —- + —_— ee e@ 
5, Xe) 5° 7 


314 ILAN VARDI [April 


The proofs of (6) and (7) are completely analogous to our proof of equation (1). 
One can further explain how the numbers 3,4,6 play the key roles in our 
formulas. First rewrite equation (1) in the same form as (6) and (7) 


| | ] dx WT 
i oslo Tae 2 8 rs 
4 


Note that the solutions of x? +1 are 4th roots of unity, i and —i, and one 


explains why L(s, x,) is involved by noting that it can be shown from the Quadratic 
Reciprocity Theorem that 


SQ (5) = L(s,x4)S(s), 


where §q;)(5) is the Dedekind zeta function of the field Q(i), and the classical 
definition (e.g., [5]) of the Dedekind ¢-function of the number field K is 


1 
fx (s) _ X N(A)*" 
A ideal 


Similarly, x? + x +1 is the irreducible polynomial for the 3rd roots of unity, 


—1/2 4 , and, as above, L(s, x;) appears because 


Sou/=3)(5) = L(s,x3)S(s). 


Similarly, x? — x + 1 gives the 6th roots of 1, so, as above, one expects L(s, x.) to 
play the central role. 


5. Exercises. 


1) Show that 


e oe log n 
[ log(—loglog y) dy = — 7 Ye. 
1 n=] Nn. 
Hint: consider 
L y — 
p(s) = J (n~ 1)tn- 


2) Find a similar formula for 


[“ los( —loglog log y) dy. 


4 
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Dice with Fair Sums 


LEWIS C. ROBERTSON: B.S., M.S. (University of Chicago), Ph.D. (UCLA). 


RAE MICHAEL SHORTT: I received a B.A. in mathematics at Amherst College 
in 1978 and my doctorate at M.I.T. in 1982. 

My interests include measure theory, probability, and descriptive set 
theory. 


STEPHEN G. LANDRY: I received my B.A. in mathematics from the University 
of New Haven in 1984. Since then I have been a graduate student at 
Wesleyan University. 


Consider the following situation. We are given two n-sided dice with the numbers 
1 through n on their sides. These dice have the property that when both are rolled 
independently, the sum of the numbers showing behaves (as a random variable) as if 
the two dice were each fair. Can we conclude that each die is in fact fair? It 
surprised the authors to discover that the answer is sometimes no, depending on the 
particular value of n being considered. What follows is an exposition of this 
discovery. 

Let X be a random variable. Suppose that X takes on only finitely many values 
and that each of these values is a positive integer. Let n be the largest integer such 
that P(X =n) > 0. We then call X an n-sided die. Define the polynomial p,(x) = 


ro tryx +--+ +r,_1x"—4, where 
P(X =k +1) 
r, = —— >. fork = 0,1,...,n—1. 
P(X =n) 


We call py the companion polynomial for the random variable X. It is easy to see 
that a polynomial is the companion polynomial for some die if and only if it is 
monic and has nonnegative coefficients. Then 


r 
P(X =k +1) = ————*— fork = 0,1,...,n—-1. 
Yotret+ oes +h, 4 


What we have called a companion polynomial is simply a normalized version of the 
generating function (see [1], Chapter 11). An n-sided die is fair if its companion is 
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the cyclic polynomial 
f(x)=1tx--- +x" 1 
LEMMA 1.1. Let X and Y be, respectively, n- and m-sided dice with companion 


polynomials py and py. If X and Y are independent, then W= X + Y is an 
(n + m)-sided die whose companion polynomial is p (x) = xp y(X)py(%). 


Proof. We put 
Py(x) =mtrnxters try! 


Py(x) =Sygtsyxt--- +s, x7} 
Py(x) py(x) =ty +t x+--- +t, 9xntm-? 
Pw x) =Ujptuyxt-::: + Uy pm xem), 
Then for k = 0,1,...,1n+m—1, 
P(W=k+1) 2 P(X=iandY=k+1-i) 
uy, = P(W=n+m) ~ 2d — p(Wantm) 
" P(X=i) P(Y=k+1-i) 
~ P(X=n) PVs m) 


n—l 


n 
Shy — » PSp-1-1° 
i=] 1=0 


Note that because n and m are the greatest possible values of X and Y, respectively, 
it follows that 


P(W=n+m)=P(X =nand Y=m) = P(X =n)P(Y=™m). 


Above, we have examined all the ways in which X + Y can equal k + 1, and then 
used the independence of X and Y. Multiplication of the polynomials py and py, 
yields 


n—-1 


t,4 = » 1S. 43 
1=0 


where we interpret the r, = 0 for i > n or i < 1 and s, = 0 fori > m or i < 1. The 
equality ¢, _, = u, implies that 


Pw(x) — xp y(x)py(x). 


Call a polynomial p(x) palindromic if its coefficients are the same when read 
forwards and backwards. Equivalently, we may define a polynomial p of degree n 
to be palindromic if p(x) = x"p(x~'). It is not hard to see that the product of two 
palindromic polynomials is again palindromic. 

If two random variables X and Y have the same distribution (or “law’’), we write 
L(X) = L(Y). Note that two dice have the same distribution if and only if they 
have the same companion polynomial. 
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The following result answers one half of our questions about dice whose sum is 
fair. 


THEOREM 1.2. Suppose thatn © {1,2,...,9} U {11,13}. Let X and Y be indepen- 
dent fair n-sided dice. Let U and V be independent dice, each of which has <n sides. 
Suppose that L(U + V) = L(X + Y). Then U and V are fair n-sided dice. 


Proof. This involves a certain amount of case-checking. For example, suppose 
n= 6. Let py, Py, Py, py be the companion polynomials for X, Y, U,V, respec- 
tively. The companion polynomial for X¥ + Y and U + V is (by lemma 1.1) 


xpy(x)py(x) = xpy(x) py(x) = x[fe(x)]’. 


Now f(x) = (1 + x)q,(x)q,(x), where g,(x)=1+x+ x? and g,(x)=1-x 
+ x* are irreducible quadratic polynomials. Considering the degrees of these 
polynomials, we see that deg( p,,) = deg( p,,) = 5. Only two combinations of the 
irreducible factors (1 + x), g,(x), G(x) are possible: 


Pu(x) = py (x) = (1 + x)ay(x)aqn(*) = fo() 


py(x) = +x)\a(x)) — py(x) = + x)(qn(x)). 


However, (1 + x)(q,(x))? has a couple of negative coefficients and hence cannot be 
the companion polynomial of any random variable. We are left with the case 
Py = Py =Px=Py- Then U and V are fair 6-sided dice. 

A similar procedure must be carried out for each n. One writes 


(1 + x)gq,(x)--- q,,(x) n even 


j,(x) = | 


Q(x) +°* An(x) n odd ” 
where 
n—2 
n even 
m = ’ 
n— 
n odd 
and g, --- q,, are irreducible quadratic factors. One has q,(x) = 1 — k,(p)x + x’, 


where k,(p) = 2cos(2p 7/n) for p =1,..., m. All of the real factorizations of 
[ f,(x)]* into products of these irreducible factors must be calculated. 

For each n, these cases were examined (for the presence of negative coefficients) 
using a computer and MACSYMA, a large symbolic manipulation program devel- 
oped at the M.I.T. Laboratory for Computer Science. In each case, it was de- 
termined that only the “fair” factorizations of [f,(x)]? yielded polynomials with 
nonnegative coefficients. A copy of the computer output is available upon request. 


The theorem is no longer true if we relax the condition that U and V have <n 
sides. For example, suppose n = 6. Allowing the dice U and V to have more than 
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Six spots on a given side, we may consider a factorization 
2 
[ fe(x)]° = py(x)py(x) = [1 + x) q(x) fa + x)qy(x)(q2(x)) |, 
= [14+ 2x 4+ 2x74 x7][1 4+ x7 4+ x7 4+ x74 x°4 x’, 
which yields the following. 


Example 1.3. Let U and V be independent random variables with distributions 
given by 


Event 


W. W. Funkenbusch pointed out to the authors that the dice U and V can be 
physically realized by taking a pair of ordinary dice and changing the number of 
spots on their sides. 


U; v:[1]3] 4/5] 6)8 
Of course, according to our definition of “n-sided die,” given above, the dice U and 
V here are “4-sided” and “8-sided,” respectively. 


THEOREM 1.4. Let X, Y,U,V be n-sided dice. Suppose that X and Y are indepen- 
dent fair dice and that L(U+ V) = L(X + Y). Let py and p, be the companion 
polynomials for U and V. Then 

1) py and p, are palindromic. 

2) If py = Py, then U and V are fair dice. 

3) Suppose that the coefficients of p,, and p, are all positive rational numbers. 
Then U and V are fair dice. 


Proof. 1) It follows from lemma 1.1 that p,(x)p,(x) =[f,(«)]”?. Now the 
complex roots of f, are the n — 1 roots of unity, excluding x = 1. We pair each root 
with its complex conjugate and write 


(x + 1)q,(x)--- q,,(x) n even 


f(x) = 
Q(x) °** Am(x) n odd 

where q,,...,9,, are reducible quadratics of the form g(x) = (x — w)(x — w) = 
1 + kx + x*. The irreducible factors of p,, and p, are thus all palindromic. So p, 
and p, are palindromic. 
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2) In case py = py, we see that p,(x)p,(x) =[py(x)]* =[f,(x)]’, so that 
(Py —f, Pu + f,) iS the zero polynomial. It follows that p, = pp, =f,,, so that U 
and V are fair dice. 

3) The roots of p,(x) and p,(x) are algebraic integers. Gauss’ lemma (3; 
Theorem 9.7] implies that the coefficients of p, and p, are ordinary (“rational’’) 
integers. Put 


Py(x) = to + ryx + ry x? + eee +r, 4x"? + rx" } 
py(x) = Sg + SX + Sx? to... +5, 9x"? 4 5, 4x"! 
, —_ — 
[ f,(x)] — lo + t,x + t x? + ---. +5, 3x7" 3 4 ty, 9x°" 2 


Then ¢, = minf{k +1,2n-—1—k}=n95, + rys,_) + +++ +r,59. Now py and py, 
are monic, and from part 1, palindromic. So r,_, = 1 = 7% and s,_, = 1 = S59. Then 
t; = 2 = 1795; + 7,Sy = S, + 1. Since all the coefficients are assumed to be positive, 
the only possibility is that 1 = 5, =s,_, and 1 =r, =1r,_,. Then consider ¢, = 3 
= 1S. + 8, + HS) =5,+1+ hn, forcing s,=s5, ,=1 and r,=r,_,=1. One 
proceeds inductively to obtain 4 =7, = ++: ="%_-1) =S59 = Sp = 1° = 5,-, = 1. 
Both U and V are fair dice. 


In part 3 of theorem 1.4, the assumption of rationality for the coefficients cannot 
be eliminated. In theorem 1.14 below, we exhibit independent, but unfair, n-sided 
dice whose sum behaves as if the dice were fair. 


One further note: the coefficients 7, 7,,...,7,_, are all rational if and only if 
Sq> Sy,-++5 S,—, are all rational. This is because, when 1, r,,..., 7, are known, the 
coefficients so, 5,,..., 5, -, may be obtained by solving the system of linear equa- 
tions 


using Cramer’s Rule. Since the matrix of coefficients and the min{k + 1,2n — 1 — 
k}\ are rational, it follows that s),..., 5, are rational. 

We now turn our attention to Chebyshev polynomials. They are a most useful 
computational tool for handling the factorization of cyclic polynomials. 

The Chebyshev polynomials Sp, S,, S,,...(actually, these are versions of what 
are termed “Chebyshev polynomials of the first kind’) have many uses in mathe- 
matical analysis, and there are many equivalent methods of defining them (see, e.g. 
[2] or [4]). One way is to set So(x) = 1 and S,(x) =x and use the recurrence 
relation S,,;(x) = xS,(x) — S,_,(x). A matrix formulation is 


F ea lar —S,-1(%) 
1 ox} | S,-.(x) S,(x) } 


We summarize the basic facts about Chebyshev polynomials that we shall need in 


LEMMA 1.5. Let So, S;, S5,... be the sequence of Chebyshev polynomials. 
1) (x — 24-5 S,,(x) = Si(x) — S;-4(%) — 1 forj = 1,2,.... 
2) (sin #)S;_,(2cos @) = sin( 76) for j = 1,2,.... 
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3) Let a and B be real numbers in [—1,1] such that neither 1 — az + z? nor 
1 — Bz +z? vanishes for complex values z in the open disk \z| < 1/2. Then for 
jz) < 1/2 


l1—az+2? 


1- Bz 4 72 =1+ (B a a) DL Si(B)e. 


Proof. 1) Define the matrix 


0 -Il 
A |‘ a 
and consider the identity 
I1+A+A*4+-+--4+A"!=(A-J) -'(A"-T) 
_ | —_ xX _ —S,,-2(x) —1 —S,4(x) 
x—2 1 1 S,-1(x) S,, (x) —1 . 


Inspection of the lower right-hand entry yields the desired result. 

2) This well-known identity follows from the basic recurrence relation by a 
simple induction argument. 

3) Define a function of two variables 


e(28) = ¥ S(B)2! 


Then 


(B - z)e(z,B) = B + ¥ (B58) - s_\(B))z! 


= f+ +¥s S)44(B)z/ 


by the recurrence relation. So 
z(B — z)g(z,B) — Bz = ys 41(B8)2/** = g(z, 8B) —1— Bz, 
j= 
which leads to 


e(z,B) = ¥S(B)z/ = 


Now 


l-—az+2z° (B-a)z 


To prea?) 1 per ze \B~ v)78l2 8): 
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so that 
1—az+z?’ 


——_—__ = _ jt 
Ta preg 71+ (8-0) YS(): 
as desired. It remains only to note that |S,(8)| < 2” whenever |8| < 1 (proof by 
induction), so that the power series we have been manipulating so freely in fact 
converges for |z| < 1/2. (The reader might want to press on and show convergence 
for |z| < 1, which does hold.) 

In view of theorem 1.2, it involves no loss of generality in considering n-sided 
dice with fair sums where n > 10. With this restriction on n, we introduce some 


notation. Given n > 10, choose { p,q} € {1,2,...,m}, where p # q and 
n— 1] 
2 
n—2 
2 


if n is odd 


m= 


if n iS even. 


Define k,( p) = 2cos(27p/n) and set 


1—k,(p)x + x? 


x Sn (%). 


h ,G; = 
nl Po 5 X) 1—k,(q)x+x 


Put 
A,,( p,q) = cos(2aqg/n) — cos(27p/n) 
= 2sin(7(p + g)/n)sin(a( p — q)/n), 
and define c; = c,( p,q, n) by 
h, (p,q; x) =) +¢,x + ¢,x* + "rs $C 
and d, = d(p, g,n)= c,(q, p, n), so that 
h,(q, p>x)=dy+d,xt+d,x?> +--+ +d, ,x"}. 
Define 


2pqT 


$(p) = sin{ = + in| 


*1PE) gl 2 Dee 


n n 


THEOREM 1.6. With notation as above, the following identities hold: 


c= 14 dy(poa)|1 : cn 0") ata ]} 


d,=14X,(q, p)(t - cos{ 27)) “a (p)(sin( 22) 
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Proof. Since 1 — k,(p)z + z? and 1 — k,(q)z + z* are nonzero in the complex 
disk |z| < 1/2, we have 
1 — ax + x? 


Topraxt bt (B- aU S(B)x xl < 12 


J 


fora =k,(p) and B = k,(@q) or vice versa (lemma 1.5.3). So 


h,(p,4qs x) = {1 + 20,(p, g)(So(Kn(q))x + Si(k,(q))x? + +++ )} 
x {1 +(x+tx*4--5 +x" *)} 
= {1+ A(x)} x {1 + B(x)} 
= {1+ A(x) + B(x) + A(x) B(x)}. 
We calculate 
A(x)B(x) = 20,(p, g)(So(kn(g))x + Si(Kn(q))x? + ++) 
X(x tx? +--+ +x""1) 
=A,(p.9)(So(Kn(q)) x? + (So(kn(9)) + Si(Kn(q))) <3 + +++). 
Then 
h, (p,q; x) =1+ x(2A,(p, 4) So(kn(g)) + 1) 
+x°(2A,,(p,g)(So(k,(4)) + Si(k,(4@))) + 1) 
+x°(20,( Pp, g)(So(kn()) + Si(k,0g)) + S2(k,(9))) + 1) 


+... ; 
so that 
jr-l 
C; =I+t+ 2A,( Pp, q) » Sn(kn(q)) 
m=(0 
-1 
= 1+ 2A,(p,4)(ki(9) — 2) (Si(kn(@)) — Sia(kn(@)) 1) 
for 7 = 0,1,...,2—1 (lemma 1.5.1). Note that this infinite series expansion of 


h,( p,q; x) 1s in fact just a polynomial. Application of lemma 1.5.2 shows that 


yf sin((j + 1)2mq/n) — sin(2aqj/n) 
c= 1+ 20,0, a)(kn(g) — 2) ema ~ sin(2g/n) 


2aq\\~? ; -1 

=1+A,(p,q){1- cos =A $;(q)(sin(27g/n)) 

as desired. The formula for d, is obtained by reversing the roles of p and q. 
THEOREM 1.7. With notation as above, one has c, > 0 if and only if 


Aaa co) ~ of 2 MAE)) «il ein 2 


n 
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and d, > 0 if and only if 


(4; p)[oos{ 2? - cos{ 7} < sin( 7 )sin| aa} 


n 


Proof. Using theorem 1.6, we note the following equivalent statements, each of 
which is equivalent with c, > 0: 


—A,,(p,9)¢,(4) < [i 7 cos| 2) sin| 
—A,(p,9)%,(4) < a sin( )) sin 


—r,(p, al sin( = Jeos{ = + 2e0s{ AT nl - al 


(areal £2) co 2*D#E)) «sal in 2 


as desired. The result for d ; follows in like manner. 


COROLLARY 1.8. With notation as above, suppose that p < q, so that \, (p,q) < 0 
<X,(q, p). Then h,( p,q; x) has strictly positive coefficients if 


+q)7 —p)a 7 2qu 
2 sin PaO | ig SP) < tan( $ sin| “2, (*) 
n n 2n n 
and h,(q, p; x) has strictly positive coefficients if 
+q)q —p)a pT {&pa 
n n 2n n 


Proof. Assuming (*) holds, we have 


(4, p){eos{ ~ cos NET) <A,(4, p)[eos{ + i 
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so that, by theorem 1.7, each c, is positive. Now assume (**) and note 
2) + 1)pa 7 yr 
r,,(9; p)(ecos{ “> SP") — cos| 7 <iX,(q, p(t — cos =") 
n n n 


WT WT 
< An( qs p)tan| 5 }sin( =) 
2 n 


n 


_{2pa\ (pt 
< in| jsin{ 2) 
n n 


so that, by theorem 1.7, each d fat positive. 
Define a function Q by the rule 


sin(az/n) 


O(a, b,n) = 2sin(ba/n) — 


COROLLARY 1.9. With notation as above, assume that gq = p+1<1n/2. Then 
h,( p,q; 2) has strictly positive coefficients if 


{7 qu 
sin| — < tan (2p + 2,2p+1,n), 
n 2n 
and h,(q, p; x) has strictly positive coefficients if 


(% pT 
sin{ ~ tan = < O(2p,2p+1,n). 
n 2n 


Proof. Immediate consequence of corollary 1.8. 


COROLLARY 1.10. Given n > 10, take p = (n — 3)/2,q =(n — 1)/2 if n is odd; 
take p = (n — 4)/2, q =(n — 2)/2 ifn is even. Define 6(n) © {1,2} by 2p + 8(n) 
=n — 2. Thenh,( p,q; x) has strictly positive coefficients if 


T qu 
sin( = < tan =) 9(3(n), 8(1) + 1,n). 
n n 
Also h,(q, p; x) has strictly positive coefficients if 


sin( ~ Jtan( = < O(6(n) + 2,6(n) + 1,7). 


Proof. This follows from corollary 1.9, noting that 


| (2p + 7) _ in S(n)7 


sin 


n n 


n n 


| (2p + —*) _ | (d(n) + a) 


n 


in| -7) = sin( P=) 
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THEOREM 1.11. Given n > 10, put 6(n) = 1 ifn is odd and 8(n) = 2 if n is even. 
Set m = (n — 6(n))/2. Then h,(m, m — 1; x) has strictly positive coefficients. 


Proof. By corollary 1.10, it suffices to show that 


sin( ~ tan = < QO(8(n) + 2,8(n) + 1,27). 
Now Q(6(n) + 2, 6(n) + 1,7) > 1/2 because 


in| (6(n) + a . in (8(n) + a 0 


n n 


Also, 


because 2p < n — 3. Thus 


sin( = }tan{ | < sin < 1/2 < Q(6(n) + 2,8(n) + 1,7), 


n 


as desired. 
LEMMA 1.12. Ifn > 10, then Q(2,3,n) > 1/3. 


Proof. For 6 = 7/n, we need to show that (sin 2@)/(sin 36) > 2/3. Now 


cos 26 
cos 6 + sin | | < 3/2 
sin 24 


because (cos 2@)/cos @ < 1, and hence 
sin 36 = sin20 cos 8 + sin 8 cos 26 < (3/2)sin 20. 


The results follow, noting that 0 < sin2@ < sin 38. 


THEOREM 1.13. Given n > 10, put 6(n) = 1 ifn is odd and 8(n) = 2 if n is even. 
Set m= (n — 8(n))/2. Then h,(m — 1, m; x) has strictly positive coefficients for 
n > 14 and for n = 12. 


Proof. By corollary 1.9, it suffices to show that 


sin( — < tan 5) 0(3(n), 8(n) +1,n), 


where g = m. For n odd, this is the same as sin(27/n) < (1/2)tan(m7/2n). For n 
even, lemma 1.12 serves to establish the sufficiency of sin(7/n) < (1/3)tan(m7/2n). 
In either case, success at a given value n, implies success at all n > n, such that 
n =n, (mod 2). Thus the proof is complete for n > 14 upon inspection of the 
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following table of calculator approximations: 


n sin(7/n) sin(27/n) (1 /3)tan(ma7/2n) (1/2)tan(m27/2n) 


12 2588 2558 
13 4647 4430 
14 2225 .2658 
15 4067 4502 


s e < t 9 by) . 


Finally, we are ready for a result complementary to theorem 1.2. Combined, the 
two results provide a complete answer to the question of whether unfair n-sided dice 
can have a fair sum. 


THEOREM 1.14. Suppose that n © {10,12} U {14,15,...}. Let X and Y be inde- 
pendent fair n-sided dice. There are independent n-sided dice U and V such that 
L(U+ V)=L(X+ Y), but U and V are not fair. 


Proof. Case 1: Suppose n © {12} U {14,15,...}. Put 6=1 if n is odd and 
0 = 2if n is even. Set m = (n — 6)/2. By theorems 1.11 and 1.13, 4,(m — 1, m; x) 
and h,(m,m — 1; x) are polynomials with strictly positive coefficients. So let U 
and V be n-sided dice whose companion polynomials are 


Py(x) =h,(m — 1,m; x) py(x) =h,(m,m—1; x). 


Then py(x)p,(x) =[f,(x)I’, so that L(U + V) = L(X + Y) as desired. 
Case 2: Suppose n = 10. Computation of 


~S,-2(2) sgl ={t iy 
Si(x)  S(x)} [2 x 


—S5,3(x) — S5,.1(x) 
a S,(x) 


yields the identity S,, = (S, — S,_,)(S, + S._,). Taking r = 2 gives 
(2cos 8)’ + 2cos9 -1=0 
S,(2cos@) =0 iff or 
(2cos 0)” — 2cos8 -1=0 
Thus we may tabulate the values of k,9(p) = 2. cos(27p/10): 
ky9(1) =(1+ V5 )/2 k4o(2) =(-1+ V5)/2 
k1(3) _ (1 — V5) /2 k19(4) _ (-1 — V5 )/2. 


(This explicit form for cos(36°) does not seem to be widely known.) A manageable 
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hand computation yields 


hig (3,43 x) = (1 + x)(1 — kyo (1) x + x?2)(1 — kyo (2)x + x?)(1 — keyg (3) x + x2)? 
= (x9 +1) +0- (x8 + x) + (1/2)(1 + VS) (x7 + x7) + 0+ (x8 + x3) + (0° + x4) 
hig (4,3; x) = (1 + x)(1 — kyo (1) x + x2) (1 — kyo (2).x + x2) - (1 — kyo (4) x + x?) 
= (x? +1) + 2(x8 + x) + (1/2) - (5 — V5)( x7 + x?) + (3 — ¥5)(x° + x3) 
+(4— 5)(x5 + x4). 
Each of these polynomials has non-negative coefficients. (Note that since there are 


some zero coefficients, machine approximations are not to be relied on for this 
example.) 


Let U and V be 10-sided dice whose companion polynomials are 
Py (x) = Ayo(3, 4; x) Py (x) = Ayo (4,3; x). 
As before, L(U + V) = L(X + Y), but U and V are not fair. 


Note. A computer search (using MACSYMA) revealed this as the only such 
example for n < 11. 
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UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N IN4. 


Three Old Problems about Polynomials with Real Roots 


RAPHAEL M. ROBINSON 
Department of Mathematics, University of California, Berkeley, CA 94720 


We consider only polynomials all of whose roots are real. For such a polynomial 
of degree n > 1, the span is the maximum distance between two roots, and the gap 
is the minimum distance. All three problems concern spans, and the third also 
involves gaps. 

Problem 1. The polynomials considered here are monic and lie in Z[x], that is, 
have integer coefficients and leading coefficient 1. It was shown in [1] that there are 
infinitely many monic polynomials in Z[x] which are irreducible and have all their 
roots in [a, b], provided that b — a > 4. Polya had proved earlier that the number is 
finite if b — a < 4. If b — a = 4, then there are infinitely many such polynomials if 
a and b are integers. Are there any other cases with b — a = 4 where the number is 
infinite? Among the monic polynomials in Z[x] whose roots all lie in [—2, 2.5] but 
not all in [—2,2], are there infinitely many which are irreducible and have span 
< 4? If the answer to the first question is yes, then so is the answer to the second, 
but perhaps not conversely. 

Problem 2. Here we allow polynomials in R[x], that is, polynomials with real 
coefficients. An attempt was made in [2] to find the maximum possible span for the 
Ath derivative of a polynomial f(x) all of whose roots lie in [—1,1]. The only 
nontrivial cases are those with k +2 <n<2k+4+ 1. It was shown that in these 
cases the maximum can be attained only when all the roots of f(x) lie at 1 or —1. 
The obvious conjecture is that these roots must be distributed as equally as possible 
between the two end points. How can this be proved? 

Problem 3. A study was made in [3] of monic polynomials in Z[x] having real 
roots and span < 4. The objective was to determine all such polynomials of degree 
n < 8 which are irreducible. In the process, it was necessary to compute some 
reducible polynomials with distinct roots. It was noticed that there is a strong 
correlation between reducibility and the smallness of the gap. For example, among 
the monic polynomials in Z[x] with span < 4, there are 17 essentially different 
sextics which are irreducible, and all have gaps > 0.23. On the other hand, there are 
19 such sextics which are the product of two irreducible cubics, and 12 of these have 
gaps < 0.15. Is there a general theorem which determines values of n, s, and g, so 
that a monic polynomial in Z[x] of degree n with real roots is reducible if it has 


329 


330 UNSOLVED PROBLEMS [April 


span <s and gap < g? Many reducible polynomials with distinct roots should 
satisfy the conditions. One suitable triple is n = 6, s = 4, and g = 0.23. 
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A Challenging Definite Integral 


H. S. M. COXETER 
Department of Mathematics, University of Toronto, Ontario, MSS 1A1, Canada 


In spherical 3-space, that is, on the 3-sphere x? + x3 + xf + xj = 1 in Euclidean 
4-space, a certain tetrahedron of known volume (which Schlafli called an ortho- 
scheme) can be dissected into three smaller orthoschemes (two of them congruent) 
whose volumes can be expressed in terms of Schlafli functions [2, pp. 177-179; 1, 
pp. vii, 6-12, 195]. It follows that 


f+2¢=2/15, 


where 


=f" arc Sec t dt = [ee arc sec a 
(t + 2) (t + 


y(t + 1)(t + 3) ° 


N. J. A. Sloane has used a computer to show that 
f = 0.02268 05970 96406 8, 


= 0.05532 63681 18463 3. 


The problem is to establish the precise value of f+ 2g without appealing to 
geometry or the computer! 
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How Big a Slice Can You Make Through a Cube? 


RAPHAEL M. ROBINSON 
Department of Mathematics, University of California, Berkeley, CA 94720 


Prove or disprove that for each n > 2, the maximum (n — 1)-dimensional volume 
of a cross section of an n-dimensional unit cube is /2. 


LETTERS TO THE EDITOR 


Editor: 


In the Notes section of this MONTHLY, Aug.—Sept. 1987, pp. 662-663, Boo Rim 
Choe offers an elegant proof of Y1/n* = 77/6. It must, however, be remarked that 
this proof is by no means new: In a slightly different form it had been published by 
Euler in 1743 ({1]); cf. [2] (p. 388; footnote to nr.210) and [3]. (Stackel’s paper, 
mentioned in [3], is reprinted in Euler’s Opera Omnia I, 14, pp. 156-176.) 


1. L. Euler, Demonstration de la somme de cette suite 1+ 44+ 5+ %+# 
+ 3¢ + etc., Journal littéraire d’ Allemagne, 2:1 (1743) 115-127. (Opera 
Omnia, I, 14, 177-186.) 

2. K. Knopp, Theorie und Anwendung der unendlichen Reihen, Grundlehren 
der mathematischen Wissenschaften 2, 5th ed., Springer-Verlag, 1964. 

3. K. Knopp and I. Schur, Uber die Herleitung der Gleichung L°_ 1 /n? = 17/6, 
Archiv der Math. und Physik, 27 (1918) 174-176. (1. Schur, Gesammelte 
Abh., IT, 246-248; Springer-Verlag 1973.) 


Gerhard Turnwald 
Universitat Tiibingen 
Mathematisches Institut 
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NOTES 


EDITED BY DENNIS DETURCK, RICHARD LIBERA, AND ANITA E. SOLOW 


Two Discrete Forms of the Jordan Curve Theorem 


LAWRENCE NEFF STOUT 
Department of Mathematics, Illinois Wesleyan University, Bloomington, IL 61702 


The Jordan curve theorem is one of those frustrating results in topology: it is 
intuitively clear but quite hard to prove. In this note we will look at two discrete 
analogs of the Jordan curve theorem that are easy to prove by an induction 
argument coupled with some geometric intuition. One of the surprises is that when 
we discretize the plane we get two Jordan curve theorems rather than one, a 
consequence of the interplay between two natural products in the category of 
graphs. Topology in this context has been studied by Farmer in [2]. 

To state the discrete versions, we need to know what the discrete analog of the 
plane is and what plays the role of a simple closed curve. Since the plane is the 
topological product of two lines, we take as our discrete analog the product of two 
discrete lines. We will use undirected graphs for our analogs of spaces, with vertices 
for points and edges connecting points which are to be thought of as touching. 


DEFINITION 1. A discrete 1 point line [1, 7] is a graph with vertices {1,2,..., 7} 
and edges connecting each vertex to itself and to its successor. The discrete line L is 
a similar graph based on all of the integers. 


DEFINITION 2. A discrete n point circle is a discrete n point line with n and 1 
connected by an edge. 


There are two important products in the category of graphs: the categorical 
product and the tight product. The tight product is used in building graphs using a 
sort of prime factorization in Behzad and Chartrand [1]. 


DEFINITION 3. The product of two graphs (V,, E,)[1(V;, E,) has the set V, x V, 
as vertices and has (v,, v,) connected to (vj, v5) by the edge (e,, e,) if e, connects 
v, and vj and e, connects v, and v3. 


DEFINITION 4. The tight product of two graphs has V, X V; as its set of vertices 
and has an edge connecting (v,, v,) and (v}, v5) if and only if v, = vj and there is 
an edge connecting v, and v3, or v, = v4 and there is an edge connecting v, and vu}. 
We denote this as (V,, £,)O(V;, E,). 


If we take the product of two lines we get a patch of the plane with points 
connected which are nearest neighbors vertically, horizontally, or diagonally. If we 
take the tight product we leave out the diagonal connections. 


The analog of continuous functions will be mappings of graphs: vertices are 
taken to vertices and edges to edges. A closed curve is the image of a circle under a 
graph map. It is simple if the map also reflects adjacency; that is, if c(v) has an 
edge connecting it with c(v’) then v and v’ had an edge connecting them too. 
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Simple curves then are forbidden to touch themselves, not just forbidden to cross 
themselves. This puts us in a position to state the two forms of the Jordan curve 
theorem. 


THEOREM (Jordan curve theorem for tight closed curves). If s is a simple closed 
curve with domain having at least 8 points in LOL, then L X L\ im(s) has exactly 
two product path components. 


THEOREM (Jordan curve theorem for product closed curves). If s is a simple 
closed curve with domain having at least 4 points in LI\L, then L X L\ im(s) has 
exactly two tight path components. 


@ e r) e e e ° . . . NAIA, | 
ee oe oe NVAMASONV ae 
ae Pero, — WN NAS 
ee ee ANAIZIMS 
| i fot ° A INMN 
ai nt el Forts ss 
Fic. 1. Fic. 2. 


Notice that in Figure 1 the interior of the tight product closed curve is not 
connected in the tight product space. The interior is, however, connected in the 
product space, which allows diagonal connections. In the second illustration we 
have a simple closed curve in the product sense which fails to disconnect the 
categorical product space. If we use the tight product instead, then the interior is 
not connected to the exterior and each forms a connected set. The minimum size 
restriction eliminates the trivial cases in the next illustration. 


, ott. 2. LT MAN. CW 
ee ae rn ae 
JN 
Fic. 3. 


Proof (for product closed curves). Since a simple closed curve involves only a 
finite number of points we can move it into the first quadrant and guarantee that 
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the coordinates of points are bigger than 0 and less than m for sufficiently large m. 
We define the rank of s as the triple (N, X, Y) where N is the number of distinct 
points in the closed curve and (X, Y) is the point in the closed curve with largest 
first coordinate X and largest second coordinate Y of the points of im(s) with that 
first coordinate. Ranks are ordered lexicographically. This is a well-ordering, so 
strong induction on rank is a valid proof technique. 

The smallest simple closed curve for this theorem has N = 4. It forms a diamond 
surrounding a single point which forms the inside component. All other points are 
connected to the point (0,0) by a tight path. The requirement that a simple curve 
reflect adjacency eliminates other possible curves of length four. Thus the theorem is 
true for closed curves with length 4. 

Now suppose that the theorem has been proved for all closed curves with rank 
less than (N, X, Y) and that s is a simple closed curve with rank (N,_X, Y). We will 
reduce the rank by moving the point (X, Y) to (X — 1, Y). The points in the closed 
curve s which were adjacent to (X, Y) could only be among (X, Y — 1), (X — 1, 
Y — 1), and (X¥ — 1, Y + 1). (Two points are adjacent to (X, Y) and they must be 
nonadjacent, hence, (X — 1, Y) is not one of the possible points.) All of these are 
adjacent to (X — 1, Y) so the result is still a closed curve, though it may not be a 
simple closed curve. Observe that moving this point reduces the rank. If the new 
closed curve is a simple closed curve then we are done since the interior of the 
original curve is the interior of the curve of lower rank with the point (X — 1, Y), 
which is tight adjacent to it, added. The exterior of the original curve is the exterior 
of the new curve with the point (X, Y) removed. This is still tight connected since 
any tight path passing through (X, Y) in the exterior of the lower rank curve can 
take a detour through (X, Y + 1),(X + 1, Y +1) and (X + 1, Y). 

There are two ways for the resulting closed curve to fail to be simple: either the 
point (X — 1, Y) is adjacent to one of the points two steps away from (X, Y) in s, 
or it is adjacent to a point more than two steps away. If (X,Y) was s(h) and 
(X — 1, Y) is adjacent to s(h — 2) then we can remove s(h — 1). If (X, Y) was s(h) 
and (X — 1, Y) is adjacent to s(# + 2) then we can remove s(h + 1). Removing 
these points, if necessary, will further reduce the rank. The interior of the resulting 
curve is tight connected to (X,Y), so the interior of the original curve is tight 
connected. Any tight path passing through one of the points removed has a detour 
which avoids them and stays in the exterior. Figure 4 shows how this works for a 
typical case. 


Suppose that (X, Y) is s(h) and (X — 1, Y) is adjacent to s(k) where k is more 
than two away from h. Then by moving to (X — 1, Y) we pinch the closed curve 
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into two closed curves which have a tight path connecting their interiors which 
passes through the point (X — 1, Y) and each of which is strictly shorter than our 
original loop. (See Figure 5.) Since they have smaller ranks they each divide the 
product into exactly two tight pieces. The interior of s is then the union of the 
interiors of these two new closed curves plus the point (X — 1, Y). It remains to 
show that the exterior is tight path connected. 
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The exterior is the intersection of the exteriors of the two new closed curves. Call 
the new closed curves s, and s, and renumber so that the intersection points are at 
t= 0 and ¢=1, with (X — 1, Y) =s,(1). Let p and gq be in ext(s,) O ext(s,). 
Since ext(s,) 1s tight path connected there is a tight path in ext(s,) from p to gq. If 
that path is also in ext(s,) then nothing more needs to be done. If not then there are 
points p’ and q’ such that p’ is the last point in the path for which the segment 
from p to p’ 1s in ext(s,) and q’ 1s the first so that the segment from q’ to g is in 
ext(s,). It follows that both p’ and q’ are adjacent to points in s,. Thus to prove 
the theorem it will suffice to show that the set E of all points adjacent to s, and in 
the exterior of both curves is tight connected. 

Since the original curve was simple we know that s,(0) and s,(1) are the only 
points in s, that are adjacent to points in s,. We will show that F is tight path 
connected by walking around s, starting at (XY, Y) and observing what happens in 
each nine point patch with an element of s, at the center. It is not difficult to list all 
of the ways that a product path can pass through a nine-point patch (see Figure 6) 
and in all cases the points on either side of the path form tight path connected sets. 
Since s, 1s of finite length we can piece together such patches to see that the set E is 
tight path connected. 
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The proof of the theorem for simple closed curves in the tight product is a 
similar, though slightly less difficult, induction argument. The rank is defined the 
same way as in the product case. The reduction is done by moving the point (X, Y) 
to (X — 1, Y — 1) which either gives a simple closed curve of lower rank or pinches 
the curve in two or gives a curve which can be shortened. Analysis of the possible 
nine-point patches again allows us to show that the exterior is connected. 


The author would like to thank the referees and Randall Weiss for suggestions which improved this 
note. 
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Some Polynomial Identities that Imply Commutativity for Rings 
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1. Introduction. Johnsen, Outcalt, and Yaqub [5] considered the ring-theoretic 
analogue of a well-known group-theoretic result, which states that a group G 
satisfying (xy)? = x’y? for all x and y in G is necessarily abelian. It is also an easy 
exercise to show that a group G is abelian if (xy)? = yx’y for all x and y in G; the 
corresponding ring-theory version has not yet appeared in any text. It is surprising 
that the ring theoretic analogue of many such results escaped the attention of the 
research workers, though the commutativity of the rings satisfying other identities 
such as xy*x = yxy has been considered fully [1], [3], [6]. The reason for this sort 
of omission is understandable. We know that the essential mechanism in the proof 
of such results in groups is cancellation, which is not permissible in a general class of 
rings. Only a few results could be proved by going through several permutations of 
the substitutions of, say, y by x + y and x by x +1 starting with the given 
identity. To obtain other results, complicated combinatorial arguments had to be 
used [2], [4]. In this note our objective is to establish a result which allows a limited 
cancellation property in rings with unity. The proof depends on the simple strategy 
of substituting x + 1 for x to get another identity simpler than the original one. 
Indeed, we prove the following: 


THEOREM A. Let R be an associative ring with unity 1 and let F( X,Y, Z) be a 
polynomial with coefficients from elements of R where the indeterminates commute 
neither with each other nor with the elements of R. Suppose that F is homogeneous in X 
of degree n and homogeneous in Y of degree m and that F(x, y, xy — yx) = 0 forall x 
and y in R. Then m!ntF(, 1, xy — yx) = 0 for all x and y in R. 
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In Section 3 we shall use our theorem to find some ring-theory version of the 
group-theoretic results including that of Johnsen, Outcalt, and Yaqub [5] and that of 
Awtar [2]. 


2. Proof of Main Theorem. Let R[X, Y, Z] denote the ring of the polynomials in 
noncommuting indeterminates X,Y, Z over R. Define an automorphism o on 
R{_X, Y, Z] by: 

o[ F(X, Y, Z)] = F(X +1,Y, Z) (1) 


and a o-derivation A =o — /: 
A| F(X, Y, Z)] = F(X +1, Y, Z) — F(X, Y, Z). (2) 


Easy computations show that for any two polynomials F and G in RLX, Y, Z], 
we have 


A[F+ G] =A[F] + A[G] (3) 
A[ FG] = (A[F])(o[G]) + F(A[G]) (4) 


and an induction gives the Leibniz formula 


w[FG] = ¥ ("\(ATF])(ow-"[e)). (5) 


r=0 
This allows us to prove: 


LEMMA. If F is homogeneous of degree nin X, then A"[ F(X, Y, Z)| = n!F(, Y, Z) 
and A"[ F(X, Y,Z)]=90 form>n. 


Proof of Lemma. By (3) it suffices to prove the lemma when F(X, Y, Z) is a 
monomial. It can be proved by induction on n. If n = 0, FLX, Y , Z) is independent 
of X and A F(X, Y, Z)] = F(X, Y, Z) = FU,Y, Z). Again A[FCX, Y, Z)] = 
F(X +1, Y, Z) — F(X, Y , Z) = 0 and, hence, A”[ F(X, Y, Z)] = 0 for all m > n. 

For the induction step, write the monomial F(X, Y, Z) as AXG, where A is a 
monomial with no X’s in it, and G is a monomial of degree n — 1 in X. Then 
A[.A] = 0 and by (4), A[AX] = A; hence, A’. AX] = A’-![ A] = 0 for r > 1 by the 
case n = 0. Again by using (5), we get . 


A"[ AXG] = (AX)A"[G] + nA(oA"|[G]). 


By the induction hypothesis, the first term on the right side is zero if m > n. The 
second term is zero if m > n; if m =n, this term equals nA(n — 1)!G(1, Y, Z) = 
n!F(1, Y, Z), which proves the lemma. 


Proof of Theorem A. If F(x, y, xy — yx) = 0 for all x and y in R, then on 
replacing x by x + 1, we get 


F(x +1, y,(x + 1)y — p(x +:1)) = F(x +:1, y, xy — yx) = 0. 


That is, if F(x, y, xy — yx) is zero, the same is true for o[F], A[F], and A”[F}. 
Hence, n!F(1, y, xy — yx) =0 for all x and y in R. By applying the whole 
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procedure again on the polynomial F(1, Y, Z), which is homogeneous of degree m 
in Y by using a new derivation, A’ defined as 


A’ | F(X, Y, Z)] = FO, ¥ + 1, Z) — F(Y, Z). 


The result is the conclusion of the theorem. 


3. Applications to Commutativity Theorems. We can derive a number of results 
with the help of our theorem proved above. As we have claimed in the beginning, 
even those results which could be proved earlier using very complicated combina- 
torial arguments will become corollaries of our theorem. We need just to select a 
suitable polynomial F(X, Y, Z). 

To begin with we prove the following result. Let us assume hence onward that R 
is an associative ring with unity 1. 


PROPOSITION 1. Let R be a ring satisfying (xy)? = yx’y for all x and y in R. Then 
R is commutative. 


Proof. Take F(X, Y, Z) = ZXY. Then indeed F(x, y, xy — yx) = (xy)? — yx’y 
= 0. Hence, by applying Theorem A, F(1,1, xy — yx) = 0, that is, xy = yx and the 
ring R is commutative. 

Similarly, by taking the polynomial F(X, Y, Z) = XZY we prove the result due 
to Johnsen, Outcalt, and Yaqub [5]. As has been shown in example 3 of [5], if we 
replace the identity (xy)? = x*y’ by (xy)? = x°y’, then the commutativity is not 
guaranteed. The following result suggests that in this case the commutativity fails 
only in rings that have no 2- or 3-torsion. 


PROPOSITION 2. Let R be a ring satisfying (xy)? = x*y? for all x andy in R. If 6 
is not a zero divisor in R, then R is commutative. 


Proof. Take F(X, Y, Z) = X?ZY? + XZXY* + XYXZY, then F(x, y, xy — yx) 
=ix°y? — (xy)? = 0 and by our Theorem A, n!m!F(1,1, xy — yx) = 2!2!3(xy — 
yx) =0. But 2 and 3 are not zero divisors in R, so xy — yx = 0, which gives 
commutativity. 

In fact, applying Theorem A we can derive even more general results. As an 
example we prove below a generalization of Proposition 2 which was earlier 
established by Awtar [2]. 


PROPOSITION 3. Letn > I be a positive integer and R be a ring in which no prime 
number <n is a zero divisor. If R satisfies (xy)" = x"y" for all x andy in R, then R 
is commutative. 


Proof. Just as we did in the proof of Proposition 2, we take F(X, Y, Z) to be a 
sum of n(n — 1)/2 monomials each of which is a product of one Z by n — 1 X’s 
and n — 1 Y’s (it takes one term to move each Y in XYXY... to the right, past one 
X). Then we have F(x, y, xy — yx) = x”y” — (xy)”" = 0, and by Theorem A, 


min!F(1,1, xy — yx) = ((n - 1)!)’- (n(n — 1)/2)(xy — yx) = 0. 


This implies that xy — yx = 0 if all primes dividing ((n — 1)!)?(n(n — 1)/2) are 
not zero divisors. 
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REMARKS 1. If we take F(X, Y, Z) = XZ — 2ZY, the polynomial identity (x + 
2y)xy = xy(x + 2y), or if we take F(X, Y, Z) = XYZ, then the polynomial iden- 
tity (xy)? = xy*x for all x and y in R implies commutativity. Other examples can 
be constructed ad libitum ad infinitum. 

2. More subtle commutativity theorems, which do not work for all rings with 
unity, also often assume polynomial identities of the form F(x, y, xy — yx) = 0, 
but with F(1,1, Z) = 0. 
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An Overlooked Example of Nonunique Factorization 


HALE F. TROTTER 
Department of Mathematics, Princeton University, NJ 08544 


On the face of it, the familiar identity 
sin’t = 1 — cos*t = (1 + cosr)(1 — cost) (1) 


asserts that two different-looking pairs of factors have the same product. It seems to 
have gone unnoticed, however, that (1) is actually a valid example of nonunique 
factorization in an integral domain when looked at in the proper context. Its 
familiarity makes it a particularly attractive example to present to students encoun- 
tering nonunique factorization for the first time. Just as the usual textbook examples 
involving integers in quadratic number fields, such as 2 - 3 = (1 + ¥—5 (1 — V—5), 
show that unique factorization can fail in rings very much like the integers, the 
example treated here shows that it can fail in a ring very much like the ring of 
polynomials over a field. 

Of course we have to do more than simply remark that the two sides of (1) look 
different. We must specify the ring we are working in, and then show that the 
factors sint, 1+ cost, and 1 — cos? are irreducible, and that sin?t is not the 
product of one of the other factors and a unit (invertible element) of the ring. 

We shall work with the real trigonometric polynomials, that is, the functions 
representable as finite sums of the form 

k 
ayg+ > (a,cos nt + b,sin nt) (2) 
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in which the a’s and b’s are real numbers. Students who have seen anything of 
Fourier series find it natural enough to consider these functions, although they may 
not have seen them called trigonometric polynomials, or considered the question of 
whether they form a ring. The familiar Fourier coefficient formulas a, = 
(27) ‘7 f(x) dx, a, =m ‘{", f(x) cosnxdx, and b, = 77 1{™, f(x)sin nx dx for 
n > 0, show that the coefficients in (2) are uniquely determined by the function. 

The degree of a nonzero trigonometric polynomial is defined as the largest value 
of n for which a, and 5, are not both zero. The following well-known lemma shows 
that the trigonometric polynomials form a ring, and that degrees behave as they do 
for ordinary polynomials. 


LemMA. The product of a trigonometric polynomial of degree m and one of degree n 
is a trigonometric polynomial of degree m + n. 


Proof. The assertion of the lemma is obvious if m or n is 0, because a 
trigonometric polynomial of degree zero is simply a constant function. From now 
on we assume m,n > 0. Recall the standard identities for expressing products of 
sines and cosines in terms of sums and differences of other sines and cosines: 


(sin a)(sin b) = [cos(a — b) — cos(a + b)| /2 
(cos a)(cos b) = [cos(a — b) + cos(a + b)| /2 
(sin a)(cos b) = [sin(a + b) + sin(a — b)] /2. 


Applying these to the product of pcosmt + qsin mt and rcosnt + ssinnt and 
collecting terms, gives the result 


Acos(m —n)t + Bsin(m —n)t + Ccos(mt+n)t+ Dsin(m+n)t, (3) 


where A =(pr+ qs)/2, B=(ps— qr)/2, C=(pr-qs)/2, and D=(ps + 
gr)/2. When m > n, (3) is already in the form (2). If n > m, replacing cos(m — n) 
by cos(m — m) and sin(m — n) by —sin(n — m) puts it in the proper form, while if 
m =n it is necessary to replace sinO by 0 and cos0O by 1. Direct calculation gives 


C?+ D?=(p?+q’\(r?4+s7)/4, 


which shows that if neither factor is zero (so p* + gq? #0 and r* + s* #0) then 
C? + D* # 0, so the product has degree m + n. 

Now consider the product of any two trigonometric polynomials of respective 
degrees m and n. It is a sum of products of terms of the type just considered, so it is 
a trigonometric polynomial. The product of the two high-order terms gives a 
non-zero term of degree m + n which cannot be cancelled by any other term in the 
product, so the result has degree m + n as claimed. 


PROPOSITION. The trigonometric polynomials form an integral domain. Further- 
more, 


(a) The units (invertible elements) in this domain are the elements of degree 0, 
that is, the constant functions. 

(b) All elements of degree 1, including sint,1 + cost, and 1 — cost, are irre- 
ducible. 
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The proposition follows at once from the lemma, just as with ordinary polynomi- 
als, and we leave the details to the reader. 

It follows from (a) and (b) that the factors in (1) are irreducible and that sin ¢ is 
not the product of one of the other factors with a unit. Hence we have a genuine 
case of nonunique factorization. 

One can very well stop here in an elementary discussion, but the example does 
raise another point that may be of interest. 

The proof of the lemma uses the fact that the sum of the squares of two real 
numbers is zero only when both are zero, and breaks down if complex coefficients 
are allowed. Using the complex exponential forms of the sine and cosine shows that 
the ring of trigonometric polynomials with complex coefficients is the same as the 
ring of polynomials in positive and negative powers of z =e" with complex 
coefficients. To see that this is a unique factorization ring, define the degree of a 
polynomial in z and z‘ as the difference between the largest and smallest 
exponents appearing in non-zero terms. With this definition, the elements of degree 
zero are the monomials, which are exactly the invertible elements in this ring. The 
usual proof that ordinary polynomials over a field form a Euclidean ring then goes 
through with no essential change. 

What is it about the change of coefficients that alters the nature of factorization 
in the ring? For one thing, introducing complex coefficients produces many more 
units—all the non-zero constant multiples of powers of z= cost+isin¢ and 
z~' = cost —isin¢. Our particular example breaks down because the factors 
involved cease to be irreducible. We have 


sint = (z—z ')/(2i) =z-'(z —- 1)(z + 1)/(2i), 


1 — cost =(-z+2-—27')/2 = —z7"z—-1)’/2, 


and 
1+ cost =z "z+ 1)°/2, 
so both sides of (1) become 
—z~*(z—1)(z+1)’/4 


when expressed as a product of irreducible factors. 

A ring of algebraic integers can sometimes be enlarged to another in a way that 
restores unique factorization, although the problem of how and when it can be done 
is not at all elementary, and as far as I know is not solved in general. For example, 
the ring Z[y—3] consisting of numbers of the form a+ b/—3 with a and b 
integers does not have unique factorization, as the equation 2-2 =(1 + V-3) 
(1 — ¥—3) shows. Unique factorization can be restored in this case by enlarging to 
the ring of all algebraic integers in the field Q(/—3), which is Z[w], where 
w = (1 + Y—3)/2 is a complex cube root of one. This does not work for the ring 
Z{V—5 ] used in the example at the beginning of the paper, because that is already 
the ring of all algebraic integers in the field Q(V—5). The ring of all algebraic 
integers in the enlarged field Q(/—5, i), however, which can be shown to be the 
ring Z[n] where n = (i + ¥—5)/2 is a root of x* + 3x? + 1, is an enlargement of 
Z[{y/—5] that does have unique factorization. I do not know an elementary proof of 
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the last assertion, but it is easily established by standard arguments based on 
Minkowski’s estimate, as illustrated in [1, chapter 12], [2, chapter 13], or [3, chapter 
5]. 
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Two Counterexamples in General Topology 
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In [3] Albert Wilansky inserted two axioms between the separation axioms 7, 
and T,, namely the US-axiom (every convergent sequence has a unique limit) and 
the KC-axiom (every compact subset is closed). He showed that 7, > KC > US > 
T, and discussed at length the problem of constructing counterexamples of compact 
spaces showing the failure of each of the reverse implications, proving several 
interesting results in the process. As a consequence of theorems 4 and 5 of [3], it was 
brought out that, if a 7,-space X is not a k-space (a k-space is one in which a set is 
closed iff its intersection with each closed compact set is closed) then its one point 
compactification X* is US but not KC. An example of a T,-space, which is not a 
k-space is given in [3], example 7. Since this example involves the Cech compactifi- 
cation, it seems worthwhile to have the following two elementary examples. 


Example 1. The Appert space A (see [1], p. 117) whose ground set is the set of all 
positive integers and E C A is open iff either 1 € E or 1 € E and 


Jim -{ y xe()| = 1. 


r=1 


Example 2. The Fortissimo space F (see [1], p. 53) whose ground set is any 
uncountable set with a particular point p and E C F 1s open iff either p € E or 
p © E and F\ E is countable. 

One easily verifies that both A and F above are noncompact 75-spaces, that both 
are pseudofinite (i.e., all compact subsets are finite) and that neither is discrete. 
Hence, neither of them is a k-space so that both A* and Ft are US but not KC. 
This can in fact be shown directly. While A*\ {1} and F*\{ p} are compact 
nonclosed subsets of A* and F”*, respectively, one easily imitates the proof of 
theorem 4 in [3] to show that they are US spaces. 

Other examples may be found in [2, example 2.3] and [4, p. 345]. 
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THE TEACHING OF MATHEMATICS 
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An Elementary Approach to y” = —y 


J. L. BRENNER, 
10 Phillips Rd., Palo Alto, CA 94303 


THEOREM. Every solution of y"” + k*y = 0(k > 0) on the interval a < x < b has 
the form y = c,coskx + c,sin kx, where c,, c, are constants. 


Proof. Without loss of generality, take k = 1 (the general case comes from the 
change of independent variable t = kx). Define 


Cc, = ycost ~ y’sint, 


(1) 


Cy =ysint + y’cost, 


Since y” + y =0, it is clear that c{ = c= 0, so that c,,c, are constant. 
Eliminate y’ from equations (1). 0 


There are many other proofs of this theorem. From y’(y’” + y) = 0, one finds 
12 2 4.2 2 2 _ og 
y’* + y*=r* =const, so that (if r > 0)dy/yr° —y° = +dt, y=rsin(tt + C). 
See also [1, pp. 83-84]. 
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A Simple Proof for the Simplicity of 4; 


BENNO ARTMANN 
Fachbereich Mathematik, TH Darmstadt, SchloBgartenstr. 7, D-6100 Darmstadt, West Germany 


Dedicated to Giinther Pickert on the occasion of his 70th birthday 


The alternating group A, is usually the first example of a simple nonabelian 
group for the student. The group A, is isomorphic to the group D of rotations of a 
regular dodecahedron (or icosahedron). I think it may be worthwhile to see a proof 
for the simplicity of D whose ideas are immediately transparent, easily remembered, 
and use nothing more than the concept of a group and a homomorphism between 
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E 3259. Proposed by Jordi Dou, Barcelona, Spain. 


Let R be a semicircular region bounded by a line LZ and a semicircle S with 
center on L. Suppose P, and P, are given points in the interior of R. We wish to 
find parallel lines /,,/, through P,, P,, respectively, such that 


where C,, D, are the intersections of /, with S and L and C,, D, are the 
intersections of /, with S and L. Give a necessary and sufficient condition on 
P,, P, for such parallel lines to exist. 


E 3260. Proposed by Peter Andrews and Edward T. H. Wang, Wilfrid Laurier 
University, Waterloo, Ontario, Canada. 


In how many ways can n white squares and n black squares be chosen from a 2n 
by 2n chessboard in such a way that no two of the chosen squares lie in the same 
row or the same column? 


350 
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E 3261. Proposed by Detlef Laugwitz, Technische Hochschule, Darmstadt, West 
Germany. 


Let G be the group of 3 by 3 orthogonal matrices with rational entries. Let S be 
the subset of G consisting of the six 3 by 3 permutation matrices and all matrices of 
the form 


a b 

- - Q 

Cc Cc 

b a ; 
—-—- —- QO 

Cc Cc 

0 0 1 


where a, b, c are integers with a? + b* = c* > 0. Does S generate G? 


E 3262. Proposed by R. G. E. Pinch, Emmanuel College, Cambridge and C. 
Boyd, University of Edinburgh. 


It is known that every natural number n can be expressed as the sum of four 
squares of integers, and that three squares suffice unless n is of the form 4°(85 + 7). 
Show that every sufficiently large natural number can be expressed as the sum of at 
most five squares of composite numbers (i.e., squares of positive integers divisible 
either by the square of a prime or by the product of two distinct primes). 


E 3263. Proposed by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. 


For n > 2 let 
H,= {X=(x,...,%X,)i x +++ +x, = 1, x, >0,...,x, > 0}. 


Forl1 <k<nand X€H, let 


S,(X) = : ee 
(=D 
where the sum extends over all k-tuples (i,,...,i,) with 1 <i, < ++: <i, <n. 


Put 


M,(n) = sup S,(X). 


XE A, 


It is not hard to see that M,(n) = (n — 1)~” (D.S. Mitrinovic, Analytic Inequali- 
ties, Springer, Berlin, 1970, (3.2.46) on p. 214). 
(a) Show that M,(n) = 1. 
(b) Show that M,(4) = 4/27. 
*(c) For what pairs k,n with 3 < k < nis it true that M,(n) = (")(n —1)°*9 
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E 3264. Proposed by Stanley Rabinowitz, Alliant Computer Systems Corp, Little- 
ton, MA. 
Let P,/Q,, be the nth convergent for the continued fraction 
1 1 1 =6/1 


1+2+ 34+ 44+ 
1e., let P; = 1, QO, = 1, P, =2, Q, = 3, and 


P,=nP,_,+ Po, QO, = nQ,-1 + QO,-2 (n> 3). 


(Cf. Hardy and Wright, An Introduction to the Theory of Numbers, Chapter 10.) 
Give asymptotic estimates for P, and Q,. 


9 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Maximal Minimal Card Shufflings 


E 3143 [1986, 299]. Proposed by Allen J. Schwenk, Western Michigan University, 
Kalamazoo, MI. 


A riffle shuffle of a deck of cards is the commonly used technique of cutting the 
deck into two portions (not necessarily equal), then, elevating the corners slightly, 
allowing each portion to fall card by card (not necessarily alternating) merging with 
the other portion, and finally pushing them together to reconstitute the pack. Given 
a deck of n cards in arbitrarily permuted order 7, determine as a function of 7 the 
minimum number of riffle shuffles that could possibly produce the identity se- 
quence 1,2,..., n. Describe a procedure that attains this minimum. Which original 
sequences require the most shuffles? 


Solution by the proposer. For the permutation 7 = a,,..., a,, let a descent be a 
position j such that a, > a,,,. By convention, we always consider position n to be 
a descent; thus the identity permutation is the only sequence with 1 descent. We 
show that the minimum number of shuffles required is [log.d (a)|, where d() is 
the total number of descents in 7. 

By partitioning the sequence 7 at the descent positions, we may view 7 as 
consisting of d = d(7) blocks B,,..., B, that are strictly increasing. Now perform 
a riffle shuffle by cutting the deck after the block B,,,., and merging the two 
portions so that block B, merges with block B,,/.),; to form a new increasing block 
C, for each j < |d/2). If d is odd, the last block B,,,, is unaltered. Thus, one 
shuffle transforms 7 to a sequence with [d/2] descents, and iterating this procedure 
produces the identity permutation after |log,d(1)| shuffles. 

On the other hand, any cut of the original deck leaves at least [d/2] descents in 
one portion of the deck, and merging cannot reduce this number. By induction on 
d, at least [log,d(2)]| shuffles are needed to complete the job, so [log,d()] are 
needed for a sequence with d descents. 

The maximum required number of shuffles is k = [log,n]|, and the permutations 
requiring this many shuffles are those with more than 2*~! descents. 


Also solved by K. Schilling and by L. Szuecs. 
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Generating Sets in a Topological Space 


E 3144 [1986, 299]. Proposed by Edwin Buchman, California State University, 
Fullerton, CA. 


Determine the maximum number of sets in a topological space which can be 
generated from one set by application (possibly repeated, in any order) of the 
operations of taking the complement, interior, and boundary of a set. 


Solution by Jesus Ferrer, Cami Collado, Oliva (Valencia), Spain. At most 34 
different sets can be constructed from one set by repeatedly taking the complement, 
interior, or boundary. Let A’, A°, A* denote the complement, interior, and boundary 
of a set A. Then the 34 sets that can be constructed appear in the following tree. 


A A*® At* A*®*’ 
A*9 Ax Ox A*O#! 
A*o A*X00 A007 


A° Ao* Ao! 
A” Ao A0/0# Aol! 


Aoror A01010 A019101 
A’ A’? A’o* A10%0 Al 0x07 
Al ow? 
A’ A/o0 A/010% A010 #! 
Aloo" A/9/070 A/010107 


The lack of additional sets follows from A* = A’°’ ) A®’, A’* = A*, A*/o = 
A*", A®*9 = J = 40 Aox# = 4 A’010/0/0 — 47070. AP1070/0 — 4010. A1010/10% — 
A/010% A01010% = 400% A¥*01010 — 4% 0. At010% = A*o* A*t** = 4** Also A10*9 — 
@ and A’°*” = VX 

To see that the number 34 cannot be reduced, let X be the real numbers with the 
usual topology, and let @ be the rationals. Then all 34 sets are distinct when A is 
the set 


A = [0,1) U (1,2) U {3} U (Qn [4, 00)). 


Editorial comment. This problem is very similar to Advanced Problem 5996, [1974, 1034; 1978, 283]. 
That problem asks the same question for {closure, interior, union}, the answer being that at most 13 
distinct sets can be constructed (see also Closure, interior, and union in finite topological spaces, Colloq. 
Math., 38 (1977) 41-51, by L. E. Moser). Kuratowski showed that at most 14 distinct sets can be 
constructed using {closure, interior, complement} (see Sur l’opération A de l’analyses situs, Fund. Math., 
3 (1922) 182-199. 


Also solved by M. Bowron, J. Hook, O. P. Lossers (Netherlands), O. Matou’ (Czechoslovakia), W. D. 
McIntosh, L. F. Meyers, R. Patenaude, J. P. Robertson, and the proposer. 
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An Integral of Cosines 


E 3145 [1986, 299]. Proposed by Clinton J. Kolaski, University of Minnesota, 
Duluth. 


Show that 


X= 7 


7 COS NX — COS ny sin ny 
| ————4 (n = 0,1,2,...). 
0 COSx — COS y sin y 


Solution I by W. O. Egerland and C. E. Hansen, University of Baltimore. Put 


cos nx — cos ny 


f(x,y) = 


cos x — cos y 


for n = 0,1,2,... and (x, y) € R*. Since there is a polynomial T, of degree n such 
that cos n@ = T, (cos @), it follows that f,(x, y) may be expressed as a polynomial 
in cos x and cos y of degree n — 1. Clearly fo(x, y) = 0, f(x, vy) = 1, L£(% y) = 
2cos y + 2cos x. The addition formula for the cosine immediately gives 


’ 2 cos nx cos x — 2. cos ny cos y 

+ , SS 

frail, y) + fruits y cos x — cos y (1) 
= 2cos nx + 2f,(x, y)cos y 

for n = 1,2,... . In view of the identity 


sin(r + 1)y + sin(r — 1) y = 2sin rycos y, 


a Straightforward induction argument shows that the recurrence (1) and the above 
initial values of f(x, y) imply the following explicit formula 


sin n nal sin(n — k 
h(x, y)=— a Sg ee ae (2) 
sin y k=1 sim y 


Thus for fixed y we have the indefinite integral 


sin 1 "~1l sin kx sin(n —k 
[hos y) de = x +2 sn) 


sin y poy. CK sin y 


The result of the problem follows. 


Solution IIT by Kwang Kyu Park, Korea Advanced Institute of Science and 
Technology, Seoul, Korea. The following formulas are well known 


00 1 — rcosx 
y r"COS NX = 
n=0 1—2rcosx+r 


(Ir| < 1), (4) 


(po __" (\r| < 1). (5) 


l1—2rcosx+r* 1-r? 


rsin y 


(|r| < 1). (6) 


io.) 
> r"sin ny = ———————_,, 
4=0 1—2cosy+r 
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Since 
COS nx — Cos ny 
cos x — cos y 
for (x, y) © R’, the series 


o° ~=6COS NX — COS ny 
H 


r 
9 COS x — cos y 


converges uniformly in x, y provided |r| < 1. Thus, if |r| < 1, we have by (4) 
7 COS NX — COS ny 


00 
Fr pn [OMT OS 
n20 420 «OSX — COS y 


a 2 COS NX — COS ny 
= / So or" ———————__ dx 
0 


,=0.  COSX — cos y 


7 1 — rcos x 1 —rcos y dx 
-| : ene a 


—2rcosx+r* 1-—2rcosy+r*]} cos x — cos y 


7 r—r dx 
= Jj 1—2rcosy+r?1—2rcosx+r?. 
Using (5) and (6) in turn, we obtain 


7 COS NX — COS ny r—r T co sin ny 


co 
a Ma 
,-9 20 COSX — cos y 1—2rceosy+r’l1-—-r 


Comparing coefficients of r”, we get the desired formula. 
paring g 


Solution III by Kee-wai Lau, Hong Kong. Denote the integral of the problem by 
I,. Substituting z = e’* we have 


21,= 


n 


~ztz2t-(e?+e") 2z 


9 


gr COS NX — COS ny a zg he (e'"” + e"”) dz 
pom enm yy 
Cc 


_q COSX — COS y 


where c is the positive orientation of the unit circle. It follows that 


21 


n 


I 


« (z-e%\(z-e'%) 2" 
= -if > gn—kei(k-Vy y zh—to— trl yg —n dz. 
Ck=1 r=1 


The coefficient of z~' in the last integrand is 


n n 
y ellkK-lyp—i(n—k)y _ e (thy y eck. 
k=1 k=1 


By the residue theorem J, = 7 sin ny/sin y if y is not an integral multiple of 7 and 
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IT, = na(—1)™"* if y = mm. This calculation could also be carried out in terms 
of the original variable of integration x, using the orthogonality of the functions 
e'™*(n = 0, +1, £2,...) over [—7, 7] instead of the residue theorem. 


Editorial Comment. S. K. Ntouyan and Hang-Fai Yeung (Australia) each pointed out that the 
indefinite integral (3) occurs as a problem on page 205 of Joseph Edwards, A Treatise of the Integral 
Calculus, Volume 1, Macmillan, London, 1921. Edwards attributes (3) to Hermite. 

Integrating (1) (with respect to x) over [—7, 7] gives 

Pia. 7 24,cosy t+ f,_, = 90. 
Most solvers based their solutions on this recurrence formula and the initial values 7) = 0 and /, = 7, 
using either mathematical induction or the theory of second-order linear difference equations with 
constant coefficients. M. S. Klamkin, Morris Morduchow, and R. A. Struble observed that this method of 
solving the problem occurs explicitly on pages 92-93 of H. Glauert, The Elements of Aerofoil and 
Airscrew Theory, Cambridge University Press, Cambridge, 1947, and on page 80 of L. M. Milne-Thom- 
son, Theoretical Aerodynamics, Macmillan, London, 1952. 
If A is any positive real number and 0 < y < 7, the quotient 


cos Ax — cosaAy _ sin{A(x + y)/2}sinf{A(x — y)/2} 


COS X — COS y sin{(x + y)/2}sin{(x — y)/2} 
is bounded for 0 < x < a7 and thus the integral 


acosAx — cosaAy 
j= [SEEN 
0 cosx — cos y 


exists. In Siam Review 24 (1982), 83-85, Solution of Problem 81-5, J. A. Boa shows that 


msindy 2sindam 2 (-1)"sin(n + 1)y 


au n+AXQ4+1 


h = 
/. sin y sin y 


If A is rational, the series here can be summed in finite form. Cf. also Siam Review, 29 (1987) 303-305, 
Solution of Problem 86-10. 


Solved also by 35 other readers and the proposer. 


The Countable Co-countable Algebra 


E 3147 [1986, 400]. Proposed by A. Wilansky, Lehigh University, Bethlehem, PA. 


Let (X, B, 1) be a measure space such that all singletons are measurable. Let 
f(x) = p({x}). Must f be measurable? 


Solution by G. Turnwald, Mathematisches Institut der Universitat, Tiibingen, West 
Germany. The answer is No. Let X be the disjoint union of uncountable sets X, and 
X,, and let B be the o-algebra of subsets A of X such that A or X — A is countable. 
Let u(A) = |A 0 X,|, 1f A 1X; 1s finite, and otherwise u( A) = oo. Since f(x) = 1 
on X, and f(x) =0 on X,, f-*(1) and f +(0) are uncountable. Hence f is not 
measurable. 


Editorial comment. Several solvers noted that the answer is Yes if #( X) is finite, since then f(x) = 0 
except on a countable set, which implies the measurability of /f. 
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Also solved by J. Ferrer (Spain), J. A. Goldstein, E. Hertz, A. A. Jagers (Netherlands), D. H. King, R. 
Levy, J. Ling and P. G. Walsh (Canada), O. P. Lossers (Netherlands), L. A. Lucas, D. Neuenschwander 
(student, Switzerland), Oxford Running Club (University of Mississippi), V. Pambuccian (Romania), E. 
Posti (Finland), D. Ramachandran, R. H. Scissors, A. V. Stanoyevitch, J. C. Tripp, and the proposer. 


Matching Socks 


E 3148 [1986, 400]. Proposed by Rick Luttmann, Sonoma State University, 
Rohnert Park, CA. 


Let n distinct pairs of socks be put into the laundry. (It is assumed that each of 
the 2n socks has precisely one mate.) When the laundry is returned, the socks are 
drawn out one at a time. Each is matched with its mate, if the mate has previously 
been drawn. Find a formula for the expected number E(k) of pairs formed after k 
socks have been drawn. 


Solution I by K. Griinbaum and S. Pedersen, Copenhagen V., Denmark. The 
probability that the 7th and jth socks form a matching pair is 1/(2n — 1). For 
1<i<j<k, let X,,=1 if they match, else X,, = 0. Then the number of pairs 
drawn is 2412,<,<,X,,, So the expectation E(k) is E(UX,,) = LE(X,,) = 


(K)/(2n — 1). 


Solution II by Mark Bowron, Lynnwood, WA. The probability that pair i is 
present among the first k socks is (2"- 2) / ( an). Let Y, = 1 if pair 7 is present, else 


Y, = 0. Then the number of pairs present is 1”_,Y,, and the expectation is 
E(XY,) = DE(¥) = n( 2” - 2) (22 = 1k(k —1)/(2n - 1). 
k—2 k ; 


Solution IIT by Richard A. Groeneveld, Iowa State University, Ames. Let X, be 
the number of pairs contained in the first k socks. Let Y,,, = 11f the k + Ist sock 
matches a previously drawn sock, else Y,,, = 0. The conditional probability is 
Prcy,,, = 1|X,) = (k — 2X,)/Qn — k). The conditional expectation is 
E(X,4;|X,) = ECX, + Yp4.,|)X%,) = X, + (Kk - 2X,)/(2n — k), which implies 
E(k +1) = E(k) + (k — 2E(k))/Qn — k). The recurrence, with initial condition 
E(1) = 0, is easily solved to obtain E(k) = $k(k — 1)/Qn — 1). 

Editorial comments. Several readers computed the full probability distribution for the number of pairs 
present among the first k socks and then computed the expectation of that directly. Several readers 
generalized the problem to r-legged beings, for r > 2. More generally, if we have n sets of socks from 


creatures with a,,d5,...,a, legs respectively, D. E. Knuth remarked that the expected number of 
complete sets of matching socks after k socks have been drawn at random is 


E(E)/(e) 


where 5s = a, + a, + ++: +a,. Many commented that this problem illustrates the power of the linearity 
of the expectation over dependent random variables, as used in all solutions above. R. W. van der Waall 
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(Netherlands) pointed out that for even k the problem (with three solutions) appeared in Dutch in 
Nieuw Archief voor Wiskunde, Series 3, 15 (1967), 86-87 (problem 123). 
Solved by 42 readers and the proposer. Partially solved by two others. 


An Exponential Inequality 


F 3151 [1986, 401]. Proposed by Peter Ivady, Institute for Economy and Organiza- 
tion, Budapest, Hungary. 


Let x > 0, x #1, A >1and0 < B < 2 be real numbers. Prove that 
x*—1)\8 xB — J 
< A} —,— |}. 
x-] x? —] 

Solution by M. S. Klamkin, University of Alberta. Replacing x by 1/x leaves the 
inequality unchanged, so it suffices to consider only x > 1 (it is trivial for x = 0). 
Because (e7*4* — 1)/(e*4! — 1) = (e*4’/e) - (sinh Aat/sinh at), the hyperbolic 
substitution x = e*' converts the inequality to 

sinh A Bt sinh Br 

TTR SL 

sinh®) f sinh? 7 
fort >0,A>1, and2> B20. 

Equation (1) holds with equality for A = 1, so it suffices to show that the left side 
is a nondecreasing function of A, or equivalently that its logarithmic derivative with 
respect to A is non-negative, ie., (1/A) + BtcothABt — Btcoth At > 0. By multi- 


plying through by A sinh Ar- sinh Afr and using the addition formula for sinh, we 
transform this inequality into 


(1) 


sinh At sinh ABrt > ABr sinh Ar( B — 1). (2) 


Since sinh is negative for negative arguments, (2) holds for 1 > B > 0, and we 
need only consider 2 > B > 1. At B = 2, (2) reduces to (sinh At)(sinh 2At — 2At) > 
0, which follows from sinh y > y for y > 0. To establish (2) for 2 > B > 1, it 
suffices to show that the logarithmic derivative of the left side with respect to B is 
less than that of the right side. This reduces to showing 


\t 1 Nt 
———— < — + —__—__.. 
tanhABr  B  tanhaAr(B - 1) 
This follows immediately from the fact that tanh is an increasing function. 


Also solved by E. Grosswald, V. Pambuccian (Romania), R. E. Shafer, and the proposer. 


Partitioning a Collection of Infinite Sets 


E 3152 [1986, 401]. Proposed by Leopoldo Nachbin, University of Rochester, 
Rochester, NY. 


Let S be a collection of infinite sets. Consider the following partition property 
(PTP): For every X € S, infinite subsets Y, Cc X can be assigned such that 
Yy 1 Yy, = © if X41, X, © S are distinct. Prove that 
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1) S has property (PTP) if it is countable; 
2) For every uncountable cardinal number N, there is some S whose power is N, 
but which fails to have property (PTP). 


Solution by Kenneth Schilling, University of Michigan, Flint. (1) Suppose that 
S = {X,;: i & N}, where N denotes the natural numbers. Let (p;: i © N) be an 
enumeration of N x N. We now choose elements y, € U,X, recursively. If p, = 
(m,n), let y, be any element of X,, other than y), y,,..., y,_,. Finally, for m € N, 
let Yy ={y,: p; = (m,n) for some n € N}. Since p, runs through all of N x N, 
each set Y, is countably infinite, and by construction the sets Y, are disjoint for 
distinct X, € S. 

(2) Let S be any uncountable collection of infinite sets, uncountably many of 
which are subsets of N. Then if sets Yy, X € S, exist as stated in the problem (even 
if the condition “Y, infinite” is weakened to “Y, nonempty”), then U{Yy: 
X CN, X © S} must be uncountable. Since this set is a subset of N, this is 
impossible. 

Editorial comment. Problems like (1) are discussed in great generality by P. Erdés, F. Galvin, and R. 
Rado in “Transversals and multitransversals”, J. London Math. Soc. (2) 20 (1979) 387-395. See also W. 
Sierpinski, Cardinal and Ordinal Numbers, p. 459. Assertion (2) is equivalent to a restatement of the 


theorem on p. 95 of Paul Alexandroff et Paul Urysohn, “Mémoire sur les espaces topologiques 
compacts,” Verhandelingen Kon. Adad. Wetensch. Amsterdam, 14 (1929) No. 1. 


Also solved by R. E. Bernstein, R. Gilmer, S. Gudder and J. Hagler, Humboldt State Univ. Problem 
Group, T. Jager, R. Levy, O. P. Lossers (Netherlands), O. MatouS (Czechoslovakia), E. Mendelson, Univ. 
of Newcastle Problem Solving Class (Australia), V. Pambuccian (Romania), A. K. Wayman and L. Janos, 
and the referee. 
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6570. Proposed by L. A. Rubel, University of Illinois at Urbana-Champaign. 


(a) Let (z,) and (z/) be sequences in C, neither with a finite limit point. Assume 
that if a complex number w occurs exactly k times in (z,) where k > 1, then it 
occurs exactly k — 1 times in (z’). (Subject to this restriction, we allow finite or 
even empty sequences.) Show that there exists an entire function f such that the 
zeros of f are exactly at the points of (z,) and the zeros of f’ are exactly at the 
points of (z/), with the proper multiplicity in each case. 

(b) Can one similarly prescribe three sequences (z,,),(z;,),(z,’) with correspond- 
ing assertions about the zeros of f, f’, and f’”? 


6571. Proposed by Glenn Ierley, Michigan Technological University, Houghton, MI. 


(a) Let A(n) be the maximum area of a polygon with n sides of lengths 
1,2,...,, where n > 4. It is known that the maximum area occurs for a polygon 
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inscribed in a circle. (Cf. G. Polya, Mathematics and Plausible Reasoning, Volume 1, 
Princeton, 1954, pp. 174-177.) Let B(n) be the area of a regular polygon with n 
sides and perimeter 1 + 2 + --- +n. Prove that 


(b) For 1/2 < q < 1 let A(q, n) be the maximum area of a polygon with n sides 
of lengths 1, g, q’,...,q"', respectively, where n is large enough so that g + q° 
+--+ +q""!> 1. Let B(q,n) be the area of a regular polygon with n sides and 
perimeter 1+ 9+ q?+--: +9" '. Prove that c(q) = lim, ,,,A(q, n)/B(q, n) 


exists and find 


_ 1-c(q) 
lim ———. 
q>i- (1-4) 


SOLUTIONS OF ADVANCED PROBLEMS 
Persistence of a Distribution Function 


6522 [1986, 485]. Proposed by Gunnar Blom, University of Lund and Lund 
Institute of Technology, Sweden. 


Let X,, X,,... be an infinite sequence of independent random variables with the 
common continuous distribution function F. Let X,, be the first variable that is less 
than exactly one of all its predecessors X,,..., X,_,. Determine the distribution 
function of Xy. 


Solution by Robert B. Israel, University of British Columbia, Vancouver, BC, 
Canada. The distribution function of X, is F. In fact, for any positive integer m 
this statement is true if “exactly one” is replaced by “exactly m”’. 

For any k > m, the probability that X, is less than exactly m of its predecessors 
is 1/k (since X, is equally likely to be the first, second,..., Ath order statistic). 
Note that this is independent of the ordering of X,,..., X,_, among themselves. 
Thus the probability that X, is the first one less than exactly m of its predecessors 


1S 


1 
mt+i1 


1 
 mt+2 


pore xy) =(04 fej 


Since the set of values of a given set of i.i.d. random variables is independent of 
the order in which they occur, the conditional distribution of X, given Xy = X, 18 
the distribution of the (kK — m)th order statistic for X,,..., X,. Namely, for any x, 
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if P(X, < x) = p, and q = 1 — p, then 


(k _ 
P(Xy < x|Xyv=X,) = dL vila /q’ 
J=9 


so that 
[o@) m 4a) k 
P(Xy<x)= ) —— ~) | |p 
" k=m+1 k(k a 1) j=0 J 
Write this as R,, in order to make the dependence on m explicit. We shall prove by 
induction on m that R,, = p. Note that the series converges absolutely for 0 < p < 1. 
First consider the case m = 1 (which is the problem as stated). We have 


R, pk + kp*~'q) 


00 1 
Lu inka! 


k=2 


where the sum over j comprises the terms for k = m that are present in R,,_, but 
not in R,, and the sum over k comprises the terms for 7 = m that are present in 


R,, but not R,,_,. Now 


m —Jryf m m m 
yy ala ‘gi/=(p+q) —9q"=1-4q 
j=0 

while 
0 m o (i+ m-— 2)! 
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which converges for |p| < 1; for p = 1 the formula is trivial.) Thus 
R m 1 a 1 

— _— _ _ m + 

me me Fy 


(q-q")=p 


as required. 

The above generalization of 6522 was also proved by Barthel W. Huff, Eugene 
Salamin, and Glenn A. Stoops. Both Marcel F. Neuts and the proposer remark that 
it is sufficient to establish the result for the case in which F is the uniform 
distribution on (0,1). The proposer (who also provided a solution for m = 1 based 
on order statistics) used this to provide a noncalculational argument, based on the 
notion of “records,” for the truth of the result. He adds that the result exists in the 
literature on “records” and is implicit in a paper of Charles M. Goldie and L. C. G. 
Rogers, The k-record Processes are i1d., Z. fiir Wahrscheinlichkeitstheorie, 67 
(1984) 197-211. 

Neuts added the following remark to his solution. “This result is quite remark- 
able. It shows that the distribution of the first near-record X,, is the same as that of 
the underlying random variables. This would be very difficult to infer, for example, 
from simulation runs. Because of the heavy tail of the distribution of N, the 
empirical distribution of X,, over many replicated runs may be expected to converge 
only very slowly to F.” 


Also solved by Thomas N. Delmer, Ellen Hertz, James M. Meehan, G. S. Rogers, Kenneth Schilling, 
David G. Weinman, Western Maryland College Problems Group, and Douglas P. Wiens (Canada). 
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Combinatorics Theory and Applications. By V. Krishnamurthy. East-West Press PVT 
LTD, 1985. vi + 483 pp. $84.50. 


Applied Combinatorics. By Alan Tucker. John Wiley & Sons, Inc. 1984. v + 447 pp. 
$31.95 


Combinatorics for Computer Science. By S. Gill Williamson. Computer Science 
Press, 1985. vi + 479 pp. 


Constructive Combinatorics. By Dennis Stanton and Dennis White. Springer-Verlag, 
New York, 1986. x + 183 pp. $19.80 


ALBERT NIJENHUIS 
Department of Mathematics, University of Pennsylvania, Philadelphia PA 19104 


The last few years have seen the appearance of a handful of undergraduate texts 
in combinatorics. They reflect a renewed interest in what is beyond doubt one of the 
oldest branches of mathematics. Pressed by questions in various parts of science, 
many colleges and universities are introducing courses that help biologists analyze 
DNA chains or trains of nerve impulses, or that help computer scientists set up 
models for hardware and software designs. Social scientists find combinatorial tools 
useful to represent relationships between the objects of their study. 

Oddly enough, while combinatorics ties in well with probability theory and 
Statistics, its relationship with other branches of “pure” mathematics often seems to 
be limited to binomial coefficients and an occasional Stirling number. In any case, 
the interest in teaching combinatorics in many mathematics departments has so far 
been disappointing. 

Still, a strong case can be made for the teaching of combinatorics. First of all, it 
is a fun subject. Students love it because it is something they really can sink their 
teeth into. And teachers need not fear: it is an easy field to get into. I recently 
taught one semester of combinatorics from the Tucker book listed above, and I 
would call it a success. There are plenty of problems to do, and they are not like the 
usual calculus problems: find a similar worked-out example and plug in new 
numbers. Here the logical structure of the subject is so simple that students actually 
understand what they are doing. After two years of rote calculus, that is a welcome 
change. This is the first time that they have again a sense of control: concrete 
constructs and non-trivial relationships. 

Let me contrast this with an abstract algebra course I taught. We did groups and 
rings, after linear algebra. Still, mildly speaking, it was a great disappointment to me 
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and my colleagues. The difficulty is not in the logical complication (what is simpler 
than a group?), but in the incredible generality of the group concept. In such 
circumstances the simplicity of the proofs is of no help: it re-inforces the feeling that 
the subject is just thin air. To make it more concrete, you have to do lots of 
examples: permutations, Rubik’s cube, and what-have-you. But time does not 
permit that. So, we deal with only a tiny handful of examples. An aggressive 
student, after having seen the cyclic group of order p, may look for a non-abelian 
group of order p*. Too bad for the time spent; later he learns there aren’t any. And 
chances are he never gets to see a non-abelian group of order p°, though he does 
learn theorems about them. He also learns about centers of groups, but how many 
centers does he get to see? —I could go on. 

I believe that combinatorics is not only a field rich in applications in other fields; 
it is also a good preparation for abstract algebra. At the very least, permutation 
groups have lots of combinatorial applications. Combinatorics gives you some 
objects that are worth permuting. And you get to see Burnside’s Lemma, and 
perhaps the counting theorem of Polya. By contrast, a number of algebra books I 
checked don’t believe in having groups act on sets of real things, and don’t even 
mention the counting theorems. Combinatorics also provides a rich variety of 
mappings. Some are isomorphisms of structures that students can touch with their 
own hands. Calculus certainly did not provide any examples of homomorphisms, 
but combinatorics does. The canonical map from a thin-air group to an even 
thinner-air factor group is a cruel early example of a structure-preserving map. 

The topics that are covered by the numerous books on combinatorics, including 
the four listed above, differ considerably. This is not just due to the differing aims 
of the books, but just as much to the absence of a tradition in a subject that is so 
new to the curriculum. Elementary texts usually cover the following topics, to 
varying degrees. 

Sets and relations: some try to make it a big subject, while others consider it 
more a matter of setting up decent notation. 

Induction should be included, and usually is. This is one of the ideas least 
understood by immature students. All know vaguely that it takes the place of the 
“ ..” 1n proofs, but only well-designed exercises illustrate its subtleties. 

Logic is included in some, but often omitted. It is very hard to produce a body of 
theory that is truly relevant, but some logical notation is very useful. 

Counting is one of the most important topics; in fact so much so that doing just 
that would make a fairly decent course. It certainly is a very good place to start a 
course and get the students to work. 

There are a few refinements of counting. One is listing the objects you are 
counting. But usually there are so many that you are forced to develop algorithms, 
and let a computer do the work. 

Another refinement of counting is ranking and unranking. What is the place in a 
listing that a particular object would have? (Determine it without doing the actual 
listing!) And conversely, given an integer, what is the object in your list at that 
address? 

Counting techniques include bijections. When you find out, by a nonconstructive 
method, that two interesting sets have the same number of elements, can you 
produce an intelligent bijection? If you do, chances are you have learned something 
worthwhile. 
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The result of a counting is not always a neat little formula. Sometimes you get a 
recurrence relation, perhaps similar to the formulas in a Pascal triangle, which can 
then be further analyzed, either exactly or only asymptotically. Generating func- 
tions, usually formal power series, play an essential role and are fun to work with. 
Inclusion-exclusion is another instructive counting method. 

Graph theory is a very large subject. Opinions differ as to whether all of it 
belongs to combinatorics, or only to the larger area of discrete mathematics. There 
are so many different approaches, ranging from graph colorings and imbeddings to 
network flows and sophisticated notions of connectivity. Social scientists, computer 
scientists, and biologists are likely to ask different questions that have little more in 
common than the definition of a graph. 

Tableaux, coding theory, finite state machines and the analysis of algorithms are 
just a few more topics that one finds in some of the texts. One can debate whether 
they are central to the subject, though each is very important at a more advanced 
level. 

Now a few words about the books listed at the beginning, which were used as a 
springboard for the general comments. 

The text by Krishnamurthy is rather fast moving, and at a graduate level. It does 
start from the beginning, whatever that is, and expects considerable maturity. Much 
of the space is devoted to collections of exercises. It provides a useful background 
for anyone teaching combinatorics. It contains some chapters on Schur functions 
and the characters of the symmetric group. 

The book by Stanton and White expects some general background in combina- 
torics, and deals in depth with a small number of topics: listings, partially ordered 
sets, bijections, and involutions. Its presentation is lively, and includes some 
algorithms that actually prove theorems. Its approach is hardly traditional, and 
brings lots of new ideas. I would recommend that readers have a computer at their 
side, to test some of their own impulses. 

Tucker’s book is truly for beginners, at the freshman or sophomore level. It gives 
very few theorems, but lots of exercises. I did Part II (enumeration) before Part I 
(graphs, etc.). The level is just a bit below what one would choose for slightly older 
(3rd or 4th year) students because of the few theorems. No doubt the authors 
deliberately mentioned matchings but no matching theorems, network flows but not 
the max-flow-min-cut theorem, etc. 

Williamson’s book contains a very large number of topics, specifically aimed at 
computer scientists. Its elaborate table of contents helps a reader in search of a 
definition or theorem. But if you decide to use this book in a course, you will have 
to provide additional background. For example, it starts with sets, but does not 
define cartesian products. It does define relations quite formally, but does not 
define functions as special kinds of relations. The word “induction” does not appear 
in the index, nor is it explained. It is used extensively, of course. An early exercise 
asks for a “‘careful proof,” but at that point the author has not yet given any such 
proofs, thus keeping the reader guessing as to what that means. And the instructor 
_will also have to supply definitions for “tree” and “rooted tree,” since the book 
only deals with a special case. Similarly, “tiling of a board” and “backtracking” will 
have to be defined by the teacher. These are just a few examples. 

Fortunately, there is an ample choice of books on combinatorics, at virtually all 
levels. The reader should have no difficulty locating a suitable one. 
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Applied and Computational Complex Analysis. By Peter Henrici. Wiley-Interscience, 
1986. vil 637 pp. 


Louis AUSLANDER 
Department of Mathematics, The Graduate School and University Center, 
The City University of New York, New York, NY 10036 


This book is not a “textbook” in the ordinary sense of the word. The usual 
textbook is something that is trying to be all things to all people. The late Professor 
Henrici has written a book that is the distillation of his life’s work as a scholar and 
research mathematician. The book is a lively and lovely piece of work and, like any 
truly interesting creation, it is full of tensions and contradictions. 

The fact that there is a section on digital signal processing, in a book whose last 
chapter contains a proof of the Bieberbach Conjecture, may suggest that the author 
has made a rather eclectic choice of topics, but the flow of ideas is very orderly. The 
presentation of material is rigorous, but the author avoids overburdening the reader 
by choosing hypotheses for his results that are weak enough for many practical 
problems, yet strong enough to yield accessible proofs. This is strikingly demon- 
strated in his discussion of Cauchy integrals in section 14.1 and the results of 
Calderon discussed in the notes at the end of the section. There are many existential 
results, but care is taken to ensure effective computation methods. For instance, in 
Professor Henrici’s discussion in § 14.6, entitled “Cauchy Integrals on Straight Line 
Segments,” he arrives at Theorem 14.6a. This is immediately followed by the 
statement that “Theorem 14.6a does not express what from a numerical point of 
view may be its most significant aspect.” Professor Henrici then reformulates 
Theorem 14.6a as Algorithm 14.6b, which provides a numerical method for carrying 
out a desired calculation. 

In the Introduction, he states: 


Authors who primarily write for professional mathematicians may cultivate a style where a 
large number of facts are presented as concisely and economically as possible. However, the 
present work is not directed exclusively, and perhaps not even primarily, toward such mathemati- 
cians. A lifelong career in teaching this kind of reader has convinced me that, however great their 
appreciation for the logical coherence of the subject, their even greater concern is why they should 
be interested in it. Thus, time and again, I have allotted valuable space to the task of motivating 
what is ahead. Moreover, whenever facts or “theorems” are stated—and there are plenty of these 
—JI have endeavored to find formulations that in their essence are intelligible also to readers who 
did not memorize all the preceding definitions. If I am accused of wordiness and of being, on 
occasion, repetitive, this is the price I must pay for attempting to reach a larger audience. 


The final contradiction is that this book is, in my opinion, admirably suited for 
educating mathematicians. 


An Outline of Set Theory. By James M. Henle. Problem Books in Mathematics, 
Springer-Verlag, New York, 1986, viii + 145 pp. 


C. SMORYNSKI 
Mathematics Institute, Rijksuniversiteit te Utrecht, 3508 TA Utrecht, Netherlands 


That there is something wrong with the present teaching of mathematics is 
indisputable. Controversy arises when one tries to pinpoint exactly what is wrong, 
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and it intensifies when blame is apportioned or panaceas are offered. However, 
there are points upon which agreement can be had—particularly if the points are 
sufficiently blunted and they are not fingers of blame. 

What can we agree on? Mathematical education in its broadest form is a failure: 
the average educated person is unaware that mathematics is still alive, that it is not 
the case that all mathematical discoveries were made long ago. At the 1986 
International Congress of Mathematicians, the winners of the Fields Medals and the 
Nevanlinna Prize were asked by reporters about the use of the computer in their 
work, never imagining that the computer is a businessman’s rather than a mathema- 
tician’s tool (even though they correctly perceived what some mathematicians have 
not, that the computer is mathematical in nature). This ignorance is a matter of 
miseducation. Partly it is caused by the ignorance of the teachers (high school 
mathematics teachers generally receive their degrees in education and learn less 
mathematics than mathematics majors, usually very little modern mathematics), and 
partly by the textbooks: it has often been remarked how textbooks in the sciences 
include some history, while college calculus books, generally the only mathematics 
texts to include any history, only have a few biographical remarks ranging in length 
from footnotes to half-page paragraphs. 

The mathematical training of mathematicians and engineers could also be 
improved. Twenty years ago a course in the calculus supplied the equivalent of 
mathematical maturity, the appropriate prerequisite to upper division course work. 
Since then, the calculus textbooks have been diluted by the elimination of proofs, 
the labelling as optional of all conceptual material, and the encouragement of 
passivity on the part of the student by providing ready-made reviews with boxed 
formulae. Linear algebra is now supposed to provide the bridge between the 
calculus and higher mathematics, but I see linear algebra texts (as well as texts on 
differential equations) being produced by the same visionless authors with the same 
panoply of brain-softening “aids” to students—the cancer is spreading. 

Progressing yet farther away from any point of common agreement, I will even 
say that there is something wrong with education in higher mathematics. The 
“ Definition- Theorem-Proof” style of textbook writing is the most efficient means of 
transmitting a large quantity of information and it should not be lightly discarded. 
In a classroom, the dead facts can be brought to life by an informed instructor. But 
what happens if the instructor himself was textbook trained? He will not be able to 
bring the artifacts to life, because they are as much artifacts to him as they are to his 
students. And what will the textbooks such an instructor writes look like? 

How do we breathe new life into the teaching of mathematics? There are two 
proposals crystalizing in the pedagogic vapours. First there is Harold Edwards’ 
exhortation to “Read the Masters” and his guidebooks for those wishing to do so. 
At the other extreme is the philosophy of letting the students “discover” for 
themselves, a philosophy inherent in Springer-Verlag’s new series Problem Books in 
Mathematics. Both proposals work, but I am not equally satisfied with their 
workings. Whereas I can sing lofty paeans to Edwards and his achievements, I can 
only caution against putting too much faith in problems courses and the readily 
misapplied Moore method: For, the life these latter breathe into the subject is not 
the real thing, but an artificial life like Frankenstein’s monster and, however much 
sympathy the monster may evoke, it is yet a teratological creation lacking a soul. 
James Henle’s An Outline of Set Theory exemplifies this poverty of spirit. 
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Consider the actual historical development of set theory. Working on the 
uniqueness of Fourier series, Georg Cantor obtained his results first for series 
convergent at all points, then for series convergent at all but finitely many 
exceptional points, and finally for series with infinitely many nicely distributed 
exceptional points. The introduction of ordinal numbers was necessary for a 
definition of “nicely distributed” and Cantor’s research changed directions. For this 
he drew fire from Leopold Kronecker. 

The seemingly tiny fact that the cardinality of the real plane is the same as that 
of the real line is not so tiny when one considers it in its historical context: to 
Cantor, one-dimensional and two-dimensional space were two distinct objects and, 
despite his attempt to prove their dissimilarity, he actually proved them to be 
virtually the same! In a letter to Richard Dedekind about his construction of a 
one-one correspondence between the unit interval and the unit square, Cantor wrote 
“T see it, but I don’t believe it.”” Dedekind replied that dimensionality might still be 
a genuine concept: he didn’t think the one-one correspondence could be continuous. 
This was in 1874; in 1890 Peano showed that the unit square was a continuous 
image of the unit interval. Perhaps the map could be one-one as well. It was not 
until 1909 that L. E. J. Brouwer proved the general invariance of dimension. 

Set theory explicitly assumed a foundational role around the turn of the century 
when Gottlob Frege decided to base all of mathematics on it. Cantor’s theories of 
ordinal and cardinal numbers subsumed the arithmetic of the natural numbers, and 
both Cantor and Dedekind had set-theoretic constructions of the real numbers from 
there. Unfortunately, Frege’s axiomatic system was flawed—as Bertrand Russell 
noted. Avoiding the paradoxes was something that worried the philosophers, but it 
had little or no effect on mathematics or the development of set theory. Ernst 
Zermelo’s axiomatization in 1908 accompanied his second proof of the Well-Order- 
ing Theorem from the Axiom of Choice. Zermelo’s first proof had met with much 
criticism and he supplied his new proof with an explicit list of axioms used so that it 
may be more convincing. About twenty years later, he offered the world the 
cumulative hierarchy of sets as an intuitive model of these axioms—actually, of an 
improved set of axioms resulting from some fine tuning by Abraham Fraenkel and 
Thoralf Skolem. . 

This minihistory of set theory falls a bit short of completeness, but, inadequate as 
it is, it does indicate the dynamic life of the subject and the excitement that could be 
conveyed to a student with an historically based text, or, at least, a scholarly written 
one. It hints at what the potential mathematicians among the students may 
experience in their own lifetimes: the unexpected discovery of something interesting, 
the incredible surprises and complete reversals of one’s beliefs or intuitions, the 
timespan in the solution of problems, the subsumption of one programme by 
another, and the slow crystalization of concepts. The mention of Kronecker and the 
criticism of Zermelo’s original proof also bring the human element into the story, 
both in its destructive and constructive aspects—two aspects that are ever at hand. 

Now consider the static picture offered by the problems text. Henle has tried to 
go beyond the “Definition-Theorem-Problem” format by prefacing the work with a 
35 page introduction including history, philosophy, a statement of intent, and 
discussions of his method and bias. He also sprinkles occasional two or three 
sentence remarks of a historical or philosophical nature throughout his book. The 
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picture he paints can be glimpsed from a few quotes from his history: 

Mathematics is a living creature, growing as occasions demand and circumstances permit. Every 
now and then it must pause to organize and reflect on what it is and where it comes from. This 
happened in the sixth century B.C. when Euclid thought he had derived most of the mathematical 
results known at the time from five postulates. By the end of the nineteenth century, it was ready 
to happen again... In searching for underlying principles, mathematicians were led naturally to 
sets... [mention of Russell’s paradox]... Over the decades following the discovery of such 
problems, a collection of principles or axioms was formed which appeared (and still appears) to 
avoid paradoxes. The system is called Zermelo-Fraenkel set theory or ZF after its originators, 
Ernst Zermelo and Abraham Fraenkel. In addition to occupying a strategic location in mathe- 
matics, ZF is studied for itself by a growing number of mathematicians. The father of modern set 
theory was Georg Cantor. 


The story outlined (I have not omitted anything central) is wholly false, but that is 
hardly the issue—one of the first corollaries of the general floccinaucinihilipilifica- 
tion of scholarship in mathematics is the irrelevance of history. What is at issue 1s 
the one-dimensionality and simplicity (simplistic-icity?) of the artificial life created. 
Real life—healthy life—is rich in its complexity; artificial life—like Rabbi Loeb’s 
golem or Henle’s history—is unhealthy in its simplicity. 

If one concomitant of the problem solver’s perspective is the attitude that history 
doesn’t matter—any story will do (but, the simpler the better)—another is the 
attitude that the results themselves are not important. Like the fisherman who 
doesn’t eat fish, the mathematical problem solver is only out for the hunt. And, the 
Great White Hunter leading the safari selects the game suitable for his clients, who 
will see but a little of the jungle—or, perhaps, only the savannah and none of the 
jungle. This is exactly what happens in the problems course, as is illustrated by An 
Outline of Set Theory: nothing is taken very far. Dedekind’s construction of the reals 
stops with the definition of 1/r, thus almost but not quite proving the reals to be a 
field. The treatment of ordinal exponentiation is given in connexion with the 
Goodstein-Kirby-Paris Theorem, a recent “combinatorial independence result” 
closely tied to the Cantor Normal Form for ordinals less than e9. Neither e, nor the 
normal form are mentioned. Cardinal arithmetic is omitted from the discussion of 
cardinals. Etc. 

Other than to suggest that he be raked over the coals, or at least flogged, for the 
sentence about Euclid, I do not want to attack Prof. Henle for having written an 
unscholarly book. In this he has been thoroughly professional. I wish but to attack 
this highly professional perspective of mathematics as mere problem solving. 
Although some may attribute the lifelessness and shallowness of his book to his 
dubious goal of running a problem solving course for average students, I think the 
blame lies entirely in the too narrow perspective. An Outline of Set Theory is a vivid 
illustration of the need for something more: To instill some life into our mathe- 
matics texts, we also need some scholarship. 
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Mathematics Appreciation, T(13: 1). Mathe- 
matics in Daily Life: Making Decisions and Solving 
Problems. Joanne Simpson Growney. McGraw-Hill, 
1986, xxiii + 453 pp, $33.95. (ISBN: 0-07-025015-4] 
Covers the use of elementary mathematics, mostly 
arithmetic, in problem solving and decision making. 
Does not emphasize specific mathematical topics. In- 
tended for use in college or university “general educa- 
tion” mathematics courses. Could also be of interest 
to elementary and secondary school teachers. Alge- 
brais a helpful, but not necessary, prerequisite. Very 
easy to read. Includes many exercises varying in level 


of difficulty. RH 


Education, P, L. Implementation Handbook for 
the Comprehensive Mathematics Program. Man- 
ford Byrd, Jr. (Board of Education of the City of 
Chicago, 1819 W. Pershing Rd., Chicago, IL 60609), 
1987, (P). Kindergarten-Grade 3, vill + 154 pp; 
Grades 4-6, vill + 191 pp; Grades 7-8, villi + 138 
pp. A detailed set of objectives and examples for 
each topic to be covered in each reporting period in 
grades K-8 in the Chicago public schools. Includes 
creative calculator examples, as well as geometry, 
data analysis, measurement, and applications in each 
grade. LAS 


History, P, L**. Reminiscences About a Great 
Physicist: Paul Adrien Maurice Dirac. Ed: Behram 
N. Kursunoglu, Eugene P. Wigner. Cambridge U 
Pr, 1987, xviii + 297 pp, $49.50. (ISBN: 0-521- 
34013-6] Two dozen reflections by close friends of 
Dirac on both scientific and personal associations. 
Contributors include his widow Margit Dirac, her 
brother Eugene Wigner, P.A.M. Dirac himself (on 
the inadequacies of quantum field theory), and many 
world-famous physicists such as Harish-Chandra, 
Fred Hoyle, and Abdus Salam. LAS 


History, $(17), P*. The Historical Development 
of Quantum Theory, Volume 5: Erwin Schrodinger 
and the Rise of Wave Mechanics, Part 2: The Cre- 


ation of Wave Mechanics; Early Response and Appli- 
cations 1925-1926. Jagdish Mehra, Helmut Rechen- 
berg. Springer-Verlag, 1987, ix + 613 pp, $79.95. 
(ISBN: 0-387-96377-4] Wave mechanics began when 
Schrodinger showed in his fundamental papers of 
1926 that the quantum states of the hydrogen atom 
could be represented as an eigenvalue problem for 
an appropriate operator. This volume reconstructs 
the scientific attitude of this revolutionary period. 
Schrodinger’s own thoughts and motivation have al- 
ways been obscure due to his lack of correspondence. 
The authors attempt to address this by presenting in 
detail the contributions of de Broglie, Einstein, and 
others along with Schrédinger’s own work. (Part 1, 
TR, January 1988.) MR 


Foundations, T**(14: 1), S**, L*. Bridge to Ab- 
stract Mathematics: Mathematical Proof and Struc- 
tures. Ronald P. Morash. Math. Ser. Random 
House, 1987, xiii + 395 pp, $24. [ISBN: 0-394-35429- 
X] A text for a course which bridges the gap be- 
tween beginning courses like calculus and more ad- 
vanced courses in which students deal seriously with 
mathematical proofs. Topics include sets, logic, ap- 
plications of logic, methods of mathematical proof, 
relations, functions, and the construction of number 
systems. Readable with lots of exercises of varying 
difficulty and solutions to selected exercises. CEC 


Foundations, P, L. Particles and Paradozes: The 
Limits of Quantum Logic. Peter Gibbins. Cam- 
bridge U Pr, 1987, xi + 181 pp, $34.50; $11.95 (P). 
[ISBN: 0-521-33498-5; 0-521-33691-0] An exposition 
and exposé of attempts to intepret the paradoxes of 
quantum mechanics (e.g., the twin slit experiment) 
via quantum logic. The author holds that quantum 
logic—indeed, any logic—is inadequate to the task of 
providing a foundation for quantum mechanics. “We 
are left with a mystery” of “just how odd the physical 
world must be.” LAS 


Foundations, T(17-18), S, P, L. Varieties of Con- 
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structive Mathematics. Douglas Bridges, Fred Rich- 
man. London Math. Soc. Lect. Note Ser., V. 
97. Cambridge U Pr, 1987, x + 149 pp, $19.95 (P). 
[ISBN: 0-521-31802-5] An introduction to the spirit 
and practice of modern constructive mathematics, 
requiring a minimal background in philosophy and 
formal logic. The book introduces three varieties 
of constructive mathematics: Bishop’s constructive 
mathematics, Brouwer’s intuitionistic mathematics, 
and the constructive recursive mathematics of the 
Russian school of Markov. LCL 


Combinatorics, T(17-18), P, L. Combinatorial 
Geometries. Ed: Neil White. Eucycl. of Math. & 
Its Applic., V. 29. Cambridge U Pr, 1987, xii + 212 
pp, $39.50. [ISBN: 0-521-33339-3] This is volume 
two in a three-volume series on matroid theory. Top- 
ics covered include coordinatizations, matching the- 
ory, transversed and simplicial matroids, the Mobius 
function and combinatorial optimization. Includes 
exercises. LC 


Number Theory, T*(16-18: 1), S, P, L*. A 
Course in Number Theory and Cryptography. Neal 
Koblitz. Grad. Texts in Math., V. 114. Springer- 
Verlag, 1987, 208 pp, $34. [ISBN: 0-387-96576-9] A 
good introduction to the elementary number theory 
and algebra (e.g., finite fields) underlying public-key 
cryptosystems, including an exposition of the ellip- 
tic curve approach. Unfortunately, the cheap fuzzy 
printing is unpleasant to look at. Springer shouldn’t 
let this happen in the GTM series. BC 


Number Theory, T**(14: 1), S, L*. Elementary 
Number Theory. Charles Vanden Eynden. Math. 
Ser. Random House, 1987, xii + 266 pp, $27. 
[ISBN: 0-394-35359-5] A well-written introduction to 
the subject. Extensive exercise sets include many 
routine problems as well as a good collection of 
theoretical problems. Lots of historical material. 
Advanced topics include public key cryptography, 
quadratic residues, continued fractions, some dio- 
phantine equations, and Pell’s equation. CEC 


Linear Algebra, T(i5-17: 1). Applied Linear 
Algebra, Third Edition. Ben Noble, James W. 
Daniel. Prentice-Hall, 1988, xvi + 521 pp. [ISBN: 
0-13-041260-0] Major changes in this edition include 
greater emphasis on triangular forms, over 500 new 
exercises, and additional material on some topics pre- 
viously discussed only in exercises (e.g., the Cayley- 
Hamilton and Perron-Frobenius theorems). Some ex- 
amples and problems using computer software for nu- 
merical linear algebra have also been added. (First 
Edition, TR, October 1969; Second Edition, TR, 
November 1978.) AO 


Linear Algebra, P. Invariant Theory and Superal- 
gebras. Frank D. Grosshans, Gian-Carlo Rota, Joel 
A. Stein. CBMS Reg. Conf. Ser. in Math., No. 
69. AMS, 1987, xxi + 80 pp, $16 (P). [ISBN: 0- 
8218-0719-6] A superalgebra is one part symmetric, 
one part exterior, and one part “divided powers.” 
The authors prove an extension of the “standard ba- 
sis theorem,” and apply it to compute invariants of 
symmetric and skew-symmetric tensors. Includes a 
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very clear and fairly long synopsis, and many exam- 
ples. BC 


Linear Algebra, T?(13: 1). <A First Course in 
Linear Algebra with Concurrent Ezamples. A.G. 
Hamilton. Cambridge U Pr, 1987, viii + 148 pp, 
$39.50; $12.95 (P). (ISBN: 0-521-32516-1; 0-521- 
31041-5] Not enough material for a standard course. 
Covers solutions to linear equations, linear depen- 
dence, determinants through the 3x3 case, and some 
three-dimensional geometry. Answers to most exer- 


cises in back. GG 


Linear Algebra, T*(14: 1, 2), L. Linear Alge- 
bra. John B. Fraleigh, Raymond A. Beauregard. 
Addison-Wesley, 1987, xv + 519 pp, $35.95. [ISBN: 
0-201-15459-5] Includes standard topics: linear sys- 
tems, vector spaces and linear transformations, de- 
terminants, eigenvalues, and orthogonality, all with 
emphasis on R". No Jordan forms. Applications in- 
clude computer solutions of linear systems and eigen- 
values, least squares, and linear programming. Also 
has chapter on calculus applications. Has answers to 
odd exercises; software to accompany text available. 
Good for course with applied flavor. GG 


Linear Algebra, T(16-17: 1), S*, L. Nonnega- 
tive Matrices and Applicable Topics in Linear Alge- 
bra. Alexander Graham. Math. & Its Applic. Hal- 
sted Pr, 1987, 264 pp, $89.95. [ISBN: 0-470-20855- 
4] Introduction for general readers, includes back- 
ground material (including aspects of graph theory; 
unitary, Hermitian, and normal matrices; positive 
definite matrices), followed by the main theme of the 
book and applications. Important concepts amply il- 
lustrated with worked examples and problems (with 
solutions). LCL 


Group Theory, P. Lecture Notes in Mathematics- 
1281: Group Theory. Ed: O.H. Kegel, F. Menegazzo, 
G. Zacher. Springer-Verlag, 1987, vii + 179 pp, 
$16.30 (P). [ISBN: 0-387-18399-X] A collection of 
19 short papers which were presented at the In- 
ternational Conference on Group Theory held at 
Brixen/Bressanone, Italy, May 25-31, 1986. The pa- 
pers consider quite a range of topics; many involve 
nilpotent groups and several concern locally finite 
groups. LW 


Group Theory, S(18), P. Unitary Representa- 
tions of Reductive [ie Groups. David A. Vogan, 
Jr. Annals of Math. Stud., No. 118. Princeton 
U Pr, 1987, x + 308 pp, $60; $19.50 (P). [ISBN: 
0-691-08481-5; 0-691-08482-3] General survey based 
loosely on January 1986 Hermann Weyl Lectures 
given at the Institute for Advanced Study. Introduc- 
tion gives excellent perspective. Some proofs omitted 
or sketched. GG 


Group Theory, P. Essays in Group Theory. Ed: 
S.M. Gersten. Math. Sci. Res. Inst., V. 8. Springer- 
Verlag, 1987, 342 pp, $32. [ISBN: 0-387-96618-8] A 
collection of five papers based on a seminar presented 
in 1985 at MSRI, Berkeley. Baumslag and Shalen ex- 
amine presentations by the classical method of rep- 
resentations; Stalling and Gersten deal with appli- 
cations of diagrams to group theory, and Gromov 
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and Shalen with hyperbolic groups and actions on R- 
trees. The papers are all well-written and free of the 
typical terseness of papers appearing in journals. LW 


Algebra, T(18: 1, 2), P. Commutator Theory for 
Congruence Modular Varieties. Ralph Freese, Ralph 
McKenzie. London Math. Soc. Lect. Note Ser., 
V. 125. Cambridge U Pr, 1987, 227 pp, $27.95 (P). 
[ISBN: 0-521-34832-3] A development of a theory of 
commutators in the setting of congruence modular 
varieties. This theory generalizes the notion of the 
commutator of a group. Includes many exercises to 
solutions and extensive bibliographical notes. SG 


Algebra, P. Noetherian Rings and Their Applica- 
tions. Ed: Lance W. Small. Math. Surv. & Mono., 
No. 24. AMS, 1987, ix + 118 pp, $38. [{ISBN: 0- 
8218-1525-3] A collection of six papers (at just over 
$6 apiece) on such topics as the Goldie rank of a mod- 
ule, semisimple Lie algebras, and Noetherian group 
tings. SG 

Algebra, P. Lecture Notes in Mathematics-1271: 
Algebraic Groups, Utrecht 1986. Ed: A.M. Cohen, 
et al. Springer-Verlag, 1987, viii + 284 pp, $24.30 
(P). [ISBN: 0-387-18234-9] Fourteen papers from the 
April 1986 symposium honoring the 350th anniver- 
sary of the University of Utrecht and T.A. Springer’s 
60th birthday. GG 


Algebra, T(14-15: 1). Rings and Factorization. 
David Sharpe. Cambridge U Pr, 1987, ix + 111 
pp, $14.95 (P); $34.50. [ISBN: 0-521-33718-6; 0-521- 
33072-6] A very readable and enjoyable introduction 
to the concepts of rings, fields, prime elements, and 
unique factorization. Assumes no background in ab- 
stract algebra. Includes examples of concrete appli- 
cations, such as factoring polynomials and Fermat’s 
two-squares theorem. Contains many exercises along 
with hints and solutions. A fun and smooth intro- 
duction to some abstract mathematical ideas. RH 


Algebra, P. Structures Paragraduées (Groupes, An- 
neauz, Modules). Marc Krasner, Mirjana Vukovic. 
Papers in Pure & Appl. Math., No. 77. Queen’s U, 
1987, 163 pp, (P). 


Algebra, T*(16: 1, 2), S, L*. Introduction to Ab- 
stract Algebra. Elbert A. Walker. Math. Ser. Ran- 
dom House, 1987, viii + 355 pp, $30. [ISBN: 0-394- 
35611-X] A sophisticated introduction to algebra for 
undergraduates. Chapter titles include sets, groups, 
vector spaces, rings and modules, linear transforma- 
tions, fields, and topics from both group and ring 
theory. Includes excellent collections of exercises, 
but few of them are routine. Plenty of material for 
a year-long course. CEC 


Algebra, P. Lecture Notes in Mathematics-1280: 
Jordan Triple Systems by the Grid Approach. Er- 
hard Neher. Springer-Verlag, 1987, xii + 193 pp, 
$20 (P). [ISBN: 0-387-18362-0] 

Real Analysis, P. Regular Variation, Eztensions 
and Tauberian Theorems. J.L. Geluk, L. de 
Haan. CWI Tract, V. 40. Math Centrum, 1987, 
132 pp, Dfl. 20.30 (P). [ISBN: 90-6196-324-9] A 
self-contained introduction to the theory of regular 
variation and its main extensions. It is shown that 
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regularly varying functions are a natural setting for 
Tauberian theorems of the Laplace type. Also in- 
cludes some results for general kernel transforms. In- 
cludes references. CEC 


Complex Analysis, S(18), P. Lectures on Coun- 
terezamples in Several Complez Variables. John Erik 
Forness, Berit Stensgnes. Math. Notes 33. Prince- 
ton U Pr, 1987, 247 pp, $22.50 (P). [ISBN: 0-691- 
08456-4] Lecture notes from a graduate course at 
Princeton. Begins with a brief introduction to basics 
of several complex variables. Topics include coun- 
terexamples to smoothing of plurisubharmonic func- 
tions, CR-manifolds, Stein neighborhood basis, peak 
sets, inner functions, Runge exhaustion. Presented 
in a Clear, but somewhat informal style. No prob- 
lems. BH 


Complex Analysis, P. Lecture Notes in Mathemat- 
tcs-1275, 1276, 1277: Complez Analysis. Ed: Car- 
los A. Berenstein. Springer-Verlag, 1987, (P). J, xv 
+ 331 pp, $28.60, [ISBN: 0-387-18356-6]; JJ, ix + 
320 pp, $28.60, [ISBN: 0-387-18357-4]; IJ, x + 350 
pp, $32.90. [ISBN: 0-387-18355-8] Proceedings of the 
sixteenth Special Year at the University of Maryland 
from July 1985 to December 1986. A collection of 
54 papers on topics from complex analysis, including 
both survey articles and new results. RH 


Complex Analysis, T*(16-17: 2, 3), L. Complez 
Functions: An Algebraic and Geometric Viewpoint. 
Gareth A. Jones, David Singerman. Cambridge U 
Pr, 1987, xiv + 342 pp, $59.50; $16.95 (P). [ISBN: 
0-521-30893-3; 0-521-31366-X] Presents “main ideas 
about complex functions and Riemann surfaces” as- 
suming only fundamentals of abstract algebra, real 
and complex analysis, and topology. Topics are Rie- 
mann sphere, Mobius transformations, elliptic func- 
tions, analytic continuation and Riemann surfaces, 
hyperbolic geometry, and the modular group. Well 
motivated with good selection of problems at end of 
each chapter. BH 


Differential Equations, P. Perturbation Methods, 
Bifurcation Theory and Computer Algebra. Richard 
H. Rand, Dieter Armbruster. Appl. Math. Sci., V. 
65. Springer-Verlag, 1987, ix + 243 pp, $29.80 (P). 
[ISBN: 0-387-96589-0] Contains MACSYMA pro- 
grams for several popular perturbation methods as 
well as methods for studying bifurcations. A brief 
introduction to MACSYMA is provided as an ap- 
pendix. AO 


Differential Equations, P. Ordinary and Partzal 
Differential Equations. Ed: B.D. Sleeman, R.J. 
Jarvis. Longman Scientific & Technical (US Distr: 
Wiley), 1987, 216 pp, $48.95 (P). [ISBN: 0-470- 
20839-2] Proceedings of the ninth Dundee Confer- 
ence, 1986, with fourteen papers by seventeen au- 
thors; topics include coherent excitations, one- and 
two-parametric bifurcation diagrams, inverse scat- 
tering, limit cycle configurations of quadratic sys- 
tems, concentration effects in Euler equations, ho- 
moclinic orbits and chaos in delay equations, the dis- 
crete Nagumo equation, radiation of sound by a mov- 
ing sphere, fisheries stock assessment, Cahn-Hilliard 
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model of phase transition, second-order elliptic equa- 
tions, biHamiltonian systems, applications of differ- 
ential equations in biology, and fibered structures in 
optimal design. RSF 

Differential Equations, T*(16-17: 1, 2). Ap- 
plied Mathematics: A Contemporary Approach. J. 
David Logan. Wiley, 1987, xviii + 572 pp, $44.95. 
[ISBN: 0-471-85083-7] A survey text covering most of 
the traditional topics and a few nontraditional ones 
as well: dimensional analysis and scaling, perturba- 
tion methods, calculus of variations, partial differen- 
tial equations of mathematical physics, stability and 
bifurcation, similarity methods, and numerical tech- 
niques for partial differential equations. AO 


Partial Differential Equations, P. Contributions 
to Nonlinear Partial Differential Equations, Volume 
IT, Ed: J.I. Diaz, P.L. Lions. Res. Notes in Math. 
Ser., V. 155. Longman Scientific & Technical (US 
Distr: Wiley), 1987, 308 pp, $57.95 (P). [ISBN: 0- 
470-20810-4] The proceedings of the Second Franco- 
Spanish Colloquium on nonlinear partial differential 
equations held in Paris in December 1985. CEC 


Partial Differential Equations, P. Lecture Notes 
in Mathemaiics-1248: Nonlinear Semigroups, Partial 
Differential Equations and Attractors. Ed: T.L. Gill, 
W.W. Zachary. Springer-Verlag, 1987, ix + 185 pp, 
$18 (P). [ISBN: 0-387-17741-8] This volume consti- 
tutes the proceedings of the symposium on nonlin- 
ear semigroups, partial differential equations, and at- 
tractors held at Howard University in Washington, 
D.C. on August 5-8, 1985. CEC 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1241: Singularities in Linear Wave 
Propagation. Lars Garding. Springer-Verlag, 1987, 
125 pp, $13.10 (P). [ISBN: 0-387-18001-X] The aim 
of these lectures is to present the use of microlocal 
theory in the analysis of singularities in linear wave 
propagation. LCL 


Partial Differential Equations, P. Hyperbolic 
Equations. Ed: F. Colombini, M.K.V. Murthy. Res. 
Notes in Math. Ser., V. 158. Longman Scientific & 
Technical (US Distr: Wiley), 1987, 286 pp, $51.95 
(P). [ISBN: 0-470-20869-4] Proceedings of the con- 
ference on Hyperbolic Equations and Related Topics, 
University of Padova, 1985. LC 


Partial Differential Equations, S(18), P. Vis- 
cosity Solutions and Optimal Control. Robert J. El- 
liott. Res. Notes in Math. Ser., V. 165. Longman 
Scientific & Technical (US Distr: Wiley), 1987, 95 
pp, $39.95 (P). [ISBN: 0-470-20918-6] Discusses the 
concept of a viscosity solution and its relationship 
to a cost/value function of an optimal control prob- 
lem or differential game. A viscosity function is any 
solution to the Hamilton-Jacobi-Bellman differential 
equation. Topics covered include dynamic program- 
ming and differential games with alternate, overlap- 
ping play. MR 

Partial Differential Equations, P. Wave Motion: 
Theory, Modelling, and Computation. Ed: Alexan- 
dre J. Chorin, Andrew J. Majda. Math. Sci. Res. 
Instit., V. 7. Springer-Verlag, 1987, 336 pp, $32. 
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(ISBN: 0-387-96594-7] Twelve papers from the pro- 
ceedings of a conference given at the Mathemati- 
cal Sciences Research Institute, Berkeley, California, 
June 1986, in honor of the 60th birthday of Peter 
Lax. LCL 


Partial Differential Equations, P. The Homotopy 
Indez and Partial Differential Equations. Krzysztof 
P. Rybakowski. Universitext. Springer-Verlag, 1987, 
xii + 208 pp, $39.50 (P). [ISBN: 0-387-18067-2] Ho- 
motopy index theory originated on compact spaces. 
As usual, the non-compact case is problematic. The 
author gives a unified approach, with applications to 
partial differential equations and other topics (e.g., 
Morse theory). BC 


Numerical Analysis, P. Lecture Notes in Engi- 
neering-27: The Best Approzimation Method: An In- 
troduction. Th. V. Hromadka II, Ch.-Ch. Yen, G.F. 
Pinder. Springer-Verlag, 1987, xiii + 168 pp, $26.70 
(P). [ISBN: 0-387-17572-5] A generalized Fourier ex- 
pansion method for solving linear operator equa- 
tions. Mathematical background in metric spaces 
and Lebesgue integration. Application to engineer- 
ing problems including computer codes. RWN 


Numerical Analysis, P. Algorithms for Approzi- 
mation. Ed: J.C. Mason, M.G. Cox. Inst. of Math. 
& Its Applic. Conf. Ser., V. 10. Clarendon Pr, 1987, 
xvi + 694 pp, $125. [ISBN: 0-19-853612-7] 41 contri- 
butions at an international conference held July 15- 
19, 1985 at the Royal Military College of Science in 
Shrivenham, England. They concern algorithm de- 
velopment in spline approximation and smoothing, 
spline interpolation and shape preservation, multi- 
variate interpolation, least-square methods, ratio1:al 
approximation, complex and nonlinear approxima- 
tion, computer-aided design and blending; applica- 
tions in numerical analysis, partial differential equa- 
tions, other disciplines; software. DFA 


Operator Theory, S(18), P. Summing and Nu- 
clear Norms in Banach Space Theory. G.J.O. Jame- 
son. Math. Soc. Stud. Texts, V. 8. Cambridge U 
Pr, 1987, xi + 174 pp, $39.50; $13.95 (P). [ISBN: 
0-521-34134-5; 0-521-34937-0] Assumes basic knowl- 
edge of Banach and Hilbert space theory. Main top- 
ics are Pietsch’s theorem on psumming operators 
and Grothendieck’s inequality and their applications. 
Exercises and numerous examples scattered through- 
out. BH 


Functional Analysis, T(15: 1). Introduction to 
the Analysis of Metric Spaces. J.R. Giles. Aus- 
tralian Math. Soc. Lect. Ser., V. 3. Cambridge 
U Pr, 1987, xiv + 257 pp, $49.50; $16.95 (P). 
[ISBN: 0-521-35051-4; 0-521-35928-7] Follows educa- 
tional style of Dieudonné used in British universities: 
detailed study of analysis of metric spaces before in- 
troducing topological spaces. Normed linear spaces 
treated as special subfamily of metric spaces. Very 
helpful geometrically-suggestive diagrams. Numer- 
ous good exercises at the end of each section. Con- 
nectedness, Axiom of Choice not included. MS 


Functional Analysis, P. Lecture Notes in Mathe- 
matics-1267: Geometrical Aspects of Functional 
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Analysis. Ed: J. Lindenstrauss, V.D. Milman. 
Springer-Verlag, 1987, 212 pp, $20 (P). (ISBN: 0- 
387-18103-2] Proceedings of the Israel Seminar on 
geometric aspects of functional analysis held between 
October 1985 and June 1986. Central topic is convex 
sets in R” and infinite-dimensional spaces. BH 


Analysis, P. Lecture Notes in Mathematics-1257: 
On the C*-Algebras of Foliations in the Plane. Xi- 
aolu Wang. Springer-Verlag, 1987, 165 pp, $16.30 
(P). (ISBN: 0-387-17903-8] A study of C*-algebras of 
foliated open two manifolds, linking topology, graph 
theory, geometry, and analysis in an important new 
area of research. LCL 


Analysis, P. Lecture Notes in Mathematics-1265: 
Asymptotics for Orthogonal Polynomials. Walter 
Van Assche. Springer-Verlag, 1987, vi + 201 pp, 
$20 (P). (ISBN: 0-387-18023-0] A monograph which 
concentrates on the asymptotic theory of general or- 
thogonal polynomials on the real line. Includes a 
substantial list of references. CEC 


Analysis, P. A General Theory of Integration in 
Function Spaces, Including Wiener and Feynman In- 
tegration. P. Muldowney. Res. Notes in Math. Ser., 
V. 153. Longman Scientific & Technical (US Distr: 
Wiley), 1986, 115 pp, $34.95 (P). [ISBN: 0-470- 
20736-1] Begins with a brief description of Henstock’s 
general theory of integration (a simple, but powerful 
generalization of Riemann integration), then devel- 
ops theory of function space integration in more de- 
tail and applies this theory to statistical mechanics 
(Wiener integral) and quantum mechanics (Feynman 
integral), thus giving a unified treatment of these two 
integrals. BH 


Analysis, $(18), P, L. Moments in Mathematics. 
Ed: Henry J. Landau. Proc. of Symp. in Appl. 
Math., V. 37. AMS, 1987, xi + 154 pp, $30. [ISBN: 
0-8218-0114-7] Applications and ramifications of the 
classical problem of moments (when does a sequence 
of numbers represent the successive moments of a 
non-negative measure?) to geometry, Hilbert space, 
signal processing, semigroups, probability and statis- 
tics. Papers from the January 1987 AMS short 
course in San Antonio. LAS 


Analysis, P. Value Distribution Theory for Mero- 
morphic Maps. Wilhelm Stoll. Aspects of Math., 
V. E7. Friedr. Vieweg & Sohn, 1985, xi + 347 pp, 
$28 (P). [ISBN: 3-528-08906-7] Generalizes standard 
Nevanlinna theory to the following setting: f: M— 
P,, is a meromorphic map, M is a parabolic manifold 
of dimension m, and P, is n-dimensional projective 
space. Extensive bibliography. BH 


Analysis, T(16-17). Finite-Dimensitonal Spaces: 
Algebra, Geometry, and Analysis, Volume I. Wal- 
ter Noll. Mech.: Analy., V. 10. Martinus Nijhoff 
(US Distr: Kluwer Academic), 1987, xvi + 393 pp, 
$97.50. [ISBN: 90-247-3581-5] Based on a course 
originally entitled “Tensor Analysis” and then re- 
named “Multidimensional Algebra, Geometry, and 
Analysis,” the text attempts to unify these topics 
and “to present mathematics as a whole.” The pre- 
requisites for a student are courses in abstract al- 
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gebra, real analysis, and linear algebra. The top- 
ics covered include: dual, flat (affine), and inner- 
product spaces; basic point-set topology; basic dif- 
ferential calculus; spectral theory (concentrating on 
finite-dimensional spaces); and the structure of linear 
transformations. Each of the ten chapters is followed 
by a problem set. LW 


Algebraic Geometry, S(18), P. Géométrie 
algébrique réelle. J. Bochnak, M. Coste, M-F. Roy. 
Ser. of Mod. Surv. in Math., Band 12. Springer- 
Verlag, 1987, x + 373 pp, $118.30. [ISBN: 0-387- 
16951-2] If you want to make algebraic geometry re- 
ally complex, do it over R, not C’. BC 


Differential Geometry, T(18: 1), S*, P*. Te- 
ichmuller Theory and Quadratic Differentials. Fred- 
erick P. Gardiner. Wiley, 1987, xvii + 236 pp, $46.95. 
ISBN: 0-471-84539-6] Treats the moduli space of 
inequivalent Riemann surface structures on a given 
topological surface. Full proofs and exercises make 
the book self-contained for a reader with knowl- 
edge of a few topics beyond the basics of complex 
variables, differential geometry, and algebraic topol- 
ogy. GG 


Geometry, S, L**. A Budget of Trisections. Un- 
derwood Dudley. Springer-Verlag, 1987, xv + 169 
pp, $29.80. [ISBN: 0-387-96568-8] A witty, compas- 
sionate, and richly personal account of the count- 
less amateur mathematical Don Quixotes who pur- 
sue their dream of trisecting angles. Echoing De 
Morgan’s 1872 A Budget of Paradozes, Dudley de- 
scribes the trisectors and their trisections, showing 
that despite their ignorance of advanced mathemat- 
ics, trisectors are often solid citizens who through 
clever persistence produce some very good approxi- 
mations. LAS 


Geometry, T(17-18: 2, 3), S, P. Algorithms 
in Combinatorial Geometry. Herbert Edelsbrunner. 
EATCS Mono. on Theor. Comput. Sci., V. 10. 
Springer-Verlag, 1987, xv + 423 pp, $49. [ISBN: 
0-387-13722-X] Despite its title, this book offers an 
extensive introduction to combinatorial geometry, 
generally, and to its correlative field, computational 
geometry, before treating geometric algorithms and 
their applications. Exercises, research problems, and 
up-to-date bibliography. SS 

Geometry, P. Intuitive Geometry. Ed: K. 
Boréczky, G. Fejes Téth. Colloquia Mathematica So- 
cietatis Janos Bolyai, V. 48. Elsevier Science, 1987, 
708 pp, Dfl. 295. [ISBN: 0-444-879-33-1] Assortment 
of research papers on convexity, discrete and combi- 
natorial geometries, and relations between geometry 
and other fields: biology, architecture, art, and psy- 
chology. SS 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1274: Equivariant K-Theory and Freeness of 
Group Actions on C*-Algebras. N. Christopher 
Phillips. Springer-Verlag, 1987, viii + 371 pp, $32.90 
(P). [ISBN: 0-387-18277-2] Given an action of a 
compact Lie group on a compact Hausdorff space, 
an examination of the equivariant K-theory reveals 
whether this group action is free. The category of 
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compact Hausdorff spaces is (contravariantly) equiv- 
alent to the category of commutative unital C*- 
algebras, so it becomes natural to try to extend the 
above result to general C*-algebras. There being no 
completely satisfactory notion of freeness of an action 
on a C*-algebra, the author herein defines and stud- 
ies conditions on the equivariant K-theory of a C*- 
algebra, and in case the algebra is commutative, is 
able to determine through the equivariant K-theory 
whether the underlying space is free. LW 


Differential Topology, P. Lecture Notes in 
Mathematics-1264: Rational Homotopy Theory. Wu 
Wen-tstin. Springer-Verlag, 1987, viii + 219 pp, $20 
(P). [ISBN: 0-387-13611-8] Clarifying and extending 
the works of D. Sullivan, this book studies the ration- 
al homotopy type (minimal model) of a differential 
graded algebra of differential forms on a simplicial 
complex K. In particular, the book shows that the 
minimal model is homotopically invariant, and con- 
tains complete information on the cohomology and 
homotopy ring of K. The author takes a construc- 
tional approach, and is thus able to explicitly deter- 
mine the minimal model of a fiber space as well as 
that of a fiber-square-constructed space. LW 


Topology, T*(17-18), S*, P, L. Classical Tes- 
sellations and Three-Manifolds. José Maria Mon- 
tesinos. Universitext. Springer-Verlag, 1987, xvii 
+ 230 pp, $35 (P). [ISBN: 0-387-15291-1] Writ- 
ten to provide graduate students with a source 
of geometrical intuition in low-dimensional topol- 
ogy, this fascinating monograph illuminates three of 
Thurston’s eight geometries by exploring manifolds 
of tessellations in Euclidean, spherical, and hyper- 
bolic space. Profusely illustrated with mineral crys- 
tal patterns, ornamental motifs, and mathematical 
sketching. LAS 


Dynamical Systems, T(16-17), P, L. Chaotic Vi- 
brations: An Introduction for Applied Scientists and 
Engineers. Francis C. Moon. Wiley, 1987, xv + 309 
pp, $39.95. [ISBN: 0-471-85685-1] An applied view of 
chaotic phenomena, richly illustrated with physical 
systems, mathematical models, and standard exam- 
ples (Henon maps, Lorenz attractors, Poincaré maps, 
Lyapunov exponents). Little direct mathematical 
theory, but a great deal of illustrative applications 
that help to develop intuition. LAS 


Dynamical Systems, P. Attraktoreingrenzung fur 
nichtlineare Systeme. Gennadij A. Leonov, Volker 
Reitmann. Teubner-Texte zur Math., V. 97. BG 
Teubner, 1987, 196 pp, 20M (P). [ISBN: 3-322-00427- 
9] 

Dynamical Systems, P. Autowave Processes 
in Kinetic Systems: Spatial and Temporal Self- 
Organization in Physics, Chemistry, Biology, and 
Medicine. V.A. Vasiliev, et al. Math. & Its Ap- 
plic. D Reidel (US Distr: Kluwer Academic), 1987, 
262 pp, $69. [ISBN: 90-277-2379-6] From Chapter 1: 
“An autowave process is a self-sustained wave pro- 
cess in an active nonlinear medium maintaining its 
characteristics at a constant level at the expense of an 
energy source distributed in the medium.” The key 
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word is “nonlinear.” Examples include propagation 
of phase-transition fronts, Belousov-Zhabotinskii re- 
actions, and nerve cell activity. BC 


Probability, S**, L**. Fifty Challenging Problems 
in Probabihty with Solutions. Frederick Mosteller. 
Dover, 1987, villi + 88 pp, $3.95 (P). [ISBN: 0-486- 
65355-2] Unabridged republication of the work first 
published in 1965. Stimulating and challenging prob- 
lems with leisurely, instructive, conversational solu- 


tions that demonstrate mathematical thinking at its 
best. LCL 


Probability, P*. Dependence in Probability and 
Statistics: A Survey of Recent Results. Ed: Ernst 
Eberlein, Murad S. Taqqu. Prog. in Prob. & Stat., 
V.11. Birkhauser Boston, 1986, xi + 473 pp, $39.50. 
(ISBN: 0-8176-3323-5] Twenty-three papers written 
by participants in an international conference held 
in Oberwolfach in April 1985, concerned with limit- 
ing results assuming various dependence structures. 
“These papers tie together known results, describe 
the underlying ideas, summarize the state-of-the-art, 
and state some open problems.” RSK 


Stochastic Processes, P*. Stochastic Geometry 
and Its Applications. D. Stoyan, W.S. Kendall, J. 
Mecke. Prob. & Math. Stat. Wiley, 1987, 345 
pp, $49.95. [ISBN: 0-471-90519-4] Revised transla- 
tion of Stoyan and Mecke’s 1983 book Stochastische 
Geometrie. “Topics covered include the basic theo- 
ties of point processes, random sets, fibre processes, 
tessellations, stereology and the statistical theory of 
shape.” Extensive bibliography of this relatively new 
field. RSK 


Stochastic Processes, $(17-18), P. Radically El- 
ementary Probability Theory. Edward Nelson. An- 
nals of Math. Stud., No. 117. Princeton U Pr, 
1987, ix + 97 pp, $40; $15 (P). [ISBN: 0-691-08473- 
4; 0-691-08474-2] Develops the theory of probability 
and stochastic processes using nonstandard analysis, 
leaving the results in nonstandard form. More ad- 
vanced appendix shows how the conventional theory 
of stochastic processes can be derived from nonstan- 
dard theory. Note hardcover price. RSK 


Stochastic Processes, T(18: 1, 2). Applied Prob- 
ability and Queues. Sgren Asmussen. Prob. & 
Math. Stat. Wiley, 1987, x + 318 pp, $67.95. 
[ISBN: 0-471-91173-9] Divided into three parts: Part 
A deals with simple Markovian models, particularly 
as they relate to queueing theory; Part B presents 
basic mathematical tools—renewal theory, regener- 
ative processes, and random walks; Part C investi- 
gates special models and methods, primarily dealing 
“with a more narrow class of problems associated 
with general distributions of interarrival times and 
service times.” RSK 


Elementary Statistics, T*(13: 1), S. Introduc- 
tion to Biostatistics, Second Edition. Robert R. 
Sokal, F. James Rohlf. WH Freeman, 1987, xii + 363 
pp, $37.95. [ISBN: 0-716-71805-7] Modest revision of 
the authors’ 1973 First Edition (TR, March 1974), 
again based on their more inclusive book Biome- 
try (Second Edition, TR, May 1982). Changes in- 
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clude some new descriptive procedures, more non- 
parametrics, the Bonferroni method of multiple com- 
parisons, and the use of the G statistic rather than 
the more common chi-square test to analyze fre- 
quency data. RSK 


Elementary Statistics, T(14-16), S, L. Elements 
of Statistics for the Life and Social Sciences. Brax- 
ton M. Alfred. Texts in Stat. Springer-Verlag, 1987, 
xill + 190 pp, $36. [ISBN: 0-387-96500-9] Expan- 
sive, self-contained introduction to statistical ideas 
needed for scientific work in anthropology (and pre- 
sumably other life and social sciences as well). Case 
studies used to illustrate logical argument, deductive 
thinking, prediction, and hypothesis testing. No ex- 
ercises. LCL 


Elementary Statistics, T(14-15: 1, 2). Ba- 
stc Statistical Methods for Engineers and Scientists, 
Third Edition. John B. Kennedy, Adam M. Neville. 
Harper & Row, 1986, xx + 613 pp. [ISBN: 0-06- 
043633-6] Revision of the authors’ 1976 Second Edi- 
tton. Includes new material on probability, statistical 
inference, and goodness of fit. Emphasizing practical 
considerations, it also includes chapters on rejection 
of outliers, distributions of extremes, tolerance and 
control charts, acceptance and rejection testing, and 
an introduction to the design of experiments. RSK 


Statistics, P. Advances in Multivariate Statistical 
Analysis: Pillai Memorial Volume. Ed: A.K. Gupta. 
Theory & Dec. Lib., Ser. B. D Reidel (US Distr: 
Kluwer Academic), 1987, xvi + 389 pp, $89. [ISBN: 
90-277-2531-4] Twenty-one papers providing a cross- 
section of recent developments in multivariate sta- 
tistical analysis. Dedicated to the memory of K.C. 
Sreedharan Pillai (1920-1985), it contains a short 
biography and a bibliography of his works. Note 
price. RSK 


Statistics, T(17-18: 1), S, P*. The Asymptotic 
Theory of Eztreme Order Statistics, Second Edition. 
Janos Galambos. Robert E Krieger, 1987, xv + 414 
pp, $49.50. [ISBN: 0-89874-957-3] Revision of the 
author’s 1978 First Editton published by Wiley (TR, 
December 1978). Rigorous presentation covering all 
known asymptotic models. Extensive bibliography, 
supplemented by a survey of the literature in each 
chapter. RSK 


Statistics, T(17), P, L. Kendall’s Advanced The- 
ory of Statistics, Fifth Edition of Volume 1: Distri- 
bution Theory. Alan Stuart, J. Keith Ord. Oxford 
U Pr, 1987, xvi + 604 pp, $75. [ISBN: 0-19-520561- 
8] Major revision of the 1977 Fourth Edition of the 
first volume of this classic three-volume treatise, orig- 
inally written by Kendall (1907-1983). The topics 
covered remain basically the same, but much new 
material has been added and terminology and proofs 
have been updated. RSK 

Statistics, T(14-16: 1, 2), S, L. Applied Statis- 
tics: Analysis of Variance and Regression, Second 
Edition. Olive Jean Dunn, Virginia A. Clark. Prob. 
& Math. Stat. Wiley, 1987, xii + 445 pp, $36.95. 
[ISBN: 0-471-81269-2] Revision of the authors’ 1974 
text (TR, February 1975). Each chapter now in- 
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cludes a brief description of relevant BMDP and SAS 
computer programs. Also includes new material on 
repeated measure designs and the use of dummy vari- 
ables in multiple regression and covariance analysis, 
and expanded material on variable selection and test- 
ing assumptions in multiple regression. RSK 


Statistics, T(18: 1), P. Asymptotic Distribu- 
tion Theory in Nonparametric Statistics. Manfred 
Denker. Adv. Lect. in Math. Friedr Vieweg & Sohn, 
1985, vii + 204 pp, $18 (P). [ISBN: 3-528-08905-9] 
Treats three basic types of statistics: Hoeffding’s U- 
statistics, differentiable statistical functionals, and 
statistics based on ranks. Concludes with a chap- 
ter on contiguity and efficiency. No exercises. RSK 


Statistics, T(17: 1, 2), P. Plane Answers to 
Complez Questions: The Theory of Linear Models. 
Ronald Christensen. Texts in Stat. Springer-Verlag, 
1987, xiv + 380 pp, $42. [ISBN: 0-387-96487-8] De- 
signed “to rigorously illustrate the practical appli- 
cation of the projective approach to linear models.” 
First half covers standard topics in regression analy- 
sis, analysis of variance and covariance; last part in- 
troduces various special topics such as residual anal- 
ysis, variance component estimation, and log-linear 


models. RSK 


Statistics, T(18: 1), S, P*. Measurement Error 
Models. Wayne A. Fuller. Prob. & Math. Stat. Wi- 
ley, 1987, xxiii + 440 pp, $44.95. [ISBN: 0-471-86187- 
1] Presents both theory and applications of models in 
which the explanatory variables are measured with 
error. Includes a number of real examples illustrating 
the procedures. Good bibliography. RSK 


Statistics, S(16-17), P. Design, Data, and Analy- 
sis by Some Friends of Cuthbert Daniel. Ed: Colin 
L. Mallows. Prob. & Math. Stat. Wiley, 1987, 
xix + 380 pp, $29.95. [ISBN: 0-471-83937-X] Sev- 
enteen papers, three expository, two concerned with 
design issues, and the other twelve dealing primarily 
with the analysis of data, written in recognition of 
Daniel’s great influence in this latter area. Written 
for “students and aspiring statistical consultants.” 
Includes a brief outline of Daniel’s career and a list 
of his publications. RSK 


Statistics, P. Contributions to the Theory and Ap- 
plication of Statistics: A Volume in Honor of Her- 
bert Solomon. Ed: Alan E. Gelfand. Academic Pr, 
1987, xxviii + 544 pp, $59.95. [ISBN: 0-12-279450- 
8] Twenty papers, contributed by friends and col- 
leagues, grouped into four areas where Solomon has 
made significant contributions: operations research 
and applied probability; distribution theory and ge- 
ometric probability; applications in the areas of law 
and justice, medicine and psychology; and inference 
methodology. Includes a brief biographical sketch 
and a list of Solomon’s publications. RSK 


Statistics, T(16-17: 1), S, L. Introduction to Sta- 
tistical Inference. Jack Car] Kiefer. Ed: Gary Lor- 
den. Texts in Stat. Springer-Verlag, 1987, vill + 
334 pp, $48. [ISBN: 0-387-96420-7] Text based on 
lecture notes developed by Kiefer (1924-1981) for a 
first course in statistical inference. Presents a mod- 


1988] 


ern decision-theoretic approach to inference, empha- 
sizing the need to justify the use of a procedure by 
some criterion of goodness. RSK 


Statistics, P. The Population-Sample Decomposi- 
tion Method. A.M. Wesselman. Intern. Stud. in 
Econ. & Econometrics, V. 19. Kluwer Academic, 
1987, 242 pp, $47.50. [ISBN: 90-247-3603-X] An ap- 
proach which does not require all the model assump- 
tions of classical statistical methods. The study aims 
to illustrate how the population-sample decomposi- 
tion method is applicable to a much wider class of 
statistics. LCL 


Programming, T(13-14: 1). Pascal-SC: A Com- 
puter Language for Scientific Computation. Gerd 
Bohlender, et al. Perspect. in Comput., V. 17. 
Academic Pr, 1987, ix + 292 pp, $34. [ISBN: 
0-12-111155-5] Pascal-SC (scientific computation— 
implemented for Z80, 8088, and 68000 processors) is 
an extension of standard Pascal which allows greater 
accuracy in solving numerical problems, and has 
dynamic arrays and better string handling; chap- 
ters (among others) include standard Pascal, real 
floating-point arithmetic, strings and text process- 
ing, dynamic arrays, and modules; a syntax diagram 
appendix is included, along with numerous complete 
examples; useful as either a reference or text for a 
numerical-based Pascal course. RSF 


Languages, P. Lecture Notes in Computer Scitence- 
274: Functional Programming Languages and Com- 
puter Architecture. Ed: Gilles Kahn. Springer- 
Verlag, 1987, vi + 470 pp, $34.60 (P). [ISBN: 0-387- 
18317-5] 24 papers from the third conference on the 
title subject, held in Portland, Oregon, September 
1987. This conference focused particularly on im- 
plementation techniques for functional programming 
languages and on computer architectures to support 
the efficient execution of functional programs. DFA 


Languages, P. REDUCE: Software for Algebraic 
Computation. Gerhard Rayna. Symbolic Compu- 
tation. Springer-Verlag, 1987, ix + 329 pp, $29.80 
(P). [ISBN: 0-387-96598-X] An introduction to RE- 
DUCE 3—a system for symbolic algebraic compu- 
tation available on systems ranging in scale from an 
IBM PC toa Cray X/MP. Of special interest is a sec- 
tion containing several case studies illustrating the 
use of this symbolic computation package to solve 
nontrivial problems. AO 


Languages, P. Lecture Notes in Computer Science- 
276: ECOOP ’87: European Conference on Object- 
Oriented Programming. Ed: J. Bézivin, et al. 
Springer-Verlag, 1987, vi + 273 pp, $23.10 (P). 
[ISBN: 0-387-18353-1] Contains 25 papers on various 
aspects of object-oriented programming. AO 


Computer Systems, P. Lecture Notes in Com- 
puter Sctence-275: System Development and Ada. 
Ed: A.N. Habermann, U. Montanari. Springer- 
Verlag, 1987, 305 pp, $25.70 (P). [ISBN: 0-387-18341- 
8] An interesting collection of papers presented at 
the workshop on software factories and Ada held 
in Capri, Italy, May 1986. Covers three general 
topics. First, design of software development en- 
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vironments, describing commercially available envi- 
ronments to ease production of very large software 
systems. Language-specific and mixed-language en- 
vironments are presented. Secondly, methods for 
software development, addressing the issue of ensur- 
ing that implementations accurately reflect specifica- 
tions. Finally, Ada compiler validation and specify- 
ing and testing Ada events are covered. PS 


Theory of Computation, P. Lecture Notes in 
Computer Science-273: A Connotational Theory of 
Program Structure. James S. Royer. Springer- 
Verlag, 1987, 186 pp, $18 (P). [ISBN: 0-387-18253- 
5] This monograph is an outgrowth of a Ph.D. dis- 
sertation (SUNY at Buffalo) continuing development 
of a language-independent theory of program struc- 
ture begun by Riccardi and Case; the central theme 
is the subclass, called acceptable numberings, of ef- 
fective numberings; also stressed is “building” one 
control structure from others. It is very mathemat- 
ical, with symbolism, numerous definitions, lemmas, 
propositions, and theorems; includes a bibliography, 
notation index, and definition index. RSF 


Theory of Computation, P. Lecture Notes in 
Computer Sctence-270: Computation Theory and 
Logic. Ed: Egon Borger. Springer-Verlag, 1987, ix 
+ 442 pp, $34.60 (P). [ISBN: 0-387-18170-9] Memo- 
tial volume for Dieter Rodding, a German logician 
interested in interaction of logic and computer sci- 
ence. Topics include recursion theory, automata the- 
ory, complexity theory, applications of logic in sys- 
tems theory. KS 


Theory of Computation, P. Logic of Program- 
ming and Calculi of Discrete Design. Ed: Manfred 
Broy. NATO ASI Ser. F, V. 36. Springer-Verlag, 
1987, 413 pp, $75. [ISBN: 0-387-18003-6] Proceed- 
ings of a NATO Advanced Study Institute held in 
Marktoberdorf, Federal Republic of Germany, July 
29-August 10, 1986. Contains fourteen papers on 
methodologies for the specification, design, and ver- 
ification of programs. AO 


Theory of Computation, T*(16-18: 1, 2), S, 
P, L. Foundations of Logic Programming, Second, 
Eztended Edition. J.W. Lloyd. Symbolic Compu- 
tation. Springer-Verlag, 1987, xii + 212 pp, $44.50. 
[ISBN: 0-387-18199-7] Expanded material in the Sec- 
ond Edition leans toward databases. As well, the 
class of programs considered is enlarged. Preliminar- 
ies are in the first chapter while the second chapter 
contains the declarative and procedural semantics of 
definite programs. Chapter three deals with normal 
programs and chapter four is new and treats unre- 
stricted programs. The last chapter provides a the- 
oretical foundation for deductive database systems. 
Chapter problems. References. Index. (First Fdi- 
tion, TR, October 1985.) RJA 


Artificial Intelligence, P. Natural Language Pars- 
ing Systems. Ed: Leonard Bolt. Symbolic Computa- 
tion. Springer-Verlag, 1987, xviii + 367 pp, $49.50. 
ISBN: 0-387-17537-7] This collection of nine essays 
(fourteen authors) about current problems of natural 
language parsing in an artificial intelligence context 
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presents research results in greater detail than jour- 
nal articles allow; each paper has an abstract and 
reference list; there is a volume subject index. RSF 


Artificial Intelligence, P. The Knowledge Fron- 
iter: Essays in the Representation of Knowledge. Ed: 
Nick Cercone, Gordon McCalla. Symbolic Compu- 
tation. Springer-Verlag, 1987, xxxv + 512 pp, $42. 
[ISBN: 0-387-96557-2] This collection of 17 essays 
(26 authors) about knowledge representation is an 
outgrowth of the IEEE Computer Special Issue on 
Knowledge Representation in 1983; six of the papers 
are new, 7 are updated, and 4 are unchanged; the pa- 
pers are organized in six sections—Overview, Logic, 
Foundations, Organization, Reasoning, and Applica- 
tions, giving a comprehensive treatment to this as- 
pect of artificial intelligence. The preface, index, and 
reference list are extensive. RSF 


Computer Science, P. Lecture Notes in Computer 
Sctence-263: Advances in Cryptology—CRYPTO 
86. Ed: A.M. Odlyzko. Springer-Verlag, 1987, xi 
+ 487 pp, $34.60 (P). [ISBN: 0-387-18047-8] Pro- 
ceedings of an annual conference held at the Univer- 
sity of California, Santa Barbara, devoted to cryp- 
tologic research. Thirty-six papers classified by data 
encryption, public-key cryptography, zero-knowledge 
proofs, secret-sharing methods, hardware systems, 
software systems, software protection, probabilistic 
metheds, and other topics. LCL 


Computer Science, P. Lecture Notes in Computer 
Sctence-265: Analogical and Inductive Inference. Ed: 
K.P. Jantke. Springer-Verlag, 1987, vi + 227 pp, 
$20.60 (P). [ISBN: 0-387-18081-8] Written versions 
of presentations given at the International Work- 
shop on Analogical and Inductive Inference held in 
Wendisch-Rietz, GDR, October 6-10, 1986. Discus- 
sion centered around two basic approaches to learn- 
ing algorithms: inductive inference and analogical 
reasoning. RJA 


Computer Science, P. Lecture Notes in Com- 
puter Science-264: Logic Programming ’86. Ed: Ei- 
iti Wada. Springer-Verlag, 1987, 179 pp, $18 (P). 
[ISBN: 0-387-18024-9] Proceedings of the Fifth Logic 
Programming Conference held in Tokyo, Japan, June 
23-26, 1986. RJA 


Applications, P. Reliability Data Bases. Ed: 
Aniello Amendola, Alfred Z. Keller. D Reidel (US 
Distr: Kluwer Academic), 1987, xii + 398 pp, $72. 
[ISBN: 90-277-2549-7] Probabilistic techniques are 
increasingly used to assess safety, new system de- 
signs, and reliability of products. These applications 
require the availability of good dependable and rele- 
vant data about failure rates, incidents, performance, 
and so forth. These proceedings (ISPRA Course, Oc- 
tober 1985) provide a comprehensive review of the 
state-of-the-art relating to the collection, processing, 
and use of such data. LCL 


Applications, P. Filtering for Stochastic Pro- 
cesses with Applications to Guidance, Second Edt- 
tion. Richard S. Bucy, Peter D. Joseph. Chelsea, 
1987, xviii + 217 pp, $19.95. [ISBN: 0-8284-0326-0] 
Filtering a random signal process from observations 
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corrupted by noise. A reprint of a 1968 edition which 
gives a detailed derivation of the Kalman-Bucy filter 
(TR, May 1969). Includes appendices which correct 
the a priort bounds and gives an up-to-date list of 
relevant references. CEC 


Applications (Biological Science), P. Mathe- 
matical Aspects of Hodgkin-Huzley Neural Theory. 
Jane Cronin. Stud. in Math. Biology, V. 7. Cam- 
bridge U Pr, 1987, xi + 261 pp, $49.50. [ISBN: 0-521- 
33482-9] An introduction to the study of mathemat- 
ical models of electrically active cells. Provides an 
account of the literature on the subject and covers in 
detail the Hodgkin-Huxley model for nerve conduc- 
tion. Accessible to mathematicians with little or no 
background in physiology. RH 


Applications (Cognitive Science), S, P, L**. 
Brains, Machines, and Mathematics, Second Edi- 
tton. Michael A. Arbib. Springer-Verlag, 1987, xvi + 
202 pp, $27. [ISBN: 0-387-96539-4] Revision (princi- 
pally an expanded introduction) of a path-breaking 
book written in 1962 (published by McGraw-Hill in 
1964) outlining a theory of automata to explain the 
processing of inforznation in brain-like machines. Be- 
ginning with the McCulloch and Pitts model of neu- 
ral networks and ending in Godel’s theorem, Arbib 
joins biology to mathematics in a prescient challenge 
to contemporary cognitive science. LAS 


Applications (Electrical Engineering), T(15- 
16: 1). Integrated Circuit Design. Alan F. Mur- 
ray, H. Martin Reekie. Springer-Verlag, 1987, xiv 
+ 152 pp, $28. [ISBN: 0-387-91303-3] Designed as 
a textbook for an introductory course for electrical 
engineering or physics students. The final chapter 
provides a design exercise that can be assigned as a 
year-long project. AO 


Applications (Electrical Engineering), T(16- 
17: 2), S, P, L. Methods in Electromagnetic Wave 
Propagation. D.S. Jones. Oxford Eng. Sci. Ser., 
V. 8. Clarendon Pr, 1987, $29.95 each (P). Vol- 
ume 1: Theory and Guided Waves, 433 pp, (ISBN: 
0-19-856189-X]; Volume 2: Radiating Waves, 476 pp. 
[ISBN: 0-19-856190-3] The objective of these volumes 
is to develop a suitable framework of theory and nu- 
merical analysis with applications to various aspects 
of propagation of electromagnetic waves. Numerous 
exercises are included. MU 

Applications (Engineering), P. Reliability Mod- 
elling and Applications. Ed: A.G. Colombo, A.Z. 
Keller. D Reidel (US Distr: Kluwer Academic), 
1987, x + 391 pp, $74. [ISBN: 90-277-2566-7] Pro- 
ceedings of the ISPRA Course (November 1985) in- 
cludes twenty papers grouped into five sections: sys- 
tems reliability, availability and maintainability of 
systems, structural reliability, reliability of comput- 
ing systems, human reliability. LCL 


Applications (Engineering), P. Lecture Notes in 
Mathematics-1279: Saint-Venant’s Problem. Dorin 
Iegan. Springer-Verlag, 1987, viii + 162 pp, $16.30 
(P). [ISBN: 0-387-18361-2] The problem of A.J.C.B. 
de Saint-Venant is to “determine the equilibrium of a 
homogeneous and isotropic linearly elastic cylinder, 
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loaded by surface forces distributed over its plane 
ends.” These lecture notes present recent research on 
Saint-Venant’s problem for anisotropic elastic bod- 
ies, on the relaxed Saint-Venant problem for hetero- 
geneous elastic cylinders, and on Saint-Venant prob- 
lems within the linearized theory of Cosserat elastic 
bodies. One hundred sixty references from Saint- 
Venant’s original papers to the present. JK 


Applications (Engineering), P. Contact Mechan- 
ics. K.L. Johnson. Cambridge U Pr, 1987, xi + 452 
pp, $34.50 (P). [ISBN: 0-521-34796-3] Covers stresses 
and deformations which arise when two solid bod- 
ies come in contact, especially when the contact is 
localized due to dissimilar profiles (non-conformal). 
This induces a local stress concentration which can 
be studied using the method of superposition of point 
force solutions. MR 


Applications (Engineering), P. Lecture Notes in 
Engineering-9: The Complez Variable Boundary El- 
ement Method. Theodore V. Hromadka II. Springer- 
Verlag, 1984, xi + 243 pp, $22 (P). [ISBN: 0-387- 
13743-2] Develops methods for approximating solu- 
tions to the Dirichlet and Neumann boundary-value 
problems by applying the Cauchy integral formula 
to the boundary element method. Includes a quick 
review of basic complex variable theory. BH 


Applications (Fluid Mechanics), P. Amorphous 
Polymers and Non-Newtonian Fluids. Ed: Constan- 
tine Dafermos, J.L. Ericksen, David Kinderlehrer. 
Inst. for Math. & Its Applic., V. 6. Springer- 
Verlag, 1987, xii + 195 pp, $22. [ISBN: 0-387-96556- 
4] Ten papers from a workshop held at the Institute 
for Mathematics and Its Applications (IMA) in Min- 
nesota during the 1984-85 special year on continuum 
physics and partial differential equations. LCL 


Applications (Physics), P, L. Three Hundred 
Years of Gravitation. Ed: S.W. Hawking, W. Is- 
trael. Cambridge U Pr, 1987, xiii + 684 pp, $69.50. 
[ISBN: 0-521-34312-7] Sixteen chapters, separately 
authored, surveying cosmology, gravitation, string 
theory, and other theories descended from Newton’s 
Principia, published just 300 years ago. Much more 
than a simple collection of papers, this compre- 
hensive commemorative volume is a superb, well- 
planned exposition of the modern theory of gravi- 
tation. LAS 

Applications (Physics), T(16-17: 1, 2), S, L. 
Gravitational Physics of Stellar and Galactic Sys- 
tems. William C. Saslaw. Cambridge U Pr, 1987, 
xvii + 491 pp, $34.50 (P). [ISBN: 0-521-34975-3] Al- 
though primarily a theoretical treatise based on clas- 
sical Newtonian gravity, some observational data is 
introduced for motivation. The text is relatively self- 
contained and accessible to anyone familiar with ad- 
vanced calculus. Numerous interesting exercises are 


included. MU 


Applications (Physics), P. Symmetries of Maz- 
well’s Equations. W.I. Fushchich, A.G. Nikitin. 
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Transl: John R. Schulenberger. Math. & Its Applic. 
D Reidel (US Distr: Kluwer Academic), 1987, xiv 
+ 214 pp, $74. [ISBN: 90-277-2320-6] The power of 
Maxwell’s equations lies largely in their symmetries, 
both geometric (Lie-group) and non-geometric (Lie- 
algebra). The former is classical; the latter is still 
being developed, and is the main subject here. The 
moral: Symmetry makes the world go ‘round. BC 


Applications (Simulation), P. DEMOS: A Sys- 
tem for Discrete Event Modelling on Simula. G.M. 
Birtwistle. Springer-Verlag, 1987, 215 pp, $18 (P). 
[ISBN: 0-387-91301-7] An introduction to discrete 
event simulation modelling using DEMOS—a sys- 
tem which compliments SIMULA by providing el- 
ements which help beginners write simulations more 
quickly. Describes the basic DEMOS approach to 
model building as well as DEMOS descriptions of 
synchronization problems arising in discrete event 
simulations. Written as a teaching text for DEMOS, 
not as a reference. Tutorial style presents new fea- 
tures and offers an illustrative example of each. DE- 
MOS programs’ advantage over regular SIMULA 
programs: more easily written and understood with- 
out a thorough knowledge of SIMULA. Includes ex- 
ercises and solutions. PS 


Applications (Social Science), S, P, L. Multi- 
dimensional Similarity Structure Analysis. I. Borg, 
J. Lingoes. Springer-Verlag, 1987, xiv + 390 pp, 
$39. [ISBN: 0-387-96525-4] A class of models that 
represent similarity coefficients among a set of ob- 
jects (e.g., correlation matrix) as distances in mul- 
tidimensional space (two points are closer together 
when they are more correlated). The resulting pic- 
ture is easier to assimilate than the table of coeffi- 
cients. This book of case studies deals with all as- 
pects of this subject, starting from scratch assuming 
no more than a high school background in mathe- 
matics. LCL 
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The * Prime Number Theorem” for the Periodic Orbits of a 
Bernoulli Flow 


S. P. LALLEY 
Department of Statistics, Purdue University, W. Lafayette, IN 47907 


1. Introduction. In 1969 G. Margulis [8] published the statement of a remarkable 
result concerning the distribution of closed geodesics on a compact Riemann surface 
of curvature —1: if M(x) is the number of closed geodesics with lengths not 
exceeding x, then 


N(x) ~ e*/x 


as x — oo. Margulis’ proof, unfortunately, has never been published in English. In 
1976, Hejhal [6] gave a proof of Margulis’ theorem based on the Selberg trace 
formula. 

Since Margulis’ announcement statistical regularities in the distributions of 
periodic orbits have been discovered for various flows. For a weakly mixing Axiom 
A flow restricted to a basic set, Parry and Pollicott [10] (following earlier work by 
Bowen [1], [2]) proved that the number of periodic orbits with minimal period not 
exceeding x is asymptotic to e"*/hx as x > oo, where h is the topological entropy 
of the flow. Sarnak [11] proved a similar result for the horocycle flow. 

These results closely resemble the prime number theorem. This is no accident: 
the proofs in [6], [10], and [11] all use suitable zeta functions together with some of 
the standard machinery of analytic number theory. In a recent paper Parry [9] has 
taken this approach one step further and produced an analogue of the Dirichlet 
density theorem for periodic orbits of Axiom A flows. 

In this article I shall present a similar result for a very simple flow, the so-called 
Bernoulli flow. I shall use only elementary techniques of asymptotic analysis: no 
zeta functions, no Tauberian theorems, no heavy machinery. This approach has the 
advantage that it leads to some interesting results concerning the distributions of 
individual periodic orbits, results which have no analogues in analytic number 
theory. In another paper [7] I have shown that this elementary method can be 
adapted to the general Axiom A flow, giving similar results. 

No knowledge of geometry or dynamical systems is necessary to understand this 


paper. 


2. Symbolic flows and Bernoulli flows. Consider the map o: [0,1] — [0,1] given 
by ox = ((2x)), where (( y)) denotes the fractional part of y (.e., (Cy)) =y — 
[[ y]], where [[ y]] is the greatest integer in y). The map o is called the shift because 
of’ what it does to the binary expansion of x: if x = .x,x,x3..., then ox = 
X4X3Xq---- 

Let f: [0,1] — (0, co) be a continuously differentiable function. (More generally, 
one may take f to be piecewise C! with discontinuities at dyadic rationals.) The 
symbolic flow under f (sometimes called the f-suspension or the special flow under f ) 


After a brief career wrestling alligators in carnivals, Lalley took up the study of probability and 
statistics. He wrote his Ph.D. dissertation on sequential testing at Stanford University under the tutelage 
of David Siegmund. He was a member of the statistics faculty at Columbia University from 1980 until 
1986, and is now at Purdue University. 
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takes place on the region 1, = {(x, 5): O< x <1 and 0 <s < f(x)} under the 
graph of f. The dynamics are as follows: starting at any point (x, 5) € L,, a particle 
moves upward along the vertical line segment from (x, 5) to (x, f(x)) at unit 
velocity; upon reaching the “ceiling” at (x, f(x)) the particle jumps instantaneously 
to the “floor” at (ox,0), then proceeds at unit speed up the vertical line segment 
from (ox, 0) to (ox, f(ox)), jumping again upon reaching the ceiling, etc. If T)(x, s) 
denotes position at time ¢ of a particle started at (x, s) initially, then 


T(x,5) (x,s+t) ifst+t<f(x) 
S = 
AM (ox, 0) ifs+t=f(x) 
and 
T,°T, = Toys. 


A Bernoulli flow is a symbolic flow under a function f of the form 
1 : 1 : 
=lx<=})+ >=}, 
fl) . | P . 2 


For many interesting flows, including geodesic flows on compact manifolds of 
negative curvature and, more generally, Axiom A flows, questions about the ergodic 
behavior of trajectories may be reduced to similar questions about trajectories of 
symbolic flows. This is because the trajectories of such flows may be “coded” into 
binary expansions in a continuous and (more or less) one-to-one fashion. The idea is 
that if the phase space is partitioned into finitely many “boxes” then trajectories of 
the flow are more or less uniquely determined by the sequences of boxes the 
trajectories enter and leave. The matter of choosing a suitable partition is quite 
difficult: cf. Bowen [2] and Series [12]. 

The key to the study of periodic orbits of the symbolic flow under / is that the 
periodic orbits are precisely those orbits which intersect the “floor” (i.e., x-axis) at 
points (x,0) for which x has a periodic binary expansion. Observe that if x has a 
binary expansion with (minimal) period m then the orbit through (x, 0) intersects 
the floor of L, at (x,0),(ox,0),(07x,0),...,(06”~'x,0), and at no other points. 
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Furthermore, the (minimal) period of such an orbit is 


f(x) + f(ox) + --- +f(o™"7 1x). (2.1) 


Observe that for the Bernoulli flow the quantity (2.1) is just 


m+(B-1)))x,, 
i=1 
where .x,X,X,... is the binary expansion of x. 
The main result of this article is an asymptotic formula for the number of 
periodic orbits of a Bernoulli flow with B > 1 and 8B irrational (cf. (9.4) below). This 
will be derived in sections 4—9. First, however, I shall look at a special case. 


3. A special case. For the symbolic flow under the constant function f= 1 the 
counting problem is almost trivial. The number of periodic binary expansions with 
minimal period dividing n is 2”, so by the MGebius inversion formula the number of 
n-periodic binary expansions is 

n 
d 2% ~ 
d\n d 
u being the MG6ebius function. Each periodic orbit of the flow with period n 


corresponds to a set of n such n-periodic binary expansions: therefore, the number 
of periodic orbits of the flow with period n is 


n—' Yu ( =] ~2"/n (3.1) 
d|n d 
as n — oo. This agrees in form with the results of Margulis and Parry and Pollicott. 

Now consider the ensemble of periodic binary expansions with minimal period 
dividing n. By the weak law of large numbers for Bernoulli trials, for “most” 
periodic binary expansions with period dividing n the proportion of ones is near 
1/2, when n is large. Hence “most” periodic orbits of the flow with period dividing 
n spend approximately half of the time on the right half of 2, and the rest on the 
left half of Ly. 

Much more can be said. Let e,e, --- e,, be a finite sequence of zeros and ones. 
Then the weak law of large numbers implies that for “most” periodic binary 
expansions with period dividing n, the proportion of times the pattern e,e, --: e,, 
appears in the expansion is near 2~™, provided n is large. Thus most periodic orbits 
of the flow with period dividing n spend about 2~”™” of the time in each rectangle 
with vertices (k2~”,0), (k27™,1), (kK + 1)27",1), and ((k + 1)27",0), k= 
O,1,<..,2™” — 1. Since the number of periodic orbits with period n is asymptotic to 
the number of periodic orbits with period dividing n (cf. (3.1)), it follows that most 
periodic orbits with period n are nearly uniformly distributed on the square Ly. 

It is tempting to conjecture that this result is true for all symbolic flows. The 
truth, however, is more subtle, as will be seen. 


4. Counting orbits of the Bernoulli flow. Consider now the Bernoulli flow de- 
scribed in section 2, i.e., the symbolic flow under 


f(x) = if < ;\ + pr x . ;\ 
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For definiteness, I assume B > 1. In addition, I assume that B is irrational: the 
reason for this will be seen later (cf. section 7). Notice, however, that if 8 were not 
irrational, then B = k// for certain relatively prime integers k and / and the lengths 
of all periodic orbits would be integer multiples of 1/1. 

Recall that a periodic orbit of the Bernoulli flow is one which passes through the 
floor of 1, at a point (x,0) such that x has a periodic binary expansion, and that 
the period of such an orbit is 2”, f(o'x), m being the period of the binary 
expansion. Let ¢ > 0 be sufficiently small that 


&, = e/(B —1) <1. (4.1) 
For a > 0, define 


N(a; ©) = #{ periodic orbits with period between a anda-+ &}, 
N,7 (4a; €) = #{ periodic binary expansions .x,X.X3 +++ with period m such that 


a—m<(B-1) Dx, sa-m+eh 


i=1 
N,,(@; &€) = #{ periodic. binary expansions .x,X,X, +++ with period dividing m 
such that 


a—m<(B~1)Yx,<a-m+e). 
i=1 
Then 
N(a;e)= )) N,*(a;8)/m 
m=1 


and 


N,,(a; «) = ) N*(ad/m; ed/m). 
d|m 


The Mo6ebius inversion formula implies that 
Nyt(a; ©) = Da(]Nq(ad/m; ed/m): 
d|m 
consequently, 


N(a;¢) = > m} ¥ of 5 | N(ad/m: ed/m). (4.2) 


m=1 d\m 


The reason for representing N(a; «) this way is that the quantities N,(a; e) may 
be easily calculated in terms of the binomial coefficients. Suppose there is an integer 
k > 0 such that a—m<(B—-—l)k<a-—~m+ gé; notice that there can only be one 
such integer, because of (4.1). Then N,,(a; «) is the number of distinct sequences 
X1X +++ X,, of zeros and ones with exactly k ones, so 


m 
N,,(a; €) = (i): (4.3) 
If there is no integer k satisfying a-—m<(B—-1)k <a—m+ e, then 


N_,(a; €) = 0. (4.4) 
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It is worth noting that the numbers N,,(a; «) have a “probabilistic” interpreta- 
tion. If S| is a random variable with the binomial (m,1/2) distribution (i.e., S,, is 
the number of heads in m independent tosses of a fair coin), then 


N,,(4; €) = 2"Pr{a—m<(B-1)S,<a-—m+e}, 


thus the “asymptotics” of the numbers N_(a; «) are tied up with some of the 
classical limit laws of probability theory. 

Equations (4.2)—(4.4) provide an explicit representation for N(a; €) in terms of 
the binomial coefficients and the MG6bius function. Unfortunately, this representa- 
tion is not very illuminating, nor is it very useful for exact computations. In the next 
four sections I shall use methods of asymptotic analysis to obtain an approximate 
formula for N(a@; ®). 


5. Asymptotic analysis: preliminaries. The series (4.2) contains two sums. I shall 
argue that the sum ).,,,, is dominated by the term d = m. 

Suppose d|m. Let .x(xfx..., i=1,2,..., m/d, be periodic binary expan- 
sions each with period dividing d. One may form a periodic binary expansion 
.X1X7X,... with period dividing m by splicing together d-blocks of the binary 
expansions .x{x$)x{)... as follows: 


XX _Xq eee = HKD 2 DHOOM GPX 


This construction is one-to-one: each distinct set of binary expansions .x{?x$?..., 
i=1,2,...,m/d, gives rise to a distinct expansion .x,x,x,.... Furthermore, if 
each sequence x{x$) ... x has the property that 


ad/m — d< (B -1) > x < ad/m— d+ ed/m, 
j=l 
then 
a—m<(B-1)¥ox,<a-mt+e. 
j=l 
It therefore follows that 


N,(ad/m; ed/m)"”* < N.,(a; 8). (5.1) 


If d|m and d < m then m/d = 2. Since m has at most m nonnegative divisors it 
follows from (5.1) that 


Y (=) Ny(ad/m; ed/m)| < mN,,(a; 6)” (5.2) 
d|m 
d#m 


In subsequent sections I shall show that the sum LN (a; €)/m and the largest 
terms in this sum both grow at an exponential rate as a — oo. Hence the terms 
represented in (5.2) grow at a smaller exponential rate. In any case (5.1) and (5.2) 
imply 


N(a;¢) = > {m~'N,,(a; @) + C,,(a; e)}, (5.3) 
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where 
|C_,(a@; e)| < N(a; e)'/*, (5.4) 


Recall that N,,(a; «) is the number of periodic binary expansions .x,X,X3... 
with period dividing m such that 


(a —m)/(B -1) < Y x, <(a-m)/(B-1) +e 


(e, = €/(B — 1) < 1 by (4.1)). The sum 27 |x, is a nonnegative integer no larger 
than m. Since the difference between the extreme sides of the double inequality is 
é, <1, there is at most one possible value for /7,x,; furthermore, there is an 
integer between (a — m)/(B — 1) and (a — m)/(B — 1) + «, iff the fractional part 
of (a — m)/(B — 1) 1s at least 1 — e,. In order that this integer lie between zero 
and m it is necessary that 


a/B<ms<at+B-1. 
It now follows from (5.3) and (5.4) that 


N(a; 2) = > (i, )/m + C,(a; ) 


a/B<ms<a+B-1 


1{(((a— m)/(B —-1))) > 1- ex}, 


(5.5) 


where 
km = |[(a—m)(B-1)7"]] +1 (5.6) 

and 

m \" 


ICu(as ell < (x (5.7) 


m 


6. Asymptotic analysis: applying Stirling’s formula. The binomial coefficients in 
the series (5.5) may be handled by Stirling’s formula and a related inequality (cf., 
e.g., Feller [3, section II.9]): 


n! ~ n"e~"(2an)’* asn > 0, 
nre"<ni<(n+1)"e" Wn=1,2,.... 
The double inequality and the elementary relation (1 + m~')” < e imply 
(") < (m4 rye Yn, 20. 
where H is the Shannon entropy function 
H(p) = —plogp—(1—-p)log(1—p), O<p<1, 
H(0) = H(1) = 0; 


Stirling’s formula implies that if n > oo, k > oo, and n — k > o, then 


(j,) ~ ere“ ((2mm(k/n)(1 — k/n))) 7, 
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m 


Consider now the quantities (." occurring in the sum (5.5). By the preceding 
remarks, 


(i, < (m+ l)exp{mH(k,,/m) +1} Wm, (6.1) 
and 
(x. ~ exp{ mH(k,,/m)}(2am—!k,,(m — km)) (6.2) 
provided both k,, and m — k,, are large. 
Define 


J(p) =pH((l-p)p-(B-1)"), Bspst, 
Jp) =0, p(B], 
O(p)=H'((1-p)p-(B-1)°), Bl s<psl. 
It follows from (5.6) that 
k,,/m = (am~!—1)(B-—1) + m1 — (((a —m)(B - 1)"*))}. 
Consequently, since H( p) is continuous for p € [0, 1], 
mH(k,,/m) ~ aJ(m/a) (6.3) 


as a — oo, uniformly for aB~'< m<a+ B-—1; moreover, since H is C@ on 
(0,1), if m/a is restricted to a compact subset of (8~*,1), then 


mH(k,,/m) = aJ(m/a) 
+Q(m/a){1 — (((am- = 1)(B-1)"'))\ (6.4) 
+ o0(1), 


where o(1) converges uniformly to zero as a —> oo. It now follows from (6.1) and 
(6.3) that 


4 < me("/%e4) WaB-l|<m<at+B-1 (6.5) 


where o(a) is independent of m and lim 
and (6.4) that 


(ic. ) ~ ev%n/*exp{ O(m/a){1 — (((a — m)(B - 1)~*))} 


x (2amr(m/a)(1 — r(m/a)))'””, (6.6) 


o(a)/a = 0; and it follows from (6.2) 


a >c 


where 


r(p)=(1-p)p (B-1)°, pe(s-},1), (6.7) 


as a — 00, uniformly for m/a in any compact subset of (B7', 1). 

It is apparent from (6.5) and (6.6) that the exponential growth rates of the 
binominal coefficients in the sum (5.5) are governed by the function J(p), p € 
[B~',1]. Now J(B~') = JQ) =0, since H(0) = H(1), and J(p)>0 for pe 
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(B~',1), since H(p)> 0 for p € (0,1). Since J is continuous, it follows that J 
attains a maximum value in (B~',1). Routine calculations give 


J"(p) = H'(r(p))/ p(B - 1)’), (6.8) 
H"(r) = -(r(l-r))™, (6.9) 


hence J is strictly concave on [8~+,1], and achieves its maximum value uniquely at 
some pz € (8 ',1). Thus 


J( pg) = max J(p) > 0, (6.10) 
B-l<psl 
J"( pg) <0. (6.11) 


Together with (5.5), (6.5), and (6.6), this suggests that J( pg) is the exponential 
growth rate of N(a; e). I shall prove this, and more, in section 8. 


Note. It may be shown that the exponential growth rate J( pg) is the “ topological 
entropy” of the Bernoulli flow, the topological entropy being the maximum mea- 
sure-theoretic entropy achieved by an invariant measure. The invariant probability 
measure #1, which maximizes entropy may be described as follows. Let X = 
_X,X,X,..., where X,, X2,... are iid. Bernoulli (r(pg)), and let Y be uniformly 
distributed on [0, fg(X)], where fp(x) = 1{x < 1/2} + Bl{x > 1/2}; then pe is 
the distribution of the point (X, Y). (For general results subsuming these, cf. Lalley 
[7].) 


7. Asymptotic analysis: rapidly oscillating terms. Asymptotic analysis of the series 
(5.5) is complicated by the presence of factors depending on the quantity 
({(a — m)(B — 1)~*)), which cause rapid oscillations in the terms of the series. 
There is, however, a high degree of regularity in these oscillations, due to the fact 
that 1/( 8 — 1) is irrational. 

Call an increasing sequence {n,} of integers k-syndetic if n,;,, — n, < k for all j 
(the terminology is adapted from Furstenberg [5, section 1.4]). For real numbers y, 
56#0, 0<e <1, let {n,(y, 6, e,)} be the increasing sequence of nonnegative 
integers n satisfying 


((y + n8)) > 1— ey. 


LEMMA 1. For any 0 < & <1 and any irrational 6 there exists an integer 
k = k(6, €4) > 0 such that for any y © &, the sequence {n,(y, 6, &)} is k-syndetic. 


Proof. Since the unit interval [0, 1] is compact the sequence ((nd)), n = 1,2,..., 
has a convergent subsequence. Hence, there exist integers n’ <n” such that 
|(cn’d)) - ((n"8))| < &/2. Let n=n" —n’; then either ((nd)) < &/2 or 
((nd)) > 1 — &/2, and ((nd)) # 0 or 1, because 6 is irrational. In either case, 
there exists a finite integer m such that every x © [0,1] is within «, /2 of some point 
in the finite sequence ((nd)), ((2nd)),...,((mnd)). Consequently, there exist 
nonnegative integers k,,k, such that ((k,6)) <e,/2 and ((k,6)) > 1 — «,/2. 
Let k = max(k,, k,); then for any y © & the sequence {n,(y, 6, e)} is k-syndetic, 
because if ((x)) > 1 — e,, then ((x + k,6)) > 1 — ey or ((x + k,8)) > 1 — by. 
© 
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LemMa 2. Ler f: [0,1] ~ # be continuous. Then for any irrational 6 © #, any 
0<e, <1, and any y€& &, 


im m=! YS f((¢y + mdyy)1 (Cdr + 08)) > 1 = en} 


= ff) dt. (7.1) 


Furthermore, this convergence holds uniformly in y for each fixed f € C{0,1], 0 < e, 
< 1, and irrational 6. 


Proof. Weyl’s equidistribution theorem (cf. Feller [4, section VII, 4] or Fursten- 
berg [5, section 3.3]) states that for any continuous g: [0,1] > # such that 


g(0) = g(1), 
lim m-™ YY g(((n8))) = [g(1) 
mr n=1 


For any y € & the function g,(t) = g({(y + ¢))) is a continuous function on (0, 1] 
satisfying g,(0) = g,(1) and S58, = {3g, hence by Weyl’s theorem, 


Jim m™ aly + nB))) = f'g(«) de (7.2) 


It is easily verified, using the uniform continuity of g, that this convergence holds 
uniformly in y. 

The function f(f)1{¢ => 1— «} may be approximated from above and from 
below by continuous functions g(?f) satisfying g(0) = g(1). Thus (7.1) follows from 
(7.2) and the uniformity in y of (7.1) follows from the uniformity in y of (7.2). © 


8. Asymptotic analysis: reduction to a Gaussian integral. Using the results of 
sections 6 and 7, I shall evaluate the sum (5.5) asymptotically as a > oo. There will 
be three main steps in the analysis. First, I will show that only those terms for which 
m/a * pp (cf. (6.10)) contribute appreciably to the sum. Second, I will use Lemma 2 
to dispose of the rapidly oscillating factors in the terms of the sum. Finally, I will 
show that the modified sum over the region m/a = pg, with the rapid oscillations 
removed, may be approximated by a Gaussian integral. 


Step 1. Inequalities (5.7) and (6.5) give upper bounds on the terms of the sum 
(5.5): to wit, 


(ic, }/m + C,,(4; ed) {((a —m)(B - 1)~*)) >1- ex} < et)(m/a)+0(a) 
(8.1) 


for all m satisfying aB~!<m<a+ B-—1. Recall that J(p) achieves its maxi- 
mum value uniquely at p = pg; since J(p) is continuous it follows that for every 
6 > 0, max, ,_ pal= sJ(p) < J( pg). I will argue that there is at least one nonzero term 
in the sum (5.5) whose logarithm is approximately aJ( pg); hence, only those terms 
in (5.5) such that |m/a — pg| < 6 contribute appreciably to the sum. 

By Lemma 1 there is an integer k such that for all a > 0 the sequence of integers 
m satisfying ({(a — m)(B — 1)~*)) => 1 — e, is k-syndetic. Hence, there exists a 
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nonzero term in the sum (5.5) such that |m/a — pg| < k/a. But (5.7) and (6.6) 
imply that this term satisfies 


(i) /m + C_(a; °)] ~ ett (Pp) tla) (8.2) 


Now there are at most 0(@) nonzero terms in the sum (5.5), and they all satisfy 
(8.1). Moreover, there is at least one nonzero term satisfying (8.2). Since J( p) is 
bounded away from J( pg) for |p — pg| = 4, it follows that for every 6 > 0, as 
a— 00, 


N(a; &) ~ » (i, | m+ Cua; €)} 


m: |ma~! — pg\<6 


x1{(((a — m)(B-1)7*)) = 1- en. (8.3) 
Notice that (8.1) and (8.2) also yield 
lim a7 log N(a; e,) = J( pg). 


Step 2. Next, consider those terms in the sum (8.3) for which |ma~' 


za_'/*, where z > 0 is a large constant. By (5.7) and (6.6), 
(ic) /m + C,(a; a) (((a —m)(B - 1)71)) >1- es | 
~ "exp! O(m/a){1 — (((a - m)(B - 1)~*))) 


(2ma3r(m/a)(1 — r(m/a)))*(m/a) 
x1{(((a— m)(B - 1)~*)) >1 — en} (8.4) 


1 


— Pp| S 


uniformly for m in the range |ma~* — pp| < 6. The expression on the right has a 
rapidly oscillating factor and a slowly oscillating factor. I will argue that the rapidly 
oscillating factor may be replaced by an average. 

Let m,,m, be any two integers in the range |ma~* — pg| < z/a’'/? such that 
|m, — m,| < log a. Since J( pg) = 0, Taylor’s theorem implies 


jaJ(m,/a) — aJ(m,/a)| < C(log a)/a’”* 


for a suitable constant C (which may depend on z but not on a). Also, since r(p) is 
continuous, r(m/a) ~ r( pg) for |ma~* — ppg| < za~’/*. Therefore, the factor 


e(™/2) (2 q7a3r(m/a)(1 — r(m/a)))"(m/a) 


is nearly constant as m ranges over any interval of length log a contained in 
|ma* — Ppl S za */?, 

The function Q(p) is continuous, hence Q(m/a) ~ Q( pg) uniformly for 
|ma~* — pa| < za~’/*. Since e2'?s)" is a continuous function of ¢, it follows from 
Lemma 2 that the sum of 


exp{ O(m/a){1 — (((a — m)(B - 1)~'))} 
1{({(a— m)(B-1)"')) = 1- ey} 
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over any interval of length [[log a]] contained in |ma~* — p,| < za~'/? is asymp- 
totic to 


(log a) [ e 2 Ppa —*) at 
1—e, 


as a — oo. Furthermore, this holds uniformly over all intervals of length [[log a]] 
contained in |ma~' — p,| < za~'’’, by the uniformity in Lemma 2. 

Since the sum over the range |ma~' — pg| < za~'”* may be broken up into sums 
over intervals of length [[log @]], the arguments of the preceding paragraphs show 
that as a > ©, 


» ((e)/™ + C,(a; a)i{(((a —m)(B - 1)7?)) >1- ex) 


m:|ma™} — Pp\|S za 


~ x e"/2)(2na3r(m/a)(1 — r(m/a))) /°(m/a)*”? 
m: |ma~1—pp|<za~\/? 
x “e0(r0™ dr| 
J 
aJ(m/a)(9 a 43 1- yy" 3/2 
~ » e ( 7a r( pg)( r( pg) Pp 


m: |ma™} — pg\<za7'/? 


x f *eQ(pp)t dt}. (8.5) 
0 


Step 3. Recall that J(p) attains its maximum value at p = pg, and that J’( pg) 
< 0 (cf. (6.9)). Consequently, for m in the range |ma~ — p,| < za~'”?, 


aJ(m/a) — aJ( pg) = a(ma~? — pp) J" pg) /2 + o(1), 


where o0(1) > 0 as a > 00 uniformly for m in the range |ma~' — pg| < za~'/’. It 
follows that 


(22a?) 7? e2d(m/a) 


m:|ma} —pgiszao/? 


~ qa ie4I (Ps) y (27a) ~'”*exp{ a( man? — Pp) J”( Pp)/2\ 


m:|ma} —pg\<za7\/? 


as a ~— oo. Observe that the sum on the right-hand side of this expression is a 
Riemann sum for [7 ,(27)~'exp{ yJ’"(pg)/2} dy. Therefore, as a > ©, 


> (2ma3) 1/7 et(m/a) ~~ am tea(Po) f° (27) 7°77 eI" (pp)/2 dy. 
—Z 


(8.6) 


Notice that as z — oo the integral in this expression converges to (—J’( pg))~ 17”. 

A similar argument may be used to bound the sum of those terms in (8.3) in the 
region za~!/* < |ma™! — Pg| < 5. By (8.4), there is a constant C < oo such that 
each term is dominated by Ca~?/*e4/("/%, If 6 > 0 is sufficiently small, Taylor’s 


m: |ma~! — Pg\|SZza- 
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theorem implies that 


aJ(m/a) — aJ( pg) < a(maq! — pg) J”( pg) /4 


for m satisfying |ma~* — pg| < 6. Hence 


E (ie, ) m+ Galas e))i{(((a = m)(B = 1)"")) 1] 


m: za~'/? <|ma~! —pg|<68 


< Ca?/2e4K(o9 y exp{ a(ma~" — py) J”( py)/4} 


m: za~/?2<|ma7! — Pp\<6 


< Cla ler) f eI" (Pe)/4 dy (8.7) 
\y|>2 


for a suitable constant C’ < oo not depending on a or z. Notice that as z > oo the 
integral in this expression converges to 0. 
Combining (8.3), (8.5), (8.6), and (8.7) and letting z — oo gives 


N(a; e) ~ a7'e"(—J"( pg)) (rpg (1 = (pp) Bp 7 


ex 8.8 
i exp{ O( pg)t} at ee) 


as a ~~ ©. 


9. The “prime number theorem.” Define 
M(a) = #{ periodic orbits with period less than a}. 
Then 
M(a)= ), N(a— ke; €) (9.1) 
k>1 
for any e > 0. I shall use the relation (8.8) to obtain an asymptotic formula for 
M(a)asa> o. 
It is evident from (8.8) that the largest terms in (9.1) are those for which a — ke 
is large. For these terms the approximation (8.8) is good. For the smaller terms (8.8) 
may not provide such a good approximation, but these terms do not have much of 
an effect on the first-order asymptotics of the sum, so there is no harm in using (8.8) 
to approximate the smaller terms in (9.1) as well. Thus, 
M(a) ~ C.Y, (a— ke) e642) (20) (9.2) 
k>1 
as a — oo, where 


C= “e2(r0)' dt{ —J"( pg) r( pg )(1 — r(pg))} pg 2”. 


Keep in mind that (9.2) is valid for all e > 0 sufficiently small. 
The sum in (9.2) is dominated by those terms for which (a — ke)/a = 1, because 
of the exponential. Consequently, as a — 00 


Y (a — ke) bet") (P=) ~ YF qu lela ke) (pp) 
k>1 k>1 


_ aT leat(Pe){ eI Pa) /(] _ e7 T(Pp))} 
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hence, 
M(a) ~ ate PC { e7 7 (PH) /(1 — e 7(P8)) | (9.3) 


for all e > 0. Recall that e, = e(B — 1)~', so as € 10, fete?) dt ~ eB —1)71. 
Letting e | 0 in (9.3) therefore gives 


M(a) ~ a~'e*P=){ —J"( pp)r(pg)(1 — r(pp))} pp ?2(B — 1)" (pg) 


= (aJ(pp)) "ev" (9.4) 


(cf. (6.8)-(6.9)). This is the “Prime Number Theorem” for the periodic orbits of the 
Bernoulli flow. 


10. Distribution of individual periodic orbits. The preceding analysis leads to 
some interesting conclusions about the comportment of “typical” periodic orbits. 
As we have seen, the sum (5.5) is asymptotically dominated by those terms for 
which m/a ~ pp and k,,/m ~ r( pg). The index m represents the number of times 
the periodic orbit crosses the x-axis; the index k,, represents the number of such 
crossings which occur between 1/2 and 1 on the x-axis. Hence, “nearly all” periodic 
orbits with period between a and a + e cross the x-axis about ap, times, and the 
fraction of such crossings which occur between 1/2 and 1 is approximately r( pg), 
provided a is large. 

The indices m and k,, are also related to the binary expansions of those points 
on the x-axis where crossings occur. In particular, m is the period of the binary 
expansion, while k,, is the number of ones in a single period. Now for most periodic 
orbits with period between a and a+, where a is large, it is the case that 
K,,/™ ~ (pg). Hence, nearly all periodic orbits cross the x-axis at points whose 
binary expansions are periodic and have the property that the fraction of ones is 
about r( pg). 

An even stronger statement may be made. Suppose m, k,, are integers satisfying 
(5.6). Every m-periodic binary expansion with a cycle consisting of k,, ones and 
m — k,, zeros specifies a periodic orbit of the flow with period between a and a + € 
(except those expansions having period d|m, d # m; however, by (5.2), this compli- 
cation may be ignored). The collection of such expansions is in 1-1 correspondence 
with the set of (ordered) random samples (without replacement) of size m from an 
urn with k,, ones and m — k,, zeros. It is not difficult to show, using the weak law 
of large numbers and some elementary properties of sampling without replacement, 
that if m is large and k,,,/m ~ r( pg), then “nearly all” such random samples have 
the following property: the fraction of times the pattern (e,, e5,..., €,) of zeros and 
ones occurs in the ordered random sample is approximately r( pg)’ — r( Pg)”, 
where‘v is the number of ones in (é), @,..7, @,). (This holds for any fixed pattern 
(€,, €),---, €,).) Therefore, “nearly all” periodic orbits of the flow with period 
between a and a + ¢, where a is large have the following property: of those points 
at which the periodic orbit intersects the x-axis, the fraction whose binary expansion 
begins with the pattern .e,e,...e, is approximately r( pg)’(1 — r(pg))*’. 

Now if Xj, X2,... is a sequence of independent, Bernoulli-(r(p,)) random 
variables, then the probability that X,=e, for each i1=1,2,...,A is also 
r( pg)’ — r¢ Pg)”. Hence, for “nearly all” periodic orbits of the flow with 
period between a and a + e, the crossing points of the x-axis have nearly the same 
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distribution as X = .X,X,X,... . Since each orbit consists of vertical line segments 
above the crossing points, this specifies (approximately) the distribution of the 
entire orbit. Recall from section 6 that the maximum entropy distribution for the 
flow is the distribution of (X, Y), where X is as described above and Y is uniformly 
distributed on (0, f,(X)), with fp(x) = 1{x < 1/2} + Bl{x > 1/2}. Therefore, if 
a is large, then “nearly all” periodic orbits with period between a and a+ are 
approximately distributed according to the maximal entropy measure for the flow. 
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Folding Polynomials and Their Dynamics 


WM DOUGLAS WITHERS, U.S. Naval Academy 


WM DOUGLAS WITHERS: I received my Ph.D. in 1983 from Georgia Tech, 
working under Les Karlovitz. I spent a year at the University of Maryland as 
a visitor before coming to the Naval Academy in 1984. Most of my research 
has been in the somewhat related fields of dynamical systems and analysis of 
fractals. In the past year I have become interested also in the theory of 
orthogonal polynomials in several variables. One might describe this as 
entirely disjoint from my other interests, except that this article serves as a 
counterexample. 


Abstract. The equilateral triangle, the right isoceles triangle, and the 30-60-90 triangle are special in 
that they can be folded into replicas of themselves. We describe polynomial mappings which are 
equivalent as dynamical systems to the mappings which stretch and fold these triangles onto themselves. 
The analogous polynomials in one dimension are the Chebyshev polynomials, whose special properties as 
dynamical systems are shared by our polynomials. Our construction can be carried out in any number of 
dimensions. 


1. Introduction. The mapping 
f(x) =2x?-1 (1.1) 


on the interval [— 1,1] is one of the most basic examples in the theory of dynamical 
systems. The dynamical systems view of a mapping such as (1.1) is to consider the 
points in [—1,1] as possible states of a physical system and the mapping x > f(x) 
as describing the evolution of the system over one unit of time. It is in this sense 
that we refer to a mapping such as f from a set into itself as a dynamical system. 
The long-term behavior of the physical system is described by the behavior of the 
iterates X9, x; = f(Xo), X. = f(X1),---, Xn41 = S(X,),--- aS n goes to infinity. We 
call the set of these iterates the orbit of xo. 

Typical questions to be asked about a dynamical system such as f are: 

Does f have periodic orbits of a given period p? A periodic orbit of period p is 
a set of p points {x,,...,x,} such that f(x,) =x,4, for n=1,..., p—1 and 
f(x,) = x,. In particular, a fixed point for f is a periodic orbit of period 1. 

Is there a measure p» which gives long-term average values for continuous 
functions g: 

1 4N 
Jim + dL alf (xo)) = fgdu? (1.2) 
Such’a measure must be invariant under /; that is, wE = pf ‘(£) for measurable 
sets E, and often w is largely independent of x. 

Do iterates of initial conditions close together separate over time and if so, how 
rapidly? Relevant to this question is the determination of Lyapunov exponents, 
logarithms of average stretching factors. For example, a one-dimensional mapping f 
such as (1.1) has a single Lyapunov exponent A given by 


A= lim log|D(f*)(x))'/*, 
where D represents the derivative. Often A is largely independent of x. 
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We shall use the term dynamical properties loosely to refer to properties such as 
the existence of periodic orbits, invariant measures, and Lyapunov exponents. 
By an attractor for f we mean a compact set A such that the set 


B={x: lim inf If") — yl = OF 


has positive Lebesgue measure, and which is minimal with respect to this property. 
The set B is called the basin of attraction for A. One can show that A and B must 
be invariant under f: f(A) =A, f(B)=B. If a measure p giving long-term 
averages as in (1.2) is independent of x, for x, in some set FE of positive Lebesgue 
measure, then its support A is an attractor for the system and E is contained in the 
basin of attraction for A. 

An attractor may be of classical type, a periodic orbit or (in higher-dimensional 
spaces) a torus. For the mapping (1.1), however, it can be shown that the iterates of 
almost any initial condition in [—1,1] are dense in [—1,1]; this interval is thus an 
attractor for f, a strange attractor, as attractors not of classical type are called. 

The system (1.1) is, however, not at all typical of systems with strange attractors 
because its dynamical properties can be determined in great detail. Although for 
most systems with strange attractors, the problems of counting periodic orbits, 
representing invariant measures, calculating Lyapunov exponents, and so forth, 
seem quite intractable, for (1.1) all these questions can be answered with the aid of 
the coordinate change x = h(u) = cosau. When we make this change of coordi- 
nates, the system is transformed to the “tent map”: 


2u ifu< 
t(u) =h-'e foh(u) = 
21-—u) ifu 


IV 
Nl rR NO] Re 


on [0,1]. Since this mapping is piecewise linear with a constant stretching factor 2, 
we can see at once that its Lyapunov exponent is log2. We can determine other 
dynamical properties with the aid of a symbolic argument. As applied to the tent 
map, the symbolic argument consists mainly of writing u as a binary expansion and 
noting that the map ¢ merely shifts and (sometimes) complements the binary 
expansion of u. We present a more detailed example of such an argument in §5 of 
this article. Almost every point in [0,1] has a binary expansion which is statistically 
random, that is, in which all sequences of digits of a certain finite length occur with 
equal frequency. It follows that the iterates t”(u) of almost every initial condition in 
[0,1] are uniformly distributed in [0,1] as n goes to infinity; thus the invariant 
measure which gives long-term average values ts just Lebesgue measure on [0, 1]. We 
can also count periodic orbits of ¢ by observing that the points of these orbits are 
just those points with periodic binary expansions. 

Most dynamical properties are easily transformed under a differentiable change 
of coordinates; thus we can determine the properties of our quadratic mapping f. 
The Lyapunov exponent is invariant under such a coordinate change, so the 
Lyapunov exponent of / is log2. The invariant measure p giving long-term average 
values for f is the image of Lebesgue measure under the cosine function; the 
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measure of a set E is therefore given by 
dx 
E> [ 
E my(1 — x*) 


Periodic orbits of ¢ map directly to periodic orbits of f under the cosine function. 

If we seek additional examples of dynamical systems with strange attractors 
whose dynamical properties can nonetheless be completely determined, we observe 
that the Chebyshev polynomials, by virtue of the definition 


T, (cos tu) = cosniu (1.3) 


all share the properties of (1.1); under the transformation x = cosvu, the nth 
Chebyshev polynomial 7,, becomes a piecewise linear function which stretches the 
interval [0,1] by a factor of n and folds it back n times onto itself. Examples of 
such completely solvable systems beyond this seem to be scarce. 

The purpose of this article is to show that further examples do exist and have 
many interesting properties. For example, the mapping 


A,(x) =x?- 22 


in the complex plane has as an attractor a deltoid with vertices at the points 3 and 
—3/2 + 373 /2 with its interior, as shown in Figure 1. 


Fic. 1. 


The dotted circle with its interior is mapped under A, to the entire attractor, as 
are each of the corner regions. This system can be analysed with the aid of a certain 
coordinate change (detailed in §3) which transforms the deltoid into an equilateral 
triangle and its inscribed circle into an inscribed triangle, as in Figure 2. Under this 
change of coordinates, the mapping A, from the deltoid to itself becomes a 
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FIG. 2. 


mapping F, from the triangle to itself, and F, is nothing more than folding the 
triangle on the dotted lines and stretching by a factor of two to cover the original 
triangle (the vertex at the right is mapped to itself). The mapping A, is one of a 
sequence of polynomial mappings which are equivalent to folding the triangle into 
9,16, 25,... copies of itself. 

By a foldable shape we mean a polytope which can be folded into replicas of 
itself. Corresponding to any foldable shape in k dimensions there is a sequence of 
polynomials P,, such that the mapping P,, is equivalent to stretching and folding the 
shape onto itself n*-to-one. We call these polynomials folding polynomials. More 
precisely, a foldable shape ® is one which can be partitioned into smaller congruent 
replicas ¢$,,..., ,« of itself which overlap only at their boundaries, and for which 
there exists a mapping F,: ® — ® which is continuous and whose restriction to 9, 
is n times an isometry. In fact if F, exists for any n then it exists for all natural 
numbers n. We can uniquely define this mapping by requiring that it be n times the 
identity in the neighborhood of a particular point of ® (in two dimensions, the 
vertex with the smallest angle). Then there exists a function h defined on ® and a 
polynomial P,, which is conjugate to F, by h; that is, P,: h(®) — h(®) and 


P,=heF oh}, 


This existence theorem is proven in Hoffman and Withers [3]. 

In two dimensions there are just four foldable shapes: the equilateral triangle, the 
isoceles right triangle, the 30-60-90 triangle, and the rectangle, as is shown in [3]. 
Since the rectangle is a product of intervals, it is a trivial matter to define the 
associated polynomials in that case in terms of Chebyshev polynomials. Associated 
with each of the other three shapes is a sequence of folding polynomials, which can 
be transformed by a suitable change of coordinates into stretching and folding of 
the corresponding triangle onto itself. 
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We have not explored applications of folding polynomials in great depth; we 
note, however, that the polynomials we call A, were used by Mittag and Stephen [6] 
in the study of the Potts model of phase transitions. They are also related to 
orthogonal polynomials in several variables studied by Koornwinder [4] and Eier 
and Lidl [2]. 

A complete description of the derivation and theory of folding polynomials is 
given in [3]. In this article we sketch the derivation and properties of the particular 
sequence of polynomials A, associated with the equilateral triangle and discuss 
their properties as dynamical systems, together with other special properties. We 
then present briefly the sequences B, and G, associated with the right isoceles 
triangle and the 30-60-90 triangle, respectively. Proofs are omitted where our 
results can be verified by direct computation. 


2. The (u,v) plane. We start with two lines in the plane R’, denoted /, and 1,, 
which intersect at an angle of 7/3. We give each point in the plane coordinates 
(u, v), where wu is the signed distance of the point from /, and v its signed distance 
from /,. We choose signs so that the region where both u and v are positive has an 
angle of 7/3. The region bounded by the lines v = —2u, u= —2v, andu—v=1 
is an equilateral triangle, which we shall call the fundamental region and denote by 
®; in general, we denote by ®, the triangle bounded by the lines v = —2u, 
u = —2v, and u — v = n. This system of coordinates is shown in Figure 3. 


Fic. 3. 


3. The function / and its properties. Let h: R* > C be defined by 
h(u, v) _— e2miu 4 e2tiv 4 e2mi(utu) 


Let p,: R* > R’ be defined by p,(u, v) = (1 + v, u — 1); then p, is a reflection 
through the line u — v = 1. Let p,: R* > R? be defined by p,(u, v) = (u, —u —v); 
then p, is a reflection through the line u = —2v. Let p,: R? > R* be defined by 
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p,(u, v) = (—v —u, v); then p, is a reflection through the line v = —2u. We then 
have 


hop, =hop, =hop, =h. 


Thus h is invariant under any isometry in the group A generated by p,, p,, and p3. 
It follows that the values taken by h everywhere in the plane are determined by its 
values in ®. 

Since h is continuous, it must map ® and hence the entire plane onto a compact 
subset of C. We denote h(R’) by A. The boundary dA of A is the image under h of 
the set of points where the Jacobian of h is zero; this set consists of the line 
v = —2u and its images under the group A. We can thus find by substitution: 


dA = {h(u, -2u): ue R} = {(2e77 + e 47™: wER}. 


This parametric equation describes a hypocycloid of three cusps, called a deltoid. 
Some properties of this curve are summarized in [5]. The set A consists of the 
deltoid with its interior as in Figure 1. The function h is a bijection between the 
boundary of ® and dA and between ® and A. The vertices of ® are mapped to the 
cusps of A. 

Let us also note how h7! may be calculated. If we let z = h(u, v), note that we 
have 


2miu,2miv 4 e2miv —27i(vtu) 4 eo 2mi(utu)o2aiu 


e e =Z 
and 
erty 2miv,g—27i(v+u) =]. 
Thus the roots of the equation 
si —zs?+Zs-1=0 (3.1) 


are e274 e27!¥ and e 27(4+4) This shows also that h is one-to-one from © to A. 


4. Definition of A,; the recursion formula. Let us define F,, the nth “folding 
function”, from ® to ®, as follows. Define S,: ® > ®, by S,(u, v) = (nu, nv). The 
triangle ®, consists of n* copies V,,...,V,2 of the triangle ®; for each copy ¥, 
there is an element a, of A which maps it onto ®. We define U,;: ®, ~ ® by 
U(u, v) = a,(u,v) for (u,v) € ¥,. Then F. = Ue S, maps © n?-to-one onto 
itself, except for points mapped to the boundary. 

We define the nth folding polynomial A,: A — A by 


A,=heF ch}, (4.1) 
Since hoU, = h, we have the alternative definition 
A,(h(u, v)) = h(nu, nv). (4.2) 


Note the similarity to the definition (1.3) of the Chebyshev polynomials. 
The functions defined in this way are nonanalytic functions of z but are 
polynomials in z and Z. To see this, one may verify 


A,(z) = 3. 
A,(z) =z. 
A,(z) =z? - 22. 
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That A, is indeed a polynomial in z and 2 for all n then follows from the recursion 
formula: 


A,, — ZA,-1 _ ZA,,_7 + A,_3 (n = 3). 


Thus A,(z) can be defined for all z € C. It is a stmple matter to calculate further 
A,: 


A,(z) = 2° — 32z + 3, 
A,(z) = 24 — 4272 + 27° + 4z, 
A,(z) = 2° — 523z + 52z* + 5z? — 5z, 


and so forth. 
From the definition (4.1) we can obtain directly the fact that, for n > 0, A, maps 
A onto itself n?-to-one, as well as the composition property: 


Ann =Am°An = An? Am: 


The composition property is an unusual one; it is known, for example, that the only 
sequences of one-dimensional polynomials with this property are essentially the 
sequence of Chebyshev polynomials 7,(x) and the sequence of power functions x” 


[7]. 


5. Geometric mapping properties of the A,. One property of the sequence of 
power functions z +> z” on C is that they carry certain straight lines into straight 
lines, namely, those lines containing 0. The functions A, have a similar property, 
but for a different family of lines. 

Consider a line in the (u, v) plane given by the equation u = c, c being constant. 
Its image L. under h is a line with parametric equation 


z= e2m7ic 4 e27iv 4 e 2mi(utc) 


_ e2mc 4 eW *mic/2( erm(u+c/2) 4 e 2mi(vtc/2)) | 


The quantity in parentheses can be considered as a real parameter 
t = 2cos2a(v + c/2); 
thus the equation becomes 
z= e2mic 4 en Ty 
The line L.. makes an angle of c/2 with the positive real axis. 

From equation (4.2), the image of L. under the map 4A, is the line L,,.. Thus, for 
the family of lines L,, the functions A, have the same angle multiplication property 
as the power functions z +> z” have for lines passing through the origin. 

Im Figure 4 we have plotted the lines Z. for several values of c. It is seen that 
their envelope is the deltoid dA. 


Directing our attention to the function A,, let us inquire where its Jacobian is 
zero. Since A,(z) = z* — 22, the derivative matrix for A, is 


2x-2 -2y 
2y 2x+2]? 


where z = x + iy. Thus the set where the Jacobian of A, is zero is a unit circle 
centered at the origin; this is the inscribed circle I’ of the deltoid A. If we 
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parametrize this circle by z = e!®, we find that its image under the map A, is the 
curve z = e278 4 e~% whichis OA. 


6. Dynamical properties of the A4,. The fact that the polynomials A, on the 
deltoid A can be transformed by the differentiable coordinate change h to the 
folding functions F, on the triangle ® makes it possible to determine their 
dynamical properties, that is, the behavior of the iterates A,(z), A2(z),... of a 
point z in great detail. One must first determine the dynamical properties of the 
functions F,. Since the function F, is piecewise linear, with a derivative matrix 
which is n times an isometry, this can be accomplished with the aid of a symbolic 
argument, which will be familiar to dynamicists. 

Subdivide ® into equilateral triangles $,,...,,2, each of which is 1/n the scale 
of ® and is mapped onto ® by F.. These triangles overlap only on their boundaries. 
We further divide each ¢, into smaller equilateral triangles $,,,..., 9,2 and each $,, 
into smaller triangles $,,,,...,;,,2, and so on, such that $,..., is 1/n the scale 
of g,..., and F,(%, ...;,.,) =, ...;,,,. Thus diamg, ..., =n ~*"'diam® and a 
triangle $, ...;, overlaps others of the same size only at its boundary. 

Let = be the set of infinite sequences from {1,...,*}. Corresponding to each 
(jo. Ji. +.) © &X 1S a unique point $( jo, j;,-:.) € ® given by 


{$( jo. dis---)} = () Pine jet 
k=0 
Moreover, we have 
Foo = 6°, 
where o: = — > is the left shift operator: 
O( jos fis») = Cis dao). 
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We also have the relationship F,“($( jo; j,,--.)) © ¢;,. Thus a point (u,v) of ® 
which corresponds to more than one sequence in > must lie on the boundary of 
some >, _, and F*"(u, v) lies on the boundary of ®. From this it can be shown 
that the set of points in ® which correspond to more than one sequence has 
Lebesgue area measure zero. We can now establish the following dynamical proper- 
ties of F.. 

(i) Periodic orbits of F, can be counted as follows. It can be shown that each 
point of a periodic orbit of F, of period p corresponds to a unique sequence in %. 
This sequence must therefore be a point of a periodic orbit for o of period p; in 
other words, the sequence must be periodic. We thus have the combinatorial 
problem of counting the periodic sequences with a given period. This is known as 
the circular word problem and its solution is the following. Let II( p, n) denote the 
number of different circular words of p letters on an alphabet of n letters. Then 
(as in Berge [1]) 


(p,m) = —Ynl 2) are (6.1) 


q\P q 


where p. is the Mébius function (u(s) = (—1)/ if s is a product of ; distinct primes, 
p(s) = 0 otherwise) and the symbol | denotes “divides.” 

(ii) The Lyapunov exponents of F, are both equal to logn. (A mapping f: 
R*? > R’ has two Lyapunov exponents A, > A, given by 


A, = lim loglD(f*)(u, II", 


hy = lim — log||D(f*)(u, v) ||“, 
k—©o 


where Df(u, v) denotes the derivative matrix of f at (u, v).) In the case of F, these 
limits take the same value, log n, unless (uw, v) is in the set of measure zero which is 
eventually mapped to the boundary of ®, some power of F,, then being nondifferen- 
tiable at (u, v). 

(iii) Almost all initial conditions in ® have orbits under F,, which are dense in ©, 
and the invariant measure giving long-term average values of continuous functions 
is uniform on the triangle ®. Since the area of the triangle is 1/6, the density is 6. 
To see this, we choose a point (u, v) at random from ®, uniformly distributed, and 
let (jo. Jy»---) be its corresponding sequence in > (unique with probability 1). The 
probability that j, takes a particular value / is given by m@,...;,_,/m9;,...;,. = 
1/n’, where m is Lebesgue area measure. Thus, with probability one, any finite 
sequence (/,,...,/,) appears in the infinite sequence (jo, j),...) with frequency 
mo, ...;,/m® = 1/n**, and the orbit of (u,v) visits @, ...,, with this frequency. 
From this it can be shown that the orbit of (u, v) visits any measurable set E with 
frequency mE/m® = mE/6 with probability one. 

The corresponding dynamical properties of the functions A, can be derived from 
these. Properties (i) and (ii) above are invariant under a differentiable coordinate 
change such as h. Thus we have 

(1) The function A, has II(p, 7) periodic orbits of period p, where II is as in 
equation (6.1). 

(ii) The Lyapunov exponents of A,, are both equal to log n. 
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The invariant measure generated by A, is the image of that for F, under h. Thus 
it is absolutely continuous and has density 


6 
5(z) = Thyeha 


where J(h) denotes the Jacobian determinant of h. Thus 6 satisfies 
6 
d(h = — . 6.2 
(M4) = FG) (6.2) 


Note that J(h) cannot be a function of z = h(u, v) because it is not invariant under 
the reflections p,, p,, and p,; it actually changes sign under each of these 
reflections. However, it is true that (J(h))? is invariant under p,, p,, and p,; and is a 
function of z, indeed a polynomial in z and Zz. It is a result of the theory of folding 
polynomials that for any folding polynomial, the density of the invariant measure is 
the inverse square root of a polynomial. 

Theoretically one could: determine h~! from equation (3.1) and compose it with 
J(h) to determine 6. What we did in fact was to take the equation of the deltoid in 
Cartesian coordinates from [5]: 


(x? + y2) — x(x? — 3y”) + 18(x? + y?) — 27 =0. 
In terms of z = x + iy this becomes 
o(z) = (zz) — 4(z3 + 73) + 1827 — 27 =0. 


Since J(h) is zero on the boundary of ®, the density 6 goes to infinity on the 
boundary of A. It is not unreasonable to suppose that 6 might be somehow 
inversely related to o. With some experimentation we arrived at the following 
formula which satisfies (6.2): 


3 _ 
d(z) = —, (27 — 1827 — (2z)° + 4(23 + 73)| ve 
T 


The polynomials A, are also orthogonal with respect to the density 6; it is proven 
in [3] that: 


[An + iy)-A,(x + iy)8(x + iy) dxdy = 0. 
A 


The sequence A, is not a complete set; one can define, however, a complete set of 
polynomials associated with the A, in a natural way; see again [3]. 


7. Polynomials B, associated with the right isoceles triangle. In the case of the 
right isoceles triangle we will use a standard coordinate system in the (u, v) plane 
with the axes intersecting at right angles. Our fundamental region ®, consists of the 
right isoceles triangle bounded by the lines u = 0, v = 1/2, and u = v. 

The crucial factor in finding a sequence of polynomials conjugate to stretching 
the triangle ®, and folding it onto itself is to find a function h, such that (i) h, is 
invariant under the group of isometries generated by the reflections through the 
edges of ®, and (ii) h,(nu, nv) is a polynomial in h,(u, v). We define h,: R* > R? 
by 


h,(u,v) = (2cos2a7u + 2cos2mv,4cos2a7ucos2mv). 


Let p,(u, v) = (—4, v) be the reflection through the line u = 0. Let p,(u, v) = (u, 
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1 — v) be the reflection through the line v = 1/2. Let p,(u,v) = (v, u) be the 
reflection through the line u = v. Then we have 

hp? P; = hg? py = hg? p3 = hg, 
so h, is invariant under any isometry in the group B generated by p,, p,, and pz, 
and the values taken by 4, everywhere in the plane are determined by its values 
in O,. 

The function h, maps the entire plane onto a compact subset of R*, which we 
denote A,. The boundary of A, is the image under h, of the lines u = 0, v = 1/2, 
and u=v which bound ®,. One can show that these images are the lines 
y+4-2x=0, y+4+2x =0, and the parabola x* = 4y, respectively; these 
form the boundary of A,. Figure 5 shows this region. 


(-— 4,4) (4, 4) 


(0, ~ 4) 


Fic. 5. 


Parallel to the definition (4.2) of the polynomials A, we define B,: A, ~ A, by 
B, (hp (u,v)) = hg(nu, nv). (7.1) 
We then have 
By(x, y) = (4,4) 
B,(x, y) = (x, y) 
B(x, y) = (x? -2y — 4, y? — 2x? + 4y + 4) 
B,(x, y) = (x? — 3xy — 3x, y? — 3x?y + 6y? + 9y). 
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That B, is indeed a polynomial for all n then follows from the recursion formulae: 
Xn+4 = X(Xq43 + Xn41) — (2 + Y)Xn+2 — Xnp (7.2) 


Ynva =V(na3 + Yaar) — (7? — 2y — 2) Ina2 — ne (7.3) 


where we have written (x,, y,) for B,(x, y). It is then a straightforward matter to 
calculate further B,: 


B,(x, y) = (x4 - 4x?y — 4x2 4+ 2y? + 8y + 4, 

yt — 4x7y? + 2x4 + By? — 8x2y + 2y? — 8x? + 16y + 4), 
B(x, y) = (x5 — 5x3y — 5x3 + S5xy? + 15xy + 5x, 

y? — 5x*y3 + 5x4y + 10y4 — 20x2y? 

+ 35y> — 25x*y + 50y? + 25y), 


and so forth. 
From the definition (7.1) we can obtain directly the fact that, for n > 0, B, maps 
A, onto itself n?-to-one, as well as the composition property: 


Bon = B,° 8, = B,° B,- (7.4) 


The right isoceles triangle differs from the equilateral triangle in that besides 
being foldable into n? copies of itself it can be folded into 2 (and therefore 2n7) 
copies of itself. It is natural to ask whether there also exist polynomials conjugate to 
these mappings. Such polynomials do exist; they are given by the definition: 


B,fx(hg(u,v)) = hg(nu + nv, nu — nv). (7.5) 
For example we have 


Byx(x, y) = (y, x7 - 2y — 4). (7.6) 
With definition (7.5), the composition formula (7.4) holds even if m or n or both are 
multiples of ¥2. Thus the recursion formulae (7.2) and (7.3) together with (7.6) 
allow us to calculate all of this “extra”? sequence of polynomials. 

The dynamical properties of the polynomials B, can be obtained in the same way 
as those of the A,; we list them here (n represents an integer or an integral multiple 
of y2): 

(i) The function B, has II( p, n) periodic orbits of period p in Az, where II is as 
in equation (6.1). 

(1) The Lyapunov exponents of B, are both equal to log n. 

(iii) Almost all initial conditions of A, have orbits which are dense in Az, and 
the,invariant measure generated by B,, is absolutely continuous with density 


8(x,y) = =e +442x)(y + 4—2x)(x?- 4y)] 7”. 


The polynomials B, are also orthogonal with respect to this measure. 


8. Polynomials G, associated with the 30-60-90 triangle. For the 30-60-90 
triangle we use the same coordinate system in the (u, v) plane as for the equilateral 
triangle; the axes intersect at an angle of 7/3. Our fundamental region ®, consists 
of the 30-60-90 triangle bounded by the lines u = —2v, u = —v, and u—v=1. 
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We then define h,: R* > R* by 
h,(u, v) = (2cos2mu + 2cos2av + 2cos2a(v + u), 
2cos2a(u — v) + 2cos2a7(u + 2v) + 2cos2m(v + 2u)). 


Let p,(u,v) =(u, —u—v) be the reflection through the line u = —2v. Let 
p,(u, v) = (—v, —u) be the reflection through the line u = —v. Let p,(u,v) = 
(1 + v, u — 1) be the reflection through the line u — v = 1. Then we have 

Ne? py = hg? py = hep; = he, 
so hg is invariant under any isometry in the group G generated by p,, p, and p;, 
and the values taken by h, everywhere in the plane are determined by its values 
in Deo. 

The function h, maps the entire plane onto a compact subset of R*, which we 
denote A,. The boundary of A; is the image under h, of the lines u = 2u, u = v, 
and u + v = 1 which bound ®,. These images are curves described by parametric 
equations 


x = 41? + 4¢ - 2, 
y = 160? — 124 + 2, 
and 


x=2+ 4, 
y = 41? + 4t - 2. 


(The lines u = 2v and u + v = 1 map into the same curve.) These curves form the 
boundary of A,. Figure 6 shows this region. 


(— 3,6) (6, 6) 


(-2, ~2) 


Fic. 6. 
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Parallel to the definition (4.2) of the polynomials A, we define G,: A; > Ag by 
G,(h¢(u, v)) = hg (nu, nv). (8.1) 
We then have 
Go(x, y) = (6,6). 
Gi(x, y) = (x, y). 
Gy(x, y) = (x? -— 2x -— 2y — 6, y? — 2x3 + 6xy + 10y + 18x + 18). 
G3(x, y) = (x? — 3xy — 9x — 6y — 12, y? — 3x3y + Oxy? 
— 6x3 + 18y? + 45xy + 63y + 54x + 60). 
G,(x, y) = (x4 — 4x?y — 10x? — 4xy + 2y? — 8x + 8y + 6, 
yt + 2x® — 4x3y? — 12x4y + 12xy? + 18x7y? 
—28x3y — 36x4 + 24y? + 120xy? 
+108x2y — 40x? + 134y? + 372xy + 162x? + 280y + 360x + 198). 
Gs(x, y) = (x° — S5x3y — 15x? — 5x*y + Sxy? — 10x? 
+ 35xy + 10y* + 55x + 50y + 60, 
py? + 5x®y — 5x3y3 — 30x4y? + 10x® + 15xy4 
+45x*y3 — 65x3y? — 150x4y 
+ 30y* + 240xy? + 360x2y? — 205x3y — 180x4 + 255y? + 1200xy? 
+ 945x?y — 190x3 + 920y? + 2415xy + 810x? + 1495y + 1710x + 900). 
That G, is indeed a polynomial for all n then follows from the recursion formulae: 


Xn+6 — X(Xn45 + Xn+1) 7 (x + y + 3)(Xnv4 + Xn+2) 


+(x?-2y—-4)x,.;-%,, (8.2) 
Ynv6 =V(Mnes t+ Ynvr) — (x3 — 3xy — 9x — Sy — 9)(Yung + In42) 
+(y? — 2x3 + 6xy + 18x + 12y + 8)y,.3—-),, (8.3) 


where we have written (x,, y,) for G,(x, y). It is then a straightforward matter to 
calculate further G.,. 


From the definition (8.1) we can obtain directly the fact that G, maps A, onto 
itself n*-to-one, as well as the composition property: 


Gin = Gn°G, = G,° Gy: (8.4) 


The 30-60-90 triangle is similar to the nght isoceles triangle in being foldable 
not only into n? copies of itself but also into 3 (and therefore 3n”) copies of itself. 
Like the case of the right isoceles triangle, there also exist polynomials conjugate to 
these mappings. They are given by the definition: 


G,/3(hg(u, v)) = hg(nu — nv, nu + 2nv). (8.5) 


For example we have 


Gy3(x, y) = (y, x* — 3xy — 9x — 6y — 12). (8.6) 
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With definition (8.5), the composition formula (8.4) holds even if m or n or both are 
multiples of 73. Thus the recursion formulae (8.2) and (8.3) together with (8.6) 
allow us to calculate all of this “extra” sequence of polynomials. 

The dynamical properties of the polynomials G,, can be obtained in the same way 
as those of the A,; we list them here (n represents an integer or an integral multiple 
of y3): 

(i) The function G,, has II( p, n) periodic orbits of period p in Ag, where II is as 
in equation (6.1). 

(11) The Lyapunov exponents of G,, are both equal to log n. 

(ui) Almost all initial conditions in A, have orbits which are dense in A,, and 
the invariant measure generated by G,, is absolutely continuous with density 


3 _ 
S(x,y) = 5 ca l(4x? — y? — 12xy — 24y — 36x — 36)(4y + 12 - x?)| — 


The polynomials G,, are also orthogonal with respect to this measure. 
The fact that the equilateral triangle can be folded into two 30-60-90 triangles is 
reflected in the polynomials A, and G,. Define p: C > R* by 


p(z) =(z2+ 2, 2z — 3). 
Then for each integer n, we have 
ped, = G,° Pp. 


Acknowledgements. This research was supported by a grant from the Naval Academy Research 
Council. 

I am indebted to Michael Hoffman for his invaluable contribution to the theory of folding 
polynomials. 


REFERENCES 


l. C. Berge, Principles of Combinatorics, Academic Press, New York, 1971. 

2. R. Eier and R. Lidl, A Class of Orthogonal Variables in k Variables, Math. Annalen, 260 (1982) 
93-99. 

3. M. E. Hoffman and W. D. Withers, Generalized Chebyshev Polynomials Associated With Affine 
Weyl Groups (to appear). 

4. T. Koornwinder, Two-Variable Analogues of the Classical Orthogonal Polynomials, in Theory and 
Application of Special Functions, ed. R. Askey, Academic Press, New York, 1975. 

5. J. D. Lawrence, A Catalog of Special Plane Curves, Dover Publications, New York, 1972, pp. 
131-135. 

6. L. Mittag and M. J. Stephen, Yang-Lee Zeroes in the Potts Model, J. Stat. Phys, 35 (1984) 303-320. 

7. T. J. Rivlin, The Chebyshev Polynomials, John Wiley & Sons, New York, 1974. 


Celebrating Mathematics 


LYNN ARTHUR STEEN, St. Olaf College, Northfield, MN 55057 


LYNN STEEN is Past President of the Mathematical Associa- 
tion of America. This paper is the text of his Retiring Presiden- 
tial Address presented January 8, 1988 at the annual meeting 
of the MAA in Atlanta. Steen received his B.A. from Luther 
College in 1961 and his Ph.D. from the Massachusetts Institute 
of Technology in 1965. He has studied at the Mittag-Leffler In- 
stitute and with his St. Olaf colleague J. Arthur Seebach, Jr. 
authored Counterezamples in Topology and served as co-editor 
of Mathematics Magazine. Steen is editor of Mathematics To- 
day, Mathematics Tomorrow, and Calculus for a New Century: 
A Pump, Nota Filter. He is.Chairman of the Conference Board 
of the Mathematical Sciences and Chairman-elect of the Council 
of Scientific Society Presidents. 


This is the year we celebrate American mathematics. 

One hundred years ago, Thomas Scott Fiske founded the New York Math- 
ematical Society, precursor to the American Mathematical Society. Then, 
mathematics flourished in Europe but barely existed in the New World. 
Today, the American Mathematical Society sustains the world’s strongest 
environment for research mathematics. It has given science—and society— 
much to celebrate. 

The Mathematical Association of America is nearing its 75th anniversary 
as an organization devoted to the teaching of mathematics, especially at the 
collegiate level. Ever since 1915, the Society and the Association have coop- 
erated on many jointly sponsored activities. Our missions—mathematical 
research and mathematics teaching—are like braided strands that together 
form a strong fiber for the fabric of American science. 

On behalf of the Mathematical Association of America, I salute our sister 
society for a century of accomplishment, for the creation of a rich tapestry 
of beautiful and useful mathematics. The AMS centenary beckons all of us 
in the mathematical community to look around to our colleagues, to look 
outward to society, and to look ahead to the future of mathematics. We join 
the celebration, to applaud the achievements of American mathematics; to 
proclaim that strong mathematics contributes to strong science, to strong 
defense, and to a strong economy; and to challenge American youth with 
the excitement of mathematical discovery. 

Of course mathematics did not begin with the founding of the Ameri- 
can Mathematical Society. Two hundred years before that—three hundred 
years ago—Newton published Principia Mathematica, thereby establishing 
mathematics as the methodological paradigm of theoretical science. From 
this paradigm have emerged many mathematical sciences, and many math- 
ematics societies. The American Statistical Association celebrates its 150th 
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birthday in 1989; the Society for Industrial and Applied Mathematics cele- 
brated its 35th Anniversary last year, at the same time that the Association 
for Computing Machinery celebrated its 40th birthday. 

So “100 Years of American Mathematics” is a bit of hyperbole, a slogan 
that throws images beyond literal meaning. It is in part a strategy to focus 
attention on the need for sustained support for mathematics. We celebrate 
100 Years of American Mathematics not because we equate Thomas Fiske 
with Isaac Newton, but because it is a timely device to set important issues 
before the mathematical world, the scientific community, and the attentive 
public. Despite a century of success, there are urgent matters that need 
attention. 


Mathematics Today 


Those of us who are part of the mathematical community recognize the 
enormous current vitality of the mathematical sciences. Applications, enthu- 
siasm, initiatives, opportunities, and unity are the “vowels” around which 
mathematical language is formed, a language in which mathematicians ex- 
press solutions to old problems and explore new areas of fruitful growth. 
Explosive growth is a sign of remarkable health, but it does leave the thou- 
sands of us who try to keep up panting breathlessly as the research leaders 
disappear over the horizon. 

Dozens of areas of active research could be cited to document the vitality 
of contemporary mathematics (see [17], [18], [24]). For some of these, look 
at the program of this meeting. Better still, look at the whole program for 
100 Years of American Mathematics, especially at the marvelous symposium 
that is part of the 1988 AAAS meeting in Boston and at the special series 
of expository lectures on contemporary mathematics that is featured at the 
AMS Centennial Meeting in Providence. 

Today I want to highlight four areas as examples of the unity and appli- 
cability of mathematical research: computational statistics, mathematical 
biology, geometrical mathematics, and nonlinear dynamics. Each of these 
offers opportunities for initiatives in mathematical research and for engen- 
dering enthusiasm among students. Think especially about the latter, about 
opportunities for initiatives in mathematics education, while I briefly outline 
each of these four areas. 


Computational Statistics. The statistical sciences study problems as- 
sociated with uncertainty in the collection, analysis, and interpretation of 
data. Not surprisingly, the increasing use of computers to record and trans- 
form data has generated a host of new challenges for the statistical sciences. 

For example, analysis of data from electronic scanning devices (in tomog- 
raphy, in aircraft or satellite reconnaissance, in environmental monitoring) 
has produced an urgent need for statistical analysis of data that has an 
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inherent spatial structure [26]. Research in this emerging field of spatial 
statistics employs a wide variety of mathematical, statistical, and compu- 
tational techniques: problems of separating signals from noise borrow ideas 
from engineering; ill-posed scattering problems employ methods of numer- 
ical linear algebra; and smoothing of data requires statistical techniques of 
regularization. Underlying all this is the inherent geometry of the problem, 
which in many cases is dynamic and non-linear. 

Many applications of statistics (for example, to clinical data from innova- 
tive medical protocols) involve small data sets from which one would like to 
infer meaningful patterns. Bradley Efron and others have pioneered inno- 
vative, computationally-intensive statistical techniques that use the limited 
available data to generate more data with the same statistical characteris- 
tics ([4], [5], [14]). By resampling the given data repeatedly, these so-called 
bootstrap methods generate millions of similar possible data sets which yield 
accurate approximations to various complex statistics. By comparing the 
value of statistics for the given sample with the distribution obtained by 
these resampling schemes, one can determine whether the observed values 
are significant. 


Mathematical Biology. Nothing better illustrates the potential for 
mathematics in the biological sciences than the many traces of mathematics 
behind the Nobel prizes. For example, the 1979 Nobel Prize in medicine was 
awarded to Allan Cormack for his application of the Radon transform to the 
development of tomography and CAT scanners. The 1984 Nobel Prize in 
chemistry was awarded to biophysicist Herbert Hauptman, President of the 
Medical Foundation of Buffalo, for fundamental work in Fourier analysis 
pertaining to X-ray crystallography [19]. 

Indeed, recent research in the mathematical sciences suggests dramati- 
cally increased potential for fundamental advances in the life sciences using 
methods that depend heavily on mathematical and computer models. Struc- 
tural biologists have become genetic engineers, capturing the geometry of 
complex macromolecules in supercomputers and then simulating interaction 
with other molecules in their search for biologically active agents. Using 
these computational methods, biologists can portray on a computer screen 
the geometry of a cold virus—an intricate polyhedral shape of uncommon 
beauty and fascinating geometric features—and search its surface for molec- 
ular footholds on which to secure their biological assault. 

Geneticists are beginning the monumental effort to map the entire hu- 
man genome, an enterprise requiring expertise in statistics, combinatorics, 
artificial intelligence, and data management to organize billions of bits of 
information. Ecologists—the first mathematical biologists—continue to use 
the extensive theories of population dynamics to predict the behavior and in- 
teraction of species ([9], [32]). Neurologists now use the theory of graphs to 
model networks of nerves in the body and the neural tangle in the brain [10]. 
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Cell biologists study the replication of DNA using the newly discovered alge- 
braic classification of knots ([12], [13]). Epidemiologists monitor the spread 
of AIDS with techniques that blend innovative statistics with classical anal- 
ysis. And, finally, physiologists employ contemporary algorithms applied to 
nineteenth-century equations of fluid dynamics to determine such things as 
the effects of turbulence in the blood caused by cholesterol or swollen heart 
valves [11]. 


Geometrical Mathematics. Ever since Euclid, geometry has been one 
of the major pillars of core mathematics. After decades of decline (espe- 
cially in mathematics teaching), the geometrical view in mathematics has 
undergone a renaissance, assisted both by the development of new theoret- 
ical tools and by the power of computer-based visual representation. In a 
very real sense, geometry is once again playing a central role on the stage 
of mathematics, much as it did in the Greek period. 

Geometry claimed two of the three 1986 Fields Medals, which were 
awarded to Michael Freedman and Simon Donaldson for work in the ge- 
ometry of four dimensional manifolds ([{1], [3], [21]). By exploiting proper- 
ties of the Yang-Mills field equations that reflect the wave-particle duality of 
matter, Donaldson showed that the differential geometry of four dimensional 
manifolds was vastly different than that suggested by their topological struc- 
ture. Freedman provided the topological classification. Together, their work 
yielded not only deep understanding of four-dimensional manifolds, but the 
surprising insight that in four dimensions there are differentiable manifolds 
that are topologically but not differentiably equivalent to the standard Eu- 
clidean four dimensional space. Already, insight from this work has led to 
applications in string theory—the new super-symmetric theory of elemen- 
tary particles—thereby providing fresh evidence (see [34]) of what Eugene 
Wigner called the “unreasonable effectiveness” of mathematics in the phys- 
ical sciences. 

Computer graphics provides a powerful new tool that extends geomet- 
rical techniques into many parts of mathematics. Computers—especially 
supercomputers—can calculate and display various mathematical structures 
in visual form, thereby enabling mathematicians to “see” the significance of 
abstract patterns that before could be interpreted only by formal means. 
For-example, visual representations of solutions of differential equations of- 
ten produce conjectures that open up whole new insights into the behavior of 
the system which the equations represent. Geometrical studies themselves 
regularly yield innovative challenges in the design of new algorithms and 
data structures, with spin-off benefits to applications in computer science 
(for example, database systems and word processing) far removed from the 
original geometric problem. The newly launched Geometry Supercomputing 
Project at the University of Minnesota is one example of the growing inter- 
action of researchers in geometry with those in theoretical computer science. 
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Nonlinear Dynamics. Only in recent years have we been able to pro- 
vide mathematical analyses of problems that are essentially nonlinear (for 
example, turbulence in fluids). These have been made possible by novel ana- 
lytical methods, clever numerical simulation, and visual display on computer 
screens. Applications range from airfoil design to plasma physics, from oil 
recovery to studies of combustion ([7], [15], [16], [22]). 

Nonlinear dynamics has yielded many surprises (see [6], [8], [33]), includ- 
ing long-term localized structures (e.g., the Red Spot on Jupiter), determin- 
istic (rather than stochastically) generated chaotic motion (typical of some 
weather phenomena), and fractal patterns at the interface between fluids 
(for example, displacement of oil by water). The mathematics of nonlinear 
dynamics involves a great deal of traditional analysis (especially differential 
equations), reinforced by iterative processes, automata theory, and fractal 
geometry. 

Computer display of nonlinear phenomena makes visible patterns that 
would never have been noticed by analytic means alone. In research on 
dynamical systems, on the transition from order to chaos, and on the emer- 
gence of fractal shapes from smooth flows, computers are to mathematics 
what telescopes and microscopes are to science: they increase by a thousand- 
fold the portfolio of patterns that mathematicians can see and investigate. 

The Newtonian revolution not only established mathematics as a para- 
digm for scientific reasoning, but it also established determinism as a para- 
digm for the behavior of physical systems. Nonlinear dynamics—a direct 
descendant of Newtonian mathematics—shows how ambiguity and uncer- 
tainty can arise in even simple deterministic systems, and how the onset 
of chaos itself can be predictable. In its power to change our Newtonian 
view of mathematics, nonlinear dynamics is as revolutionary as quantum 
mechanics: each breaks the bond of determinism and reveals entirely new 
structures that often defy what we have come to think of as common sense. 


Mathematics in the Classroom 


I choose these examples, drawn from diverse areas of the mathematical 
sciences, not just to illustrate the vitality of the mathematics that we are 
here to celebrate, but to provide a mathematical and intellectual backdrop 
for what is, unfortunately, a very different portrait of mathematics in the 
classroom. 

In research we see a lot of geometry, a lot of data, a lot of science, a lot of 
computation—together with more traditional mathematical tools. We see 
investigation, exploration, and a continual search for pattern. Contemporary 
mathematics compels attention. It has the power to excite the best minds 
of our youth and to stimulate renewed creativity in teaching mathematics. 

But this mathematics is not the mathematics taught in typical school or 
college classrooms. Far too often, mathematics in the classroom is a freeze- 
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dried mathematics—trigid, cold, and unappealing. Instead of exploration 
there is drill; instead of investigation, imitation. From elementary school 
arithmetic to college calculus, mathematics in the classroom is dramatically 
different from mathematics in practice. 

You've all heard the litany of problems with school mathematics. It ranges 
from poor test performance on international assessments to declining inter- 
est among Americans in pursuing advanced study of mathematics. I won’t 
repeat this evidence here since it has been widely publicized in reports, 
journals, and newsletters ([20], [30], [31]). 

Not yet so well known are the current attempts by several organizations 
(for example, NCTM, MSEB, AAAS, and the University of Chicago) to 
reverse this decline ((23], [27], [28]). These projects have engaged school 
teachers, mathematics educators, and mathematics researchers in collabo- 
rative work on the problems of mathematics education in the schools. Al- 
though these projects differ greatly in purpose and detail, their emerging 
recommendations have much in common that resonates with the nature and 
practice of contemporary mathematics: 


¢ Mathematics should be taught in a natural context; 

¢ Students should be encouraged to create, to invent, and to participate; 

¢ Calculators and computers should be used throughout the mathematics 

curriculum; 

New topics (for example, algorithms, data analysis, estimation) should be 

introduced into the mainstream curriculum; 

¢ Facility in computation need not be a prerequisite to the study of math- 
ematics; 

¢ Mathematics should be studied as an integrated whole; 

¢ Mathematics should help build students’ abilities to reason logically; 

¢ Communication is an important goal of mathematics instruction. 


It doesn’t take much imagination for someone who is familiar with exam- 
ples of contemporary mathematical science to see how student involvement 
in such mathematics could contribute to achieving these goals. Just the ex- 
amples I have cited—statistics, biology, geometry, dynamics—overflow with 
natural context and with opportunities for students to use computers to dis- 
cover patterns. These examples reveal far better than the isolated morsels 
of the traditional curriculum that new mathematical methods are needed to 
solve new problems; that communication is important for one to just under- 
stand, let alone express, the subtleties revealed by mathematical analysis; 
and that mathematics in action requires not only calculation and logic, but 
also intuition, imagination, and organization. 

Unfortunately, too few of those who are most knowledgeable about math- 
ematical research are working with teachers to translate their research into 
experiences suitable for classroom exploration. And far too few teachers— 
even at the college and university level—-have the background, the interest, 
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or the time to learn enough about modern mathematical research to trans- 
late it successfully into classroom experiences. So long as a large gap remains 
between those who create mathematics and those who teach mathematics, 
we cannot expect students to see in mathematics the challenge of an exciting 
and intellectually rewarding career. 


Among the anniversaries we celebrate this winter is the centenary of the 
birth of George Pélya, who died just three years ago. Pdélya was one of 
the rare mathematicians who made major contributions both to mathemat- 
ics research and mathematics education. Andrei Kolmogorov, who died in 
October, was another. 


The December 1987 issue of Mathematics Magazine is devoted to Polya’s 
life and work; I urge you to read it—I’m sure you'll find it as fascinating 
as I did. In that issue, Alan Schoenfeld wrote an interesting analysis of 
Pélya’s theory of heuristics and its impact on teaching students to solve 
mathematical problems. Schoenfeld begins with Pdélya’s dictum that a good 
mathematics education is one that provides systematic opportunities for 
students to discover things. 


How often does our teaching really do that? Think about the contrast of 
the stylized two-column proofs of high school geometry with the exploratory 
possibilities of three-dimensional computer graphics, or of the linking of 
knots, or of topological transformations of common surfaces. Or think, as 
many did at the NRC colloquium on Calculus for a New Century, about the 
contrast between the five thousand exercises in typical calculus books that 
mostly ask students to imitate calculator buttons, and the discovery po- 
tential in symbolic computer systems or in visual presentation of nonlinear 
dynamics. 


Pélya’s discovery dictum was echoed (perhaps unconsciously) at the cal- 
culus colloquium by Oberlin College President Frederick Starr [29]. He cited 
research [2] on college student career choices that shows “incontrovertibly” 
that the only institutions that are successfully resisting the precipitous de- 
cline in the percentage of students entering careers in science are those that 
base their pedagogy on a kind of apprenticeship system. In these schools 
students are brought into the laboratory to pursue real science under the 
direct giiidance of professors who are themselves actively engaged in the 
scientific quest. 


Traditionally, it has been the laboratory sciences—notably chemistry— 
that excel at attracting students by a style of education that involves stu- 
dents in the discovery of science. But now mathematics can do the same. 
With frontiers as exciting as chaotic systems and spatial statistics, there is no 
longer any reason for mathematics to fall behind the more glamorous labora- 
tory sciences in attracting the interest and enthusiasm of our brightest youth. 
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Causes for Celebration 


In celebrating mathematics, we point to the immense success of math- 
ematical research in creating an intellectual understanding of space and 
number, of order and chaos, of pattern and disarray. Mathematics itself is 
beautiful, powerful, and deep; the process of doing mathematics is personally 
stimulating and intellectually rewarding. 

Nevertheless, the profession of mathematics—as distinct from the disci- 
pline of mathematics—is not in good health. Decades of neglect in main- 
taining clear communication channels—with education, with science, with 
the public—have left mathematics isolated from the support systems that 
are vital to its health and well-being. 

Our celebration must become a commitment to communicate. As we 
move into the second century of American mathematics—to continue the 
hyperbole—we should build on the impressive accomplishments in mathe- 
matics itself to bring the excitement and power of the mathematical sciences 
to all Americans. Here are five causes to champion as we celebrate mathe- 
matics in 1988. 


» INVOLVE STUDENTS IN THE PRACTICE OF MATHEMATICS 


The evidence is overwhelming that students receive a better education, 
and are more likely to be attracted to mathematics, when they are actively 
involved in mathematical experiences [25]. Pdlya called it discovery learn- 
ing; Starr described it as apprenticeship education. Although only a few 
mathematics students in the United States now receive the benefit of this 
type of learning, there are many excellent models of such teaching: prob- 
lems competitions, research experiences for students, exploratory computer 
graphics, innovative tutorials where students become teachers, and team- 
based internships in mathematical modelling. It is noteworthy that the Na- 
tional Science Foundation, under the research directorates, has once again 
begun to support programs that provide research experiences to undergradu- 
ates. Especially for undergraduate students, but also in appropriate degrees 
for younger students, we must tilt the balance of mathematics education 
towards greater student involvement in learning. 

Doing this will be expensive, and might involve radical departures in the 
way we finance undergraduate and graduate education. Since teachers tend 
to teach as they were taught, the most effective way to promote discovery 
learning in the schools is to enhance apprenticeship learning in the colleges, 
where tomorrow’s teachers are today learning how to teach by the examples 
set by their college professors. The time is ripe for department chairs to 
insist that universities fund teaching at a sufficient level that all undergrad- 
uates can be taught by experienced teachers who will involve students in 
the excitement of discovering mathematics. 

Apprenticeship education will require significantly better integration both 
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of research mathematics and of contemporary applications in the experience 
of students. Current research, new applications, and the emerging goals 
of mathematics education could resonate in ways that would greatly en- 
hance student involvement in mathematical learning. But such resonance 
cannot happen so long as researchers and teachers continue to operate in 
separate spheres—worlds apart in mathematical outlook, experiences, and 
expectations. Resonance requires significant connection between stimulus 
and resonator. “Vertical integration” of mathematical knowledge that links 
research and applications with education, and that brings researchers into 
active contact with students and teachers, should become a major criterion 
in funding decisions that concern the support of research. 

We should celebrate the wealth of interesting new mathematics by bring- 
ing this mathematics into every classroom in the nation. To do that will re- 
quire changes in the way we judge teaching and in the way we judge research: 
each should be found wanting if it does not include appropriate linkage with 
the other. It is not enough that individuals be competent as teachers and 
separately as professionals: separate but equal is as inadequate as a model 
for the relation of teaching and professional activity as it is for racial compo- 
sition of public schools. Our goal should be professional standards that insist 
on apprenticeship learning and vertical integration of mathematical research. 


»s EDUCATE THE ATTENTIVE PUBLIC 


Far too many educated persons are ignorant of mathematics. More trou- 
blesome, most do not feel their mathematical ignorance to be a great hand- 
icap. Most successful lawyers, politicians, educators, business executives— 
and university administrators—have achieved positions of prominence with 
only a minimal (and frequently archaic) knowledge of mathematics. More- 
over, many persons, whether well educated or not, harbor feelings of appre- 
hension or even anxiety about mathematics due to an unpleasant early ed- 
ucational experience, often with something labelled the “new math.” When 
we try to take the case for mathematics to the public—or even just to univer- 
sity administrators—we face not only the healthy skepticism that naturally 
greets any self-serving argument, but also ignorance, fear, and often hostility 
that is a legacy of our neglect of mathematics education. 

The state of mathematics as a profession compels us to find ways to di- 
minish, public fear and ignorance of mathematics, for without broad public 
support—for teaching, for research, for encouragement of students—there 
is no possible way for the mathematical community on its own to sustain 
the momentum of the past half-century. Now, however, perhaps for the first 
time, the breadth of the mathematical landscape makes it possible at least 
to imagine overcoming this pervasive public apprehension of mathematics. 

Mathematics now touches people’s lives in ways that matter and that 
can be described and revealed in human terms. From symmetry and chaos 


1988] CELEBRATING MATHEMATICS 423 


to computers and cosmology, from AIDS epidemiology and nuclear risks to 
political polls and ozone depletion, mathematics lurks behind most mani- 
festations of science and technology. The same mathematization of society 
that makes the task of public understanding so essential also provides the 
means by which the task can be started—by building on mathematical ideas 
that are part of daily experience. 

Surely part of our celebration must be to tell the story of mathematics to 
the attentive public. We are not publicists, but we are teachers. Suppose 
each department of mathematics made a commitment, just once each year, 
to arrange a public event that made mathematics visible in their community: 
an outside speaker who is working on something in which the public might 
be interested; a student project that involved a practical problem of interest 
to the community; a forum on the changing nature of school mathematics; or 
an exposition of a slice of mathematics related to some professor’s research. 
Since the public is always more interested in people than in abstractions, 
there are, in addition, many good opportunities for news stories in home- 
town papers about the accomplishments of students. Someday some math- 
ematics department should try to put out as many publicity releases on the 
accomplishments of their students as the athletic department does of theirs. 


» EXPLORE FUNDAMENTAL ISSUES IN MATHEMATICS EDUCATION 


Computers influence mathematicians not only by providing new tools for 
research and teaching, but also by posing deep questions about central is- 
sues in our discipline. Now that calculators can manipulate symbols and 
calculate answers, 


¢ What—if not arithmetic—should be the core of elementary school math- 
ematics? 

¢ What—if not manipulation—should be the core of high school algebra? 

¢ What—if not calculation—should be the core of calculus? 

¢ What—if not calculus—should be the core of college mathematics? 


At the same time that computers force attention on issues that are deeply 
rooted in unexamined tradition, mathematical research has transformed the 
nature of mathematics, opening up new options for what might be consid- 
ered central and what derivative among the concepts of mathematics. 

We need to find new threads of continuity with which to weave a mathe- 
matics curriculum for the twenty-first century. Finding appropriate central 
themes poses an immense challenge for the best minds among us, researchers 
and teachers alike. It gives common purpose to our diverse expertise, and 
sets a common agenda for those in research, those in college teaching, and 
those in school mathematics. 

This provides yet another occasion for celebration: the opportunity, joined 
with the need, to transform school mathematics in ways that reflect the rich- 
ness and diversity of mathematical research and mathematical applications. 
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Vertical integration of research, applications, and teaching will help bring 
about this transformation. But we need, in addition, structured opportuni- 
ties for reflection in which the most synoptic thinkers among us bring their 
experiences in research, in applications, and in teaching to bear on the task 
of articulating central themes for mathematics education as we move into 
the next century. 


» ENSURE FOR ALL STUDENTS 
EQUAL OPPORTUNITIES FOR MATHEMATICAL SUCCESS 


Despite the culturally neutral status of mathematics (as compared, say, to 
biology), the last decade has produced distressing evidence of class and eth- 
nic distinctions arising as a result of the way mathematics education is prac- 
ticed in the United States. One-third of U.S. students—Blacks, Hispanics, 
and Native Americans—provides fewer than 10% of mathematics graduates, 
despite the evidence from isolated model programs that excellent retention 
and success rates can be achieved within a suitable educational context. An- 
other third of U.S. students—white females—drops out of advanced degree 
programs in the mathematical sciences at twice the rate of male students. 

Compounding these problems of class distinctions are the political, ed- 
ucational, and social side effects of large numbers of foreign-born teaching 
assistants in our major universities. It is easy to make a strong case for 
having many foreign graduate students in our universities; I join the many 
scientific leaders who defend this practice which has led to the United States 
being, in the words of Robert White, the “schoolhouse of the world.” It is 
harder to make a case for placing inexperienced foreign graduate students 
in the classroom as instructors for American students who are not prepared 
to cope simultaneously with the challenge of a foreign culture, a foreign 
language, and a foreign discipline—namely, mathematics. 

The consequence of these two unrelated trends is that both majority and 
minority students in college classes often receive mathematics instruction in 
a context that is culturally alien to them. Students who have too little in 
common with their teachers are unable to see themselves as future mathe- 
maticians or mathematics teachers. In this context, it is not surprising that 
even white U.S. males are no longer choosing careers in mathematical sci- 
ences. As I am sure you are well aware, the number of U.S. males receiving 
Ph.D. degrees each year in mathematics is less than 40% of what it was 
fifteen years ago. 

The problem of opportunity in mathematics is so serious and so difficult 
that it is hard to even imagine a solution that is feasible, much less opti- 
mal. However, the mathematical community has an enormous resource that 
can be brought to bear on this problem—namely, the strong and cultur- 
ally diverse community of research mathematicians that proves by its very 
existence the universality of mathematics. We need to find effective ways 
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of conveying the rich and worldwide nature of mathematics to youth from 
the many subcultures that contribute to the American mosaic, and then 
to provide a context appropriate to their backgrounds in which to nurture 
mathematical talent. 

The Professional Development Program at at Berkeley, led by Uri Treis- 
man and Leon Henkin, has transformed the success rates of minority fresh- 
men and enabled many to finish Berkeley and proceed to advanced or profes- 
sional degrees. The University of Michigan has had considerable success in 
interesting minority students in careers in science through a special program 
that provides experiences in undergraduate research. These examples—and 
I’m sure there are others—show that it 1s possible to successfully attract 
talented minority students to careers in mathematics and science. Making 
progress in this endeavor would provide just cause for a true celebration. 


=» INVEST IN TODAY’S EDUCATION 
TO STRENGTHEN TOMORROW’S RESEARCH 


Current debate about support for mathematics too often pits research 
against education, when in reality today’s education is the pipeline for to- 
morrow’s research. We read in the Notices of mathematicians who are under 
pressure to get research grants on pain of “being fired, having their teaching 
loads raised, or not getting raises.” We hear continuing concern in the re- 
search community that, in times of limited budgets, new funds for education 
might be subtracted from the already limited amounts available for support 
of basic research—despite the fact that the percentage of federal support for 
science and mathematics that goes to education has slipped during the past 
four decades from nearly 50% to around 10%. 

These arguments follow the same pattern that has led to the enormous 
growth in the federal deficit: by giving priority to the immediate needs of 
those in positions of power, we in effect support adults at the expense of 
children. Mathematicians, of all people, should be able to plan strategies 
that will optimize the strength of mathematical and scientific research over 
the long term. Part of that strategy is the recognition that education is not 
an alternative to research, but the foundation for future research. 

Our celebration of the bounty of mathematical research must entail a 
commitment to education as the wellspring of research. We need to use mul- 
tidimensional criteria in deciding on priorities for our community, seeking 
strategies that lead simultaneously to improvement in school mathematics, 
in collegiate mathematics, in graduate education, and in research. 


All One System 


Despite appearances to the contrary, mathematical research is inextri- 
cably entwined with mathematics education at all levels, with science and 
engineering, and with political, economic, and sociological aspects of society 
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at large. Educators and researchers, teachers and professors, mathemati- 
cians and scientists—we are all part of a single system of knowledge on 
which contemporary society depends. 

It is in this spirit that we join in celebrating the centenary of the Ameri- 
can Mathematical Society. “100 Years of American Mathematics” provides 
the occasion for building new mathematical science on the firm foundation 
of the past century’s research. Our centenary causes should be as broad and 
sweeping as our discipline: 
= To involve students in the practice of mathematics. 
= To educate the attentive public. 
= To explore fundamental issues in mathematics education. 
= To ensure for all students equal opportunity for mathematical success. 
= To invest in today’s education to strengthen tomorrow’s research. 


What transforms these causes from empty rhetoric to concrete options is 
the opportunity for education and communication implicit in the advances 
of today’s mathematical sciences—in such areas as computational statistics, 
mathematical biology, geometrical mathematics, and nonlinear dynamics. 
It is in the frontiers of mathematical science—not in current textbooks or 
today’s classrooms—that one can find the innovative and intellectually re- 
warding options needed to transform education, to excite our youth, to 
educate the public, and to reach all Americans. 

This should be our centennial cause, not just for 1988 but for the rest of 
this century. It is a cause that can unite researchers and educators in a com- 
mon challenge: To let the power and beauty of mathematics speak for itself. 
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Unsolved Problems 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N 1N4. 


Has Every Latin Square of Order n a Partial Latin Transversal 
of Size n — 1? 


P. Erpés 
Hungarian Academy of Sciences, Realtanoda u. 13-15, Budapest V, Hungary 


D. R. HICKERSON, D. A. NORTON, AND S. K. STEIN 
Department of Mathematics, University of California at Davis, Davis, CA 95616 


The notion of a transversal of a latin square easily generalizes to more general 
arrays. We present some of the known combinatorial results in this area and 
indicate the open questions. 

They concern m X n rectangular arrays of mn cells (m < n), each cell containing 
a symbol. A transversal of an array is a set of m cells, no two in the same row or 
same column. A latin transversal is one whose symbols are distinct. A row or 
column is latin if its symbols are distinct. If m = n and each symbol occurs exactly 
n times, we call the array an equi-n-square. If each row and column of an 
equi-n-square is latin, it is a latin square. The definitive work [1] on the subject, 
published in 1974, contains 639 references and 73 problems. The earliest result is 
due to Euler. 

Marshall Hall [2, and see the earlier work of Lowell Paige, 3] examined transver- 
sals of n — 1 by n arrays obtained from an abelian group A of order n, as follows. 
Let A = {a,,..., a,}. Corresponding to each element a in A is the sequence 
{a+ a,,...,a+ a, }, recording translation by a. Let {b,,..., 5,_,} be a sequence 
of n — 1 not necessarily distinct elements of A, and form the n — 1 by n array of 
the translations of A by 5,,..., 5,_,. Each row is latin, but there may be duplica- 
tions in the columns. Hall proved that such an array has a latin transversal. 

In [5] Stein introduced several types of arrays, including the equi-n-square, and 
established the existence of transversals with many distinct elements. For example, 
the equi-n-square has a transversal with at least 


1 1 1 
n I~ tg cc ty = (1 —1/e)n = 0.63n 


distinct symbols. He also gave equi-n-squares (n > 2) without a latin transversal, 
and showed that n by n arrays in which each symbol appears exactly q times have 
transversals with at least n — q/2 symbols. 


428 


1988] UNSOLVED PROBLEMS 429 


More recently, Shor [4] showed that every latin square has a partial latin 
transversal of length at least 


n — §.53(Inn)° 


and observed that his method will yield better results, but not as good as n — log,n 
(log to base 2). 
There are several outstanding conjectures. The most noteworthy are: 


Conjecture 1. An equi-n-square has a transversal with at least n — 1 distinct 
symbols. 


The special case of the title has been associated with Herb Ryser’s name, while [1, p. 
103] attributes it to Richard Brualdi. The answer is affirmative for latin squares 
which are the Cayley tables of abelian groups. 


Conjecture 2. An n — 1 by n array in which each symbol appears at most q times 
(q <n) has a latin transversal. 


The case g =n would imply that a row-latin n —1 by n array has a latin 
transversal, and would also imply Conjecture 1. For g = 1, conjecture 2 is trivally 
true, and the case g = 2 follows from g = 3, which can be proved as follows. 


THEOREM 1. Let 2 < k < m < nand assume that every m by n array in which each 
symbol appears at most 3 times has a latin transversal that misses k proscribed cells. 
Then every such m + 1 by n + 1 array has a latin transversal that misses k proscribed 
cells. 


Proof. Consider an m + 1 by n + 1 array in which each symbol appears at most 
3 times and in which k cells are proscribed. There are two cases: (1) some row or 
column contains at least 2 proscribed cells; (2) no row or column contains more 
than 1 proscribed cell. 


Case 1. Consider a row (or column) with at least 2 proscribed cells. Since k < m, 
there is a cell c in that row which is not proscribed. Denote the symbol in that cell 
by ¢. Delete the row and column containing c, producing an m by n array. This 
array has at most k — 2 proscribed cells and at most 2 cells with the symbol f, so it 
has a latin transversal that misses the proscribed cells and does not have the symbol 
t in it. Adjunction of cell c gives a latin transversal for the original m+ 1 by n+ 1 
array. 


Case 2. Consider a cell that is on a row of one proscribed cell and on a column 
of another proscribed cell. Delete the row and column of this cell and argue as in 
Case 1. 

An exhaustive search showed that every-4 by 5 array in which each symbol occurs 
at most 3 times has a latin transversal avoiding any 2 proscribed elements. Hence 
every m by m + 1 array has this property for m > 4. A similar analysis showed that 
for n > 6, every n by n array in which each symbol appears at most 3 times 
possesses a latin transversal avoiding 2 proscribed cells. 

Theorem 2 is proved much as Theorem 1. 


THEOREM 2. If every m by n array in which each symbol occurs at most 3 times has 
a latin transversal that passes through any given cell, then every m + 1 by n + 1 array 
in which each symbol appears at most 3 times has the same property. 
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Note that there is no latin transversal through the top left cell of a 4 by 4 array 
which contains the configuration of Fic. 1. 
Our final theorem is an unpublished result of Erdos and Joel Spencer. 


THEOREM 3. Let k be fixed. Ann X n array in which each symbol appears at most 
k times has a latin transversal if n is sufficiently large. 


Their proof will also work if k < (Inn)'~* and they believe that it may be 
possible to make it do so for k < (Inn)° for every fixed c, but beyond this there 
may be serious difficulties. [J. Dénes [6] made a conjecture equivalent to Conjecture 
1, and some other relevant remarks.— Ed.] 
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NOTES 


EDITED BY DENNIS DETURCK, RICHARD LIBERA, AND ANITA E. SOLOW 
More on Fresnel Integrals 


I. E. LEONARD 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, TL 60115 


In [2], Flanders evaluates the Fresnel integrals 
Io = i cosx?dx and G)= i sin x? dx 
0 0 


using a strictly real-variable derivation. He considers, for ¢ > 0, 


F(t) = f~ 


_ 2 oe) _ 2. 
e~cosx*dx and G(t) =f e~ sin x? dx 
0 


0 


and shows that they satisfy the functional equations 


F(t) — G(t)’ = Ta and 2F(t)G(t) = 


which he then solves to obtain 


r 7 yVl+?t? +t 6 7 yl+t* -1f 
1) = ,/- |) ————_ , t) = Van 
(7) V8 1+? () V8 1+? 


Towards the end of the article, Flanders remarks that replacing x” by x yields the 
Laplace transforms of (1 es “ve vx) and (1/2)(sin x/ yx ), respectively, that is, 
sin x 


F(t) = 5 fre “dx and G(t) = sf" “hd. (1) 


In this note, we show harp F and G can be transformed into improper integrals 
which may be evaluated directly, using only the method of partial fractions; see, e.g. 


[3]. 


For ¢ > 0, 
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and similarly, 


00 (u? +t) du + t) du 
ral 


1+ (u? 14 (u24+1) 


F(t) = (3) 


for ¢ > 0. 

The interchange of the order of integration is easily justified, since f(s, x) = 
(sin x/ Vs )e~ © ** is continuous for0 <a<s<b,0 <x < o, and [<f(s, x) dx 
converges uniformly for s in [a, b]. (see, e.g. [1], p. 207). Use was also made of the 
elementary Laplace transforms 


foe) 1 T CO 1 
i —e “ds = /—, i sin x e @t0* dx = —_—F5 > 
o vs x 0 1+(s4+1) 
and 
st+ft 


10.8) 
i cos xe © *9* dx = ————_5. 
0 1+(s4+ tf) 


Now we factor 1 + (u? + ¢)? over R into (irreducible) factors: 


1+ (u24 1) = (u2 —2au + b)(u? + 2au + b), 
where 
1 
alite ~ +)” and b= V14+t°. 


The partial fraction decompositions of the integrands in (2) and (3) are easily 
found to be 


a= 


1 1 2(u-a) 1 1 
1+ (u241)? 8abu?-2au+b 4b (u—a)?+b-a? 
1 2ut+a) 1 1 


8ab u2 + 2au + b 4b (u+a)+b-a? 
and 


u-+ft b-—t 2(u-a) b+t 1 


1+(u2+1) 8ab wre-2aut+b 4b (u-—a)+b-a? 


b-t 2(u+a) b+t 1 
Integrating, we obtain 
u* + 2au + b 


u* —2au +b 


f du ty 
_ w# _ 
1+ (u2+1) Bab” 


1 u 
+ —____— tan" 
AbVb — a? | b — a? 
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u* — 2au+ b 
u* + 2au+ b 


1+ (u? + t)° Bab 
b+t if 47a if uta 
” Goyb a? ran are | * fan ren | 
We can now evaluate the integrals (2) and (3) and obtain (1) for ¢ > 0. Since F 
and G are continuous on [0, 00), letting t — 0°, we have 


ore) TT 00 7 
Fy= f cos x* dx = | = and Go= f sin x°dx = y/ © 


as required. 


0g 


(u? + t) du b—-t 
J 
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A Class of Abelian Groups Arising from an Analysis of a Proof 


Gary A, MARTIN 
Mathematics Department, Purdue University, Calumet Campus, Hammond, IN 46323 


The 1973 Putnam Competition asks for a proof of the fact that if 2n + 1 integers 
have the property that no matter which integer is deleted, some n of those 
remaining have the same sum as the other n, then all of the original 2n + 1 
numbers are equal. Even before beginning to try to prove this, I was struck by the 
hypothesis that the numbers are integers, for the theorem seems equally plausible 
for rational, real, and complex numbers. Indeed, the assertion is meaningful for any 
abelian group. As I will demonstrate below, it is true for precisely those abelian 
groups which have no nontrivial elements of odd order. The proof is a generaliza- 
tion and extension of the slick proof of the original statement, given in [1]. 

While trying to solve the problem for integers, I made two useful observations, 
both of which provided evidence that the assertion was indeed true in greater 
generality. First, one can generalize immediately to rationals by multiplying each 
number by a common multiple of the denominators, applying the known result for 
integers, and then dividing. This idea, that the truth of the hypothesis and the truth 
of the conclusion are invariant under a basic operation, reappears in Lemma 7 and 
is one of two ideas that constitute the proof given in [1]. Second, if 2n + 1 complex 
numbers satisfy the hypothesis of the problem, then so do their real and imaginary 
parts. If one assumes that the statement is true of real numbers, then the 2n + 1 real 
parts are equal and the 2n + 1 imaginary parts are equal, so the complex numbers 
are all equal. In a more general setting, this is just the observation that the class of 
abelian groups for which the statement is true is closed under direct products. This 
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suggests that one should consider the problem from a universal-algebraic point of 
view. 

We will use the following notation: For each n and any x,,...,X,,, In an 
abelian group, let ,(x,,..., X2,,41) be the statement of the hypothesis, namely, that 
for each k, there is a partition of the indices excluding k into two classes each of 
size n for which the sum of the x, for j in one class equals the sum of the x, for j in 


the other class. For each n, let ,(x,,..-,X 2,41) be the statement that x, = x, = 
- = X,,,,. For each n, let 6, be the statement 
VX}, cs Xon41(G, (1, nae Xon+1) => v,(%4, ar, Xonv1)): 
Then 9,(X1,---5% 2,41) and W,(X,..-,X2,4 1) are statements about 2 + 1 ele- 


ments of an abelian group, whose truth depends on both the choice of those 
elements and the group. @, is a statement about abelian groups; if 6, is true in the 
abelian group A, then one says that A satisfies @,,. 

We begin by examining the closure properties of the class of abelian groups 
which satisfy any particular 6.. 


LEMMA 1. Jf A is an abelian group satisfying 0,, and if B is a subgroup of A, then 
B satisfies 6,,. 


The proof is an easy exercise for the reader. 


LEMMA 2. Jf A is an abelian group, and every finitely generated subgroup of A 
satisfies @,, then A satisfies 6. 


Proof. Fix X,,...,;%X3,,, mn A. If p,(x),.--, X2,,41) 1s true in A, then it is also 
true in the subgroup of A generated by x,,..., xX ,,,. Then w,(%,,.-.,; %2,4) 1s 
true in that subgroup, and hence in 4 as well. 


COROLLARY 3. An abelian group satisfies 0, if and only if each of its finitely 
generated subgroups does. 


LEMMA 4. If 6, is satisfied by each of the abelian groups A,,..., A 
satisfied by the direct sum A = ®A,,. 


then 6, is 


n? 


Proof. Given X,...,X2,4, In A, let x; , denote the Ath coordinate of x,. 
Suppose 9,,(X1,..., 2,41) is true in A. Then 9,(X, ,,..-, X2,41,,) 18 true in A,, 
since the partition of the indices which works for A will also work for A,. Then by 
hypothesis, W,,(X,45-++» Xon41,,) is true in A, for each k, so y,,(x,,..., X2,41) 18 
true in A. 


COROLLARY 5. An abelian group A satisfies @,, if and only if each cyclic subgroup of 
A satisfies @.,. 


Proofe Suppose that each cyclic subgroup of A satisfies @,. Any finitely generated 
subgroup B of A is a direct sum of cyclic subgroups of A by the fundamental 
theorem of finitely generated abelian groups. By Lemma 4, B satisfies @,. By 
Corollary 3, A satisfies 6,. The converse follows from Lemma 1. 


It suffices, then, to consider cyclic groups. The style of the statements changes as 
the central issue comes more clearly into focus. 


LEMMA 6. If a cyclic group A satisfies every 6,, then either A is infinite or A has 
order 2™ for some m. 
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Proof. If the conclusion fails, then A has an element a of order n, where n > 1 is 
odd. Let x, = --: =x,=0 and x,,, = +++ =X),4, =a. Then if a zero is 
deleted, half of the n + 1 a’s may be paired with half of the n — 1 zeros to obtain 
equal sums, and if an a is deleted, the remaining n a’s add to zero, as do the n 
zeros. This demonstrates the failure of @, in A. 


LEMMA 7. If ,(X1,---5 Xon41) holds for x1,..., X,,4 1 in an abelian group A, then 
for anya © A, 9,(X, + 4,.--, X24, + @) holds in A as well. 


Proof. For each k, the partition of the x,’s which works when x, is deleted also 
works for the x, + a’s when x, + a is deleted. 


LEMMA 8. If B is a subgroup of index 2 in an abelian group A, and if 
D,(X13-++5 Xan 41) holds for X1,...,Xon4, € A, then x, = +++ =Xy,4,(mod B). 


Proof. For any x € A, 2x © B. For any k, the sum of all of the x,’s is x, plus 
twice the sum of some n of the other x,’s. Thus each x, is congruent modulo B to 
the sum of all of the x,’s, and hence all of the x,’s are congruent to each other. 


LEMMA 9. Jf A is infinite cyclic or cyclic of order 2”, then A satisfies every @.,. 


Proof. In either case, A has a descending chain of subgroups, A > 2A > 4A > 
--+, each of index 2 in the preceding, whose intersection is the trivial subgroup. 
Let X,,..-,%5,4, € A, and suppose 9,(x1,..., 2,41) holds. Then by Lemma 7, 
@, (0, X2. — Xy,--+» Xan41 7 X1) holds as well. By Lemma 8, each of x, — x,,..., 
X5,41 — X, lies in each of the subgroups in the descending chain. Thus x, = 
Xo50065Xy = Xap gt: 


(The proof of Lemma 9 for infinite cyclic groups is essentially that appearing in 


[1].) 


The classification of abelian groups which satisfy every @, is now complete. 


THEOREM 10. An abelian group satisfies every 6, if and only if it has no nontrivial 
elements of odd order. 


Proof. This follows immediately from Corollary 5, Lemma 6, and Lemma 9. 


There is another natural direction in which to generalize the problem. Fix an 
integer r > 1. Consider the class C. of all abelian groups A for which for every n the 
following statement is true: If 7m + 1 elements of A have the property that no 
matter which one is deleted, the rest can be partitioned into r sets of size n all with 
the same sum, then all rn + 1 elements must be equal. Again, the problem is to find 
a description of this class of abelian groups. 

In direct analogy to Lemma 6, if A has an element a whose order n > 1 is 
relatively prime to r, then one sees that A is not in the class C. as follows. Let 
0 <m<vyr bea solution to mn = 1 (mod r). Take mn of the elements to be 0 and 
the remaining (7 — m)n + 1 to be a. Then if a 0 is deleted, both the number of 
remaining 0’s and the number of remaining a’s are divisible by 7, so one may take 
the same number of 0’s and a’s in each block of the partition. If an a is deleted, 
then take m sets of n zeros and r — m sets of n a’s. 

Lemmas 1, 2, and 4, and Corollaries 3 and 5 generalize easily to this setting, with 
the consequence that an abelian group is in C, if and only if all of its cyclic 
subgroups are. By the example in the preceding paragraph, it suffices to analyze the 
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infinite cyclic group and cyclic groups of order p”, where p is a prime which divides 

r. It is easy to see that Lemmas 7 and 8 remain true when 2 is replaced by r and the 

subgroup B is taken to have index p in A. Then Lemma 9 generalizes since both the 

infinite cyclic group and the cyclic groups of order p” have decreasing sequences of 

subgroups, each of index p in the preceding, whose intersection is trivial. 
Therefore the generalization of Theorem 10 is: 


THEOREM 11. An abelian group is in the class C, if and only if it has no nontrivial 
element whose order is relatively prime to r. 


I am grateful to Charles Weibel and Ali Nesin for contributing ideas which are crucial to the analysis 
and to David Hecker for showing the problem to me. 
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The Wedderburn Theorem of Finite Division Rings 


JOHN SCHUE 
Department of Mathematics and Computer Science, Macalester College, St. Paul, MN 55105 


Last year, a senior honors student and I, while working on properties of finite 
rings, went through the two proofs, given in [1] of the Wedderburn theorem, that a 
finite division ring is a field. The first proof is a variation (due to Witt) of the 
original proof in 1905, and is largely number theoretic, while the second is 
essentially that given by Herstein in [2]. It is much more algebraic in nature 
(intentionally so) but becomes rather complicated technically. The proof presented 
here is in the same spirit but more conceptual than computational. It is intended to 
be accessible to the dedicated reader of a book like [1], with its major claim to 
acceptance perhaps resting on its drawing together a number of seemingly unrelated 
ideas developed earlier in the book. There are, of course, numerous other proofs 
available in the literature. Some, such as those found in [4, p. 225] (where it is an 
exercise) or in [3] obtain the theorem as a specialization of the much more general 
theory of simple algebras, while another, almost entirely group theoretic, can be 
found in [5]. 

Following Herstein, we let D be a finite division ring with center Z and assume 
that any division ring of smaller order than D is a field. We assume Z # D, and will 
show this leads to a contradiction. For a € d, if C(a) = {x © D: ax = xa}, then 
C(a) is a subdivision ring containing Z and a so that, if a € Z, C(a) is proper 
and hence is a field. Since any proper subdivision ring containing a also is 
contained in C(a), then C(a) is maximal in D as a subdivision ring. We let D* be 
the multiplicative group of nonzero elements of D and, for a subset S of D, S* = 
SO D*. If p is the characteristic of D then p is a prime integer and |D| = p” for 
some integer n. For a, x © D we let D,x =[a, x] = ax — xa. Then D, is an 
additive operator of D with kernel C(a) and, by Lemma 7.2.1, p. 369 of [1], we have 
D? = D,p. 

Suppose now b is chosen as a fixed element of D with b ¢ Z and K = C(b). 
Then K is a finite field of order p” for some m < n. If N* = {x © D*: xKx7} Cc 
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K }, then N * is a subgroup of D* containing K * as a normal subgroup. Since K * 
is a cyclic group, we can choose a generator g for K *. Then, for x € D*, x Ee N* 
is equivalent to xgx~! © K and, for y © N*, we will have y © Kx if and only if 
xgx 1 = ygy!. We let g =|N*/K *|. 

Now D is a (left) vector space over K and, for a © K, D, isa K-linear operator 
on D. From the above, we have 


DP” = DP?” = D,,. 

Thus the minimal polynomial for D, is a divisor of 
AP" — Xd 

and necessarily splits in K[A] into distinct first degree factors so that D, is 
diagonalizable. Applying this to D,, for E, = {x: [g, x] = kx}, we have E, = K 
and D = LE,, where the sum is direct and taken over all k € K with E, # 0. Now, 
if x € D*, gx — xg = kx for some k € K is equivalent to requiring that x belong 
to N*. Moreover, y € E, is equivalent to y € Kx. Then each E, is a K-subspace 
of dimension 1 and £,* is the coset K *x in N *. Hence, dim, D = q. 

From the structure of finite fields it follows readily that K is a Galois extension 
of Z. We can identify N*/K * with a subgroup of G(K/Z) and, if J is the fixed 
field for N*/K*, a © J implies xax~' = a for all x © N*. Then D, is zero on 
each E, so that D, = O and a € Z. Hence N*/K * = G(K/Z), implying dim 7K = 
IN */K *| = q. 

Combining the results above leads to dim,D = (dim, D)\(dim,K ) = g* and 
dim,C(b) = q forall bE D, bE Z. If r = |Z| then 


ID*;=r% —1 


and |C(b)*|=r%—1 for b € Z. Thus, if s is the number of conjugacy classes 
containing more than one element, the class equation applied to D* gives 


r? —l=(r—1)+ s(rv ~1/r7- 1), 


which implies r%7~) + --- +r%+ 1 must be a divisor of r — 1, giving the desired 
contradiction. 
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Common Hyperplane Medians for Random Vectors 


THEODORE P. HILL* 
School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332 


Suppose some black and some white points are sprinkled randomly on the plane 
(e.g., salt and pepper on a table). Is there always a straight line so that at least half 


*Partially supported by NSF Grants DMS-84-01604 and DMS-86-01608. 
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of the black points and at least half of the white points lie on one side of the line 
(including the line), and at least half of each color lies on the other side (again 
including the line)? Steinhaus’ “‘Ham Sandwich Theorem” (Corollary 1 below) says 
there is always a line bisecting the total areas occupied by both the black and the 
white spots, but does not answer the question for true (volumeless) points. The 
purpose of this note is to prove a generalization of the Ham Sandwich Theorem 
which includes all distributions: continuous, purely atomic, and mixed. If the black 
and white points represent statistical data or observations, then the positions where 
they fall are random vectors, and it is in this terminology that the theorem will be 
stated. (The two-dimensional example of black and white points on a plane was 
used only for illustrative purposes; what is essential is that there are no more than n 
colors in Euclidean n-space.) 

A median of a random variable is a real number a with the property that the 
probability the random variable is less than or equal to a, and the probability that it 
is bigger than or equal to a, are both at least one-half. One way of generalizing the 
notion of a median of a random variable to that of a median of a random vector is 
via hyperplanes. 


DEFINITION. A hyperplane median for an n-dimensional random vector X= 
(X,,.-., X,) is an (n — 1)-dimensional hyperplane 


n 
ya jx; = 6 in R" satisfying 
j=l 


P 


» 4,X,> | >1/2and P| )| a,X,< | > 1/2. 
j=l j=l 

Clearly every finite-dimensional random vector has a hyperplane median, and the 
main purpose of this note is to prove the following generalization of this fact. 


THEOREM 1. Every collection of m < n random n-dimensional vectors has a com- 
mon hyperplane median. 


An immediate corollary is the Ham Sandwich Theorem, apparently first proved 
by Steinhaus (cf. [2], p. 291). 


COROLLARY 1. (Ham Sandwich Theorem) /f B,,..., B, are bounded Lebesgue 
measurable sets in RR”, then there is a hyperplane which simultaneously bisects the 
measure (n-dimensional volume) of each B,. 


(The picturesque name Ham Sandwich Problem is also attributed to Steinhaus, 
who viewed it as the problem of simultaneously bisecting the ham, cheese, and 
bread in-an ordinary ham sandwich by a single planar cut of a knife.) 


Proof of Corollary. Without loss of generality, assume each B, has strictly 
positive measure, and let Y, be the n-dimensional random vector which is uniformly 
distributed on B,. Then apply Theorem 1.1 to Y,,..., Y,. 0 


The key idea in the proof of Theorem 1 is an application of the following 
theorem of Borsuk and Ulam [1]; unlike the case in the Ham Sandwich Theorem, 
however, the measures (distributions) involved are not assumed to be continuous (or 
even nonatomic), and the usual continuity argument fails. 
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BORSUK-ULAM THEOREM. If f is a continuous map of the surface of the unit ball in 
(n + 1)-dimensional space into n-dimensional space such that f(—x) = —f(x) for 
every X, then there is some point on the ball mapped into the origin. 


Let X be an n-dimensional random vector with distribution ww (ie., (A) = 
P(X € A) for every n- -dimensional Borel set A), let x - y denote the usual inner 
product of x and jy; and let ||x'|| be the norm of x. 


DEFINITION. For 0 € R”, m,(v), m*(v), and m(v) are the lower, upper, and 
midpoint (respectively) medians for X orthogonal to v. That is, m,, m*, and m are 
the functions from R” — R given by: 


m,(5) = inf{a: P(X-i<a)>1/2 and P(X-vF>a)>1/2); 
m*(@) = supfa: P(X-i<a)>1/2 and P(X-7>a)>1/2}; 
and 
m(o) = (m,(0) + m*(v))/2. 


The proof of Theorem 1 will be based on two lemmas: the first lists some easy 
facts concerning m,, m* and m; and the second is the key continuity result. 


LEMMA 1. 
(i) m,(v), m*(v), and m(v) are finite and attained for all v; 
(ii) m4(0) = m*(0) = m(0) = 0; 
(iil) m(—v0) = —m(0v); and 
(iv) ifu+# 0 and m(v) = a, thenX-v=aisa hyperplane median for Xx. 
LEMMA 2. If the support of uw is compact, then m,, m*, and m are continuous. 


Proof. Only the argument for the continuity of m, will be given; that for m* 
follows similarly, and together they imply the continuity of m. 

Let K = support of mw; since K is compact there exists an M > 0 such that 
||| < M for all x in K. 

Fix uv € R” and e > 0; then for every v in R” satisfying ||w@ — v]| < e/M, 


|x - (@—)| < e forall Xin K. (1) 

Let a = m,(uv). Then 
{XE K: X-v0<a-—2es C{XEK: x. 
C{xXEK: 
{x € K: 


<a-—e} 
nae (2) 


ate}. 


“| 
=] =| 


IN 
“ty 


(For the first inclusion in (2), observe that x. U < a — 2e implies X-u < a — 2e + 
xX - (@ — w) which by (1) implies x - i! < a — e. The second inclusion is trivial and 
the third is analogous to the first.) 

Since p(K ) = 1, it follows from (2) and the definition of a that 


P(X-T<a-2e) << P(X-t<a-—e)<1/2 
<P(X-i<a)<P(X-t<ate). 


From (3) it follows that a — 2e < m,(U) < a + &, So since e€ was arbitrary and 
a = m,(u), this establishes the continuity of m,. O 


(3) 
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The conclusion of Lemma 2 may fail if the support of w is not compact, even if p 
is continuous (i.e., absolutely continuous with respect to Lebesgue measure). 


Example 1. Suppose X is a continuous 2-dimensional random vector with strictly 
positive density of total mass 1/2 on A = {(x,y): 1 <x <2, y © R} and uni- 
formly distributed with total mass 1/2 on B= {(x,y):3 <x<4,-l<y<l}.lt 


is easy to check that m((1,0)) = 5/2, but that lim, , .,.m((1,n~*)) = 3. 
Proof of Theorem 1. Fix m < n, let Xe. _ , xX, be n-dimensional random vectors 
with distributions Masses bm respectively, and let m, denote the midpoint-median 


function for xX, (as in Definition 2.1). Without loss of generality, m =n. 


Case 1. The support of each p,, i = 1,..., n, is compact. 

Let f be the function from the surface of the unit ball in R”*! into R” given by 
f=(f:..-;f,), where f,(v,B) = m,(v) — B for all i=1,...,n 

From Lemma 2 it follows easily that f is continuous, and from Lemma 1(iii) that 
f(—v,— 8) = —f(v,8), so the Borsuk-Ulam theorem implies there exists a point 
(0,85) on the surface of the unit ball in R”*? satisfying f(05,B)) = 0, that is, 


m,(d9) = By for alli =1,...,n. (4) 

By Lemma 1(ii) and the fact that |(v),85)|| = 1 it follows that v9 # 0, so the 

existence of a common median for X,,..., X,, follows by (4) and Lemma l(iv). This 
completes the argument for Case 1. 

General Case. Let r > 0 be such that p,(x:||x]| < r) > 0 for all i = 1,..., . For 

j = 1,2,..., let K, denote the closed sphere (x: ||x|| <7 +j} in R”, and let y,, 

denote the (re-normalized) restriction of p, to K p for i=1,...,n. By Case 1, for 


each j > 1 there is a common median hyperplane H, = {x © R":a,- x = b,} for 
Hy jse++> yj Since wj(K;) > 1 as j > 00, for all i = 1,..., n, it follows easily that 
a subsequence { H, } of { H;} converges (that is, there is an a © R” and bE R so 
that a, > aand b, — b), and that the limiting hyperplane H = {x:@- x= b}isa 
common median for Xj,..., X,. 0 


A generalization of the notion of “median” that is often of use in statistics is that 
of a B-quantile, which is a number a with the property that the probability the 
random variable is < a is at least 8, and the probability that it is > a@ is at least 
1 — B. Clearly every random variable has at least one B-quantile for each B © (0,1), 
and it is natural to ask whether Theorem 1 can be generalized to conclude that every 
collection of m <n random variables has a common hyperplane £-quantile. The 
next example shows that in general the on/y common hyperplane £-quantiles which 
exist are for B = 1/2, 1.e., medians. 


Example 2. Let xX, be the 2-dimensional random vector which is uniformly 
distributed on the boundary of the unit circle‘in R’, and let xX be the constant 
2-dimensional random vector which is always the origin, It is easy to see that there 
are no common hyperplane f-quantiles for X, and X, for any B © (0,1) except 
B=1/2. 

(Note that for a fixed B € (0,1), B # 1/2, the above example can be modified 
slightly to yield continuous distributions with no common hyperplane B-quantile). 

As pointed out by Carl Spruill, Theorem 1 can also be proved from the Ham 
Sandwich Theorem by adding N(0,n~') random variables to each component to 
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make each xX (and hence each p,;) continuous, applying for each n a slightly 
generalized version of the Ham Sandwich Theorem (for arbitrary continuous 
probability distributions), and then appealing to standard results for weak conver- 
gence of probability measures. The proof presented above has the advantage that it 
is more elementary (since it relies essentially only on the Borsuk-Ulam Theorem), 
and it also shows how to modify directly the classical proof of the Ham Sandwich 
Theorem to include noncontinuous distributions. 

It should also be emphasized that both proofs of Theorem 1 (the given proof, and 
Spruill’s proof) are highly nonconstructive, and it would perhaps be of interest to 
devise a practical, constructive method for determining the common hyperplane 
medians guaranteed by the theorem. Spruill has also raised the question of which 
curves or manifolds other than straight lines or hyperplanes can serve as common 
medians. 
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Quadratic Reciprocity: Its Conjecture and Application 


Davip A. Cox 
Department of Mathematics, Amherst College, Amherst, MA 01002 


If p and g are distinct odd primes, then the law of quadratic reciprocity states 
that 


mie = (-1) 2-4 (0.1) 


where (a/p) is the Legendre symbol, defined to be +1 when a is a quadratic 
residue modulo p and to be —1 otherwise. Quadratic reciprocity is one of the gems 
of elementary number theory, but at the same time, it’s never been easy to see where 
the theorem comes from. Other formulations are not a great help. For example, 
statement (0.1) is equivalent to the following: 


If p or q is congruent to 1 modulo 4, then p is a quadratic residue 
modulo gq if and only if g is a quadratic residue modulo p. If p and q (0.2) 
are congruent to 3 modulo 4, then p is a quadratic residue modulo q if 

and only if q is a quadratic nonresidue modulo p. 


(Again, p and q are odd primes.) This version of the theorem is easier to read, but it 
still sheds no light on how to conjecture such an amazing result. 

Of course, once we know to look for some relation between quadratic residues 
modulo different primes, it’s fairly easy to find examples that lead to a statement 
like (0.1) or (0.2). But how does one begin to suspect that such a relation should 
exist? 

This paper has two goals. The first is to answer the question just raised, i.e., to 
present a series of questions and examples that lead naturally to quadratic reciproc- 
ity. The second goal is to use quadratic reciprocity to prove the following theorems 
of Fermat for odd primes p: 


p=x?t+y’, x,yEZ 2 p=i1mod4 
p=x?+2y*, x,yEZ @ p#=1,3mo0d8 (0.3) 
p =x? + 3y’, x,yEL = p = 30r p = 1mod3. 


Our two goals are closely related, for we will use Fermat’s results to motivate the 
questions that lead to reciprocity. This might seem circular, but it’s how things 
happened historically. Euler knew of Fermat’s theorems, but since Fermat never 
published the proofs, Euler had to find his own. This took many years, and along 
the way Euler discovered quadratic residues and quadratic reciprocity. This paper 
will follow Euler closely, both in the examples leading to reciprocity and in the 
proofs of (0.3). For an excellent account of Euler’s work on number theory, the 
reader should consult Weil’s book [4]. 

There is one other aspect of our second goal which deserves mention. Many 
books on number theory present quadratic reciprocity but fail to give interesting 
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applications. It is important to show that quadratic reciprocity is not an isolated 
result, but rather is a theorem that leads immediately to other theorems of 
independent interest. (I should point out that several books do this very nicely, in 
particular Ireland-Rosen [2] and Nagell [3]. I am grateful to the referees for pointing 
out these references and for suggesting improvements in the exposition.) 


1. We will first discuss Euler’s strategy for proving Fermat’s results (0.3). One 
half of the theorem is easy: if p= x*+ny*, n=1, 2 or 3, then the desired 
congruence for p follows by considering x and y modulo 4, 8 and 3 respectively. 
The converse is more difficult, and this is where Euler’s strategy comes in. For the 
case of x? + y?, it consists of the following two steps: 


Step 1. If p = 1 mod 4 is prime, then show that p divides a sum of the form 
x* + y?, where x and y are relatively prime integers. 


Step 2. If an odd prime p divides a sum of the form x* + y*, where x and y are 
relatively prime, then show that p is a sum of two squares. 


The cases x* + 2y* and x? + 3y* can be formulated similarly, and it is clear that 
these steps suffice to prove Fermat’s results (0.3). 

Notice that both steps involve prime divisors of x? + ny”, where n = 1, 2 or 3. It 
thus makes sense to look more closely at this situation, and as happens often in 
mathematics, we can better understand our problem by posing it more generally. 
Hence, in considering prime divisors of x* + ny”, we should let n be an arbitrary 
integer, positive or negative. We still want x and y to be relatively prime, for 
otherwise any prime could divide x* + ny*—just take x and y to be multiples of p. 

We can restate our problem in terms of the Legendre symbol as follows: 


LEMMA 1.1. Let p be a prime not dividing n. Then there are relatively prime 
integers x and y such that p|x* + ny? if and only if (—n/p) = 1. 


Proof. Suppose that p divides such a number x? + ny”. Then x* = —ny?mod p. 
Since x and y are relatively prime, it follows that p + y. The integers modulo p 
form a field, so that yb = 1 mod p for some b. Multiplying our congruence by b’, 
we see that (xb)* = —n mod p, which implies that (—n/p) = 1. The converse is 
trivial, and the lemma is proved. 


We are now ready to turn to Step 1. Using Lemma 1.1, we see that is suffices to 
prove the following for primes p: 


—] 
[—] =1 2 p = 1mod4 
P 
_) “ 
— | = > p =1,3mod8 (1.2) 
P 
—3 
[—] =1 = p = 1mod3. 
P 


As we did before, let’s pose this more generally. Thus we want congruence 
conditions on p that imply (—n/p) = 1. Looking at (1.2), we see that the way to 
unify the congruence conditions is to work modulo 4n. 
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It’s now time to do some examples. We will work out the case n = —5 in detail. 
This means studying the remainders modulo 20 of those primes p for which 
(5/p) = 1. One way to generate such primes is to factor the numbers x? — 5 for 
variable x. If we let x range from 1 to 30, we get the following prime factors 
(excluding 2 and 5): 

11, 19, 29, 31, 41, 59, 61, 71, 79, 89 
109, 131, 139, 179, 181, 191, 251, 471, 571. 


Working modulo 20, all of these primes are congruent to +1 or +11, and the first 
row lists all primes under 100 which satisfy this congruence condition. If we let x 
range from 1 to 80, we get all primes under 200 that satisfy the congruence. It 
appears that we will eventually pick up all such primes, which leads to the 
conjecture that for a prime p # 2,5, 


5 
[=] =1 © p = +1, £11 mod 20. 


P 
Other examples can be worked out similarly. Here are some sample cases: 
—3 
[—} - = p = 1,7mod 12 
P 
—5 
[=| =1 ° p = 1,3,7,9 mod 20 
P 
—7 
—J=1 © ~~ p#=1,9,11,15,23,25 mod 28 (1.3) 
P 
3 
[=}=1 = p = +1mod12 
P 
5 
(=) =1 c p = +1, +11 mod 20 
7 
[| - © p= +1,+3, +9 mod 28. 


Here we always assume that p is an odd prime not dividing n. 

In looking for something to unify these conjectures, the bottom three results of 
(1.3) offer the most hope because of the +’s. We will thus concentrate on the case 
(N/p) for N > 0. 

The key problem is to discern a pattern in the numbers that follow the +’s. Of 
course, numbers in congruences can be written many ways. For example, 11 = 
—9 mod 20 and 3 = —25 mod 28. Using these two facts we can rewrite the bottom 
part of (1.3) as follows: 


3 
(=) =1 = p = +1mod12 
Pp 


5 
= =] = p = +1, +9 mod 20 (1.4) 


= = | ran p= +1, +25, +9mod 28. 
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All of a sudden there is a pattern: all of the numbers that appear are odd squares! 
Before we get too excited, let’s try another case: 


=) =1 = p= +1, +5 mod 24. 
P 
Unfortunately, +5 is not congruent to a square modulo 24. The same phenomenon 
happens for (10/p) and (14/p). So why does the pattern work for 3, 5 and 7 but not 
for 6, 10 and 14? The obvious difference is that the former numbers are all prime! 
This says that there is something very special about the prime case, i.e., the case 
of (q/p) = 1, where p and gq are distinct odd primes. From (1.4), we get the 
following conjecture. 


CONJECTURE 1.5. If p and q are distinct odd primes, then 
(2) =] <p= +’mod 4g for some odd B. 
Pp 


Notice that we have now given an answer to the basic question posed in the 
introduction—the above examples single out the prime case very nicely. It remains 
to show that Conjecture 1.5 is equivalent to the usual formulation of quadratic 
reciprocity. We do this as follows. 


THEOREM 1.6. Conjecture 1.5 is equivalent to the law of quadratic reciprocity for 
distinct odd primes (see (0.1)). 


Proof. Let p and gq be distinct odd primes, and set p* = (—1)'?-?/p. Note 
that p* = 1mod4. We will assume the following two properties of the Legendre 
symbol: 


(=) «(nie 


q 


Pel Lallal 

q q/\4 

(see [3, p. 135]). Using (1.7) it is an easy exercise to show that quadratic reciprocity 
is equivalent to the statement 


(1.7) 


Since each side equals +1, it follows that quadratic reciprocity can be written as the 
equivalence “ 


* 
e)-r(E)- 
P q 
and comparing this to Conjecture 1.5, it thus suffices to show 
p* 
p= +B*mod4q @ = =]. (1.8) 
q 


Note that B* =1mod4 since B is odd. Thus the + sign in (1.8) must be 


446 DAVID A. COX [May 


(—1)'?-)/*, and we then have 
p= +B*mod4q e p= (—1)? °°” B? mod 4q 
<> p* = B?mod 4q. 


Now, to prove (1.8), suppose that p* = B* mod 4q. This implies p* = B* mod q, 
so that (p*/q) = 1 follows immediately. Conversely, if (p*/q) = 1, then p* = 
a*mod qg for some a. Letting B = a or a + g, depending on whether a is odd or 
even, we obtain p* = B*mod 4q, and the theorem is proved. 


With this theorem and the preceding examples, it becomes clearer how someone 
could have conjectured quadratic reciprocity. In our case, however, the “someone” 
is none other than Euler, for we have followed roughly the same path he took to 
discover reciprocity. In fact, our examples (1.3) are taken from Euler’s 1744 paper 
“Theoremata circa divisores numerorum in hac forma pxx + qyy contentorum”’ [1, 
vol. 2, pp. 194-222], and in the same paper there are a series of Annotations that 
include Conjecture 1.5 as a special case. For more details, see [4, pp. 186-187, 
204-210]. 

We should also mention the other parts of reciprocity, the so-called complemen- 
tary theorems for odd primes p: 


—] 


—| = (-1)?? 


P 
= =(- PVA 


Notice that we’ve already used the formula for (—1/p) in the proof of Theorem 1.6. 

For our purposes, the key fact is that once we’ve done a full treatment of 
reciprocity (statements (0.1) and (1.9)), Step 1 of the proof of Fermat’s results 
follows immediately—the equivalences (1.2) are now an easy exercise which we 
leave to the reader. It remains to prove Step 2, which will be done in §2. 


(1.9) 


2. Earlier, we stated Step 2 only for the case x? + y*. We will state the general 
case, for x* + ny’, n = 1,2,3, as the following theorem. 


THEOREM 2.1. Let p be an odd prime that divides a sum a* + nb*, where n = 1, 2 
or 3 and a and b are relatively prime. Then p can be written in the form x* + ny?. 


Proof. We will follow the arguments Euler used in 1752 for the case x? + y? [1, 
vol. 2, pp. 300-307]. We will generalize his arguments to x* + ny”, n = 1,2,3. 
The crucial step is the following lemma. 


LEMMA 2.2. Let g = x? + ny’, na positive integer, and suppose that q divides a 
number N = a* + nb’, where a and b are relatively prime. If either q is prime, or 
= 4 and n = 3, then N/q = c? + nd, where c and d are relatively prime. 


Proof. Let us first consider the case where gq is prime. Since g divides both 
x?N = x*(a? + nb’) and a*q = a?(x* + ny’), it divides their difference 


x*(a* + nb?) — a?(x? + ny”) = n(xb? — a?y?) = n(xb — ay)(xb + ay). 


Since q iS prime, it must divide one of these factors. 
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If g|n, then g =n since gq = x? + ny”. Hence n|N = a* + nb’, so that nia, ie., 
a = nd. Then N = n7d? + nb’, which implies N/q = b* + nd*, as desired. 

If g|xb — ay or q|xb + ay, we can assume that the former holds by changing the 
sign of y. Then xb — ay = dg = d(x? + ny”). This implies that 


xb — dx* = ay + dny? = y(a + ndy), (2.3) 


from which we conclude that x| y(a + ndy). Since x and y are relatively prime (q is 
prime), we must have x|a + ndy, Le., 


a+ ndy = cx (2.4) 
so that a = cx — ndy. Substituting (2.4) into (2.3), we obtain 
x(b — dx) = y(cx), 


which implies that b = dx + cy. 
However, we also have the famous identity 


(c? + nb*)(x? + ny?) = (cx — ndy)’ + n(dx + cy)’. 
Using the above formulas for a and b, this becomes 
(c? + nd*)q =a? + nb*=N, 


so that N/q =c?+nd*, as desired. Since a and 5 are relatively prime, the 
formulas for a and b show that c and d are also relatively prime. 

It remains to consider the case n = 3 and g = 4. Here, we have 4|a? + 3b, so 
that a and b have the same parity. Since a and b are relatively prime, they must be 
odd. Since 4 = 17 + 3 - 17, the argument for the prime case (with x = y = 1) would 
work, provided that 4|b — a or 4|b + a. But the latter holds for any pair of odd 
numbers, which proves the lemma in this case. 


The proof of this lemma is similar in strategy to Lagrange’s proof of the famous 
four square theorem, which asserts that every positive integer is a sum of four or 
fewer squares (see, for example, [3, Theorem 102]). 

To complete the proof of Theorem 2.1, let p be an odd prime dividing a” + nb’, 
where a and Db are relatively prime. Assume that p itself is not of this form. We will 
show that there is an odd prime g < p with exactly the same properties. We would 
then be done by Fermat’s principle of infinite descent: applying the same argument 
to gq would give us q’ < qg, and continuing we would get an infinite decreasing 
sequence p > q >q'> -:- of positive integers, which contradicts the well-order- 
ing property. 

To produce gq, we work with a* + nb’. It is divisible by p, and remains so if we 
replace a and b by a — kp and b — @p respectively. Furthermore, we may choose k 
and ¢ so that |a — kp| < p/2 and |b — @p| < p/2 because p is odd. Thus we may 
assume that p|a* + nb? where |a| < p/2 and |b| < p/2. Since n < 3, it follows that 
a* + nb* < (p/2)? + 3( p/2)? = p’. Thus a” + nb* can be written as 


a* + nb? = pq, --- q (2.5) 


r? 


where the primes q, all satisfy q,; < p. 
We claim that one of these q,’s is odd and not of the form x? + ny*. Assume not. 
Then all of the odd g,’s can be written as x? + ny”, so that by repeatedly applying 
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Lemma 2.2 we can eliminate all of the odd q,’s from (2.5). This leaves us with 
a* + nb? = 22. 
If n=1 or 2, we can also apply Lemma 2.2 to 2= 17+ 17=07+2-1? to 
eliminate factors of 2, showing that p = a* + nb’, a contradiction. If n = 3, the 
case g = 4 of Lemma 2.2 shows that we can reduce to either p = a* + 3b? or 
2p = a* + 3b?. It remains to show that the latter case cannot occur. But p|a* + 3b 
implies (—3/p) = 1, which by quadratic reciprocity means p = 1 mod 3, so that 
2p = 2mod3. Yet 2p = a? + 3b? implies 2p = a? = 1 mod 3, and thus we have a 
contradiction. 
This completes the proof of Theorem 2.1. 


This finishes our proof of Fermat’s three theorems (0.3), and the only gap is that 
we haven’t proved quadratic reciprocity. So once a course in number theory covers 
reciprocity, there really are nice applications waiting to be proved, and some of 
them can even help motivate the statement of the theorem itself. 
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A Geometric Interpretation of the Riemann-Stieltjes Integral 


GREGORY L. BULLOCK, 
Mission Research Corporation, Santa Barbara, CA 93102 


1. Introduction. For many students of mathematics, the moment of truth comes 
in a rigorous study of real analysis. A promising student may become discouraged 
during a real analysis course and subsequently abandon serious hopes of pursuing a 
degree in mathematics and may even dissuade other students from doing the same. 
A graphical explanation of some theorems of analysis can simplify the task of an 
instructor and may save some students from unwarranted distress. 

Herein are presented geometric interpretations of the Riemann-Stieltjes integral 
(or, simply, the Stieltjes integral) and a few of the associated theorems as given in [1, 
chapter 6]. For convenience, the definition of the integral is reproduced, and the 
theorems that are interpreted here are stated without proof. 


2. Definition of the Integral. Let [a, b] be a given interval. Define a partition P 
of [a, b] to be a set of points xo, x),...,x,, where da=xjp <x, < ++: <x,=5. 
Let a be a monotonically increasing function on [a, b]. For each partition P of 
[a, b] write Aa, = a(x,) — a(x,_}). 

For any real function f that is bounded on [a, b] let M, = max{ f(x), x,_, < 
x <x,}, m, = min{ f(x), x,_, <x < x,}, and set 


U(P, f,a) = > M,Aa,, 
k=1 
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and 
L(P, f,a) = ) m,Aa,. 
k=1 


If there is a unique number / that satisfies the inequality L(P, f,a)<I< 
U(P, f, a) for all partitions P of [a, b], then J is called the Stieltjes integral of f 
from a to b and is denoted by 


[Ido or ['1(s) da), 


THEOREM 1. Assume a increases monotonically and a’ is Riemann integrable on 
[a,b]. Let f be a bounded, real, Riemann integrable function on {a, b]. Then 


[ fao = [P(x)e'(x) dx. 


THEOREM 2. Ifa <s <b, fis bounded on [a, b], fis continuous at s, and 


a(x) = {? x<s 


x > S, 
then [?fda = f(s). 


3. In 3-D. When dependent sets are graphed, each set is assigned a dimension 
with a direction perpendicular to that of the dimensions of the other sets. So f(x) 
and a(x) are graphed as in FIGURES 1 and 2. For this discussion, let a(x) be 
differentiable and observe that a(x) is monotonically increasing in agreement with 
the definition of the Stieltjes integral. If f(x), a(x), and x are to be considered 
simultaneously, then each is assigned its own dimension and direction perpendicular 
to that of the other two. A key to finding a geometric interpretation of the integral is 
to notice that since f(x) is independent of a(x), then for a given x, f(x) must be 
constant with respect to a(x). So f(x) is a cylinder (or more intuitively, a sheet) 
that is straight in the a-direction. If one looks along the sheet in the x-direction one 
may see hills and valleys, but one will see only flat terrain in the a-direction (see 
Fic. 3). The same result holds for a(x), which is independent of f(x). 


f(x) 


Fic. 1. 
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a(x) 


| 


Fic. 2. 


f(x) 


a(x) 


Fic. 3. 


4. The Integrated Fence and its Illuminating Shadow. Assume still that a(x) is 
differentiable. If the a-x plane is thought of as horizontal and the f-direction as 
pointing straight up, then the surface to be considered is like a curved fence. The 
fence follows the curve traced by a(x), and the height of the fence is given by f(x) 
(see Fic. 4). The fence is really the section of the a-sheet that is bounded between 
the a-x plane and the fsheet. The Stieltjes integral integrates along this fence, 
summing the products of heights and infinitesimal widths. It uses f as the height in 
each interval, but for the differential width it considers only Aa,, the length of the 
infinitesimal interval in the a-direction. As a result, the area given by the integral is 
really the area of the projection of the fence onto the f-a plane (in the same sense 
that the “area under a curve” is defined as that given by the Riemann integral). If a 
spotlight with a beam parallel to the x-axis is positioned so as to aim toward the 
fence, then the area given by the integral is the area of the shadow of the fence on a 
wall built in the /-a@ plane (see FIG. 5). 
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| ) PPT | ‘‘ fence” 
Dr 


tn, 
~ ST 


a(x) a(x) 


Fic. 4. Fic. 5. 


“shadow” f@ 


FIG. 6. 


It is interesting to note that in this configuration, the Riemann integral is simply 
the area of the projection of the fence onto the f-x plane. It is easy to see that if 
a(x) is defined by a(x) = x, then the fence allows a straight line from the origin 
that makes an angle of 45° with both the a-axis and the x-axis. In this case the area 
of the projection onto the f-a plane is equal to the area of the projection onto the 
f-x plane (see Fic. 6). Thus, the Stieltjes integral reduces to the Riemann integral. 
This can be verified by a simple application of Theorem 1. 


5. Throwing a Curve. Consider the case where a is not a straight line. Define f 
as in FIGURE 7 (in three dimensions this is a plane), and let a, be given by FIGURE 
8a and a, by FIGURE 9a. Let a, and a, be differentiable. The corresponding figures 
show the fences and their projections onto the fa plane. A careful inspection of 
these shadows will convince the reader that the curve in a weights the area of the 
shadow. The values of x for which a(x) has the steepest slope correspond to regions 
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f(x) 
xX 
FIG. 7. 
aC) a5 (x) 
2 
x X 
FIG. 8a. FIG. 8b. 
[h , 
“fence” 


Fic. 9a. FIG. 9b. 
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of the fence that cast the most shadow and thereby carry the most weight in the 
integral. While trying to picture this, it may be useful to observe that regions of the 
fence in which a(x) has zero slope cast no shadow at all. This is the essence of 
the identity from Theorem 1 


[fo = [flx)o'(x)ax (1) 


where a’(x) = da/dx is simply the weighting function. 


6. A Gate in the Fence. Using the definition 


Q, < 
a(x) = (4 _— 


where f is continuous at s, the fence has a hole—or a gate—in it (see Fics. 10 and 
11). The width of the gate-is unity, and the height of the fence at the gate is f(s), so 
the area of the gate is f(s). That the fence has no projection (other than two line 
segments) in the f-a plane only slightly complicates the analogy to the geometric 
interpretation of the integral. The complication can be resolved by simply filling in 
the fence at the hole, in other words, closing the gate door. The projection of the 
gate door has area f(s) which is what the Stieltjes integral gives. This agrees with 
Theorem 2 above. This analogy is applicable whenever a(x) is monotonic and has 
at least one jump discontinuity. 


a(x) f(x) 


“shadow” 


ira 


i 
wi 
Vat 


, a) 

' : HY 
4 

ry 


RY x a(x) 


Fic. 10. ° Fic. 11. 


7. A Sidewise Glance. In the general definition of the Stieltjes integral, the 
restriction that a(x) be monotonically increasing is not imposed. For the following 
discussion, let the restriction be removed, and consider the case where a(x) is 
decreasing on some intervals. On intervals where a(x) is decreasing, a’(x) is 
negative, and the area of the corresponding projected region contributes negatively 
to the integral as suggested by equation (1). The analogy to the geometric interpreta- 
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tion of the integral may be applied directly by dividing the original interval into 
subintervals on which a(x) is monotonic. The shadow of each subinterval may be 
considered separately, and the area of each shadow may be added or subtracted (as 
determined by the sign of a’(x) on the subinterval) to determine the value of the 
integral as illustrated in FIGURES 12a-c. It is amusing to observe that in this 
configuration, the net region whose area is given by the Stieltjes integral typically 
assumes unusual shapes (see Fic. 13). In fact, the integral can give the sum of the 
areas of two (or more) regions connected only by a point as shown in FIGURES 
14a-b. 


Fic. 12a. 


f(x) 


“shadow” 


Fic. 12b. Fic. 12c. 
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f(x) 


Fic. 13. 


f(x) 


f(x) 


Fic. 14a. Fic. 14b. 


8. Conclusion. Geometric interpretations of the Riemann-Stieltjes integral and 
several of its associated theorems have been presented. The author hopes that these 
will aid mathematics instructors in presenting difficult concepts associated with the 
integral. ’ 


I am indebted to Dick Epstein, my mathematics professor at the University of California, Berkeley, 
for his helpful suggestions and encouragement. I am grateful to Robert Drucker for his ever-illuminating 
observations. 
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preferred. The answer to the problem should appear right at the beginning. If your method yields a 
more general result, so much the better. If you discover that a MONTHLY problem has already been 
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E 3265. Proposed by D. M. Friedlen, Georgia Institute of Technology. 


Suppose n distinct pairs of socks are put into a laundry, where it is assumed that 
each of the 2n socks has precisely one mate. At the completion of the laundering 
operation, socks are drawn out one at a time. Suppose the first pair is realized on 
draw number 7,,. 

(a) Find an explicit formula for E(T,,), the expected value of 7), and determine 


lim n7-//"E(T,). 
(b) For positive x prove that 
lim P{ 7, < xn'/*} 


exists and find its value. 


E 3266. Proposed by John C. Turner, University of Waikato, Hamilton, New 
Zealand. 


Let N be the set of natural numbers. If SCN and nEN, let S+t+n= 
{s+n:s €S}. Define a sequence of sets S,, S,, S3,... recursively by putting 
S, = {1} and S,=(S,_, +k) U {2k—1} for k =2,3,4,.... What is N\ 
Ue_,S,? 


E 3267. Proposed by Barry Hayes, Donald Knuth, and Carlos Subi, Stanford 
University. 


456 
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Given a sequence (x, X5,...,x,) of nonnegative integers in which x, > 1 for 
some k, where 1 < k < /, let us say that a “k-move” is the operation of replacing 
the subsequence (x, _4,X,5X,41) by (X,_1 + 1, X, — 2, X,4, + 1). 

(a) Prove that repeated application of such moves to the sequence (0”, 2m, 0”) 
always leads to the sequence (1”,0,1”) after exactly $(m+ 1)(m+ 4)m moves. 
Here 0” and 1” stand for sequences of m 0’s and m 1’s, respectively. 

(b) Prove that, for sufficiently large m, the starting sequence (0, a,,..., a,,0) 
leads inexorably to the sequence (0”*?,14,0,1",0"*" 2 4-"—!) for some p, q, and 
r, if a,,..., a, are positive integers. Furthermore, p, g, and r can be expressed in 
terms of 7_,a, and X”'_, ja; How many moves does this transformation require? 


E 3268. Proposed by Dorothy Maharam and A. H. Stone, University of Rochester. 
Does there exist a sequence &,,§,,... of positive real numbers such that, 
whenever 0 < a < B, we have 
 #{n:l<nK<k,§&,> B} 
lim = 


= 0? 
k>o0 #{n:l<n<k,&,>a} 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Characterizing the Exponential Function 


E 3127 [1986, 60]. Proposed by David Shelupsky, The City College of New York. 


(a) Show that on any real interval [a, b], the exponential function e~ is char- 
acterized, up to multiplication by an arbitrary positive constant, by the inequality 


f(x) < (f(y) — f(x) /(y — x) < fy), (*) 


where x and y arereal,a<x<y<<b. 
(b)* Are the exponentials the only functions on an interval J that satisfy the 
inequality 


min{ f(x), f(y)} < (f(y) — f(x) /(y — x) < max{ f(x), f(y)} (*) 
for x # yin I? 


Solution to (b)* by Bjorn Poonen (student), Harvard University. The answer is 
“ves.” We prove (b)* by showing that (**) implies the continuity and differentiabil- 
ity of f and by then solving the resulting differential equation. First consider any 
subinterval [c,d] =J CJ, |J| < 1/2, and fix t © J. Then for x € J, (**) implies 
that 


f(x) — fe) S af (%)| or [f(%) — FO) < IPO, 
and so |f(x)| < 2|f(¢)| in any event. Thus, for all x, y © J, 
f(x) — FO) Ss 21F (I Ly ~ x] = My — x], 
so f is continuous. But then (**) implies that f is differentiable on (c, d) and 


f(x) = f(x), so that f(x) = ke* for appropriate k and x € (c,d). If I has 
length > 1/2, write J] =J, U-+- UJ, where |J;| < 1/2 and |J NO J,,,| > 0. Then 
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f(x) =k,e* for x €J,, and by considering x € J; J,,,, we see that k; =k for 
all 7. 

As a check, (ke” — ke*)/(y — x) = ke’ for some z between x and y, by the 
Mean Value Theorem, and so f(x) = ke~ satisfies (**). 

If f satisfies (*), then it satisfies (**), so all solutions to (*) have the form 
f(x) = ke*; (*) is satisfied if and only if k > 0, and this solves (a). 


There were twenty-six correct solutions to part (b)* and eight solutions to part (a) only. Three 
incorrect solutions were received. 


Constructing a Triangle 


E 3134, [1986, 132]. Proposed by Jordi Dou, Barcelona, Spain. 


Provide a Euclidean construction of a triangle ABC, given the median m,, the 
bisector ¢,, and the angle A. 


Solution I by G. Velissarios, Athens, Greece. Consider the triangle ABC to be 
constructed, as in Fic. 1. Let AM, AD be the median and bisector from A, 
respectively, with a the measure of the angle. Extend the side AB beyond A to the 
point R such that AB = AR, and let AS be the bisector of the angle at A in the 
triangle ACR. 


FIG. 1. 


We begin by constructing the right triangle DAS. Since AD, AS are angle 
bisectors at A, we have 


Therefore, DS is parallel to BR. This tells us that the angle at D in DAS is a/2, 
which enables us to construct DAS. Let s = AS. 
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applying the law of sines to BMA and CMA yields 


BC 1 1 1 1 


— | ——— + ———— | = -- + —. 
2m,|sin(a/2-—6) — sin(a/2 + 6) sn BB sinC 
Applying the law of sines to BDA and CDA yields 
1 1 1 BD+DC BC 1 


; + = ; = TT 
snB  sinC t,  sin(a/2) t,  sin(a/2) 


Combining these and expanding sin(a/2 + 6) yields an equation that simplifies 
to 


L 4 


[2 sin(a/2)cos 6|sin(a/2) = sin?(a/2)cos’6 — cos*(a/2)sin’6. 


2m 4 


Using sin’6 = 1 — cos’8, we obtain a quadratic equation for cos 6. The solution to 
this is cos 6 = g + /g? + cos?(a/2), where g = (t,/2m,)sin’(a/2). The solution 
6 can now be constructed by standard methods. 


Editorial Comment. This problem has appeared before as Monthly Problem 
E1375 [1959, 513; 1960, 185], as noted by L. D. Goldstone and M. Vowe. It also 
appeared in Mathematics Magazine as problem 1054(b) [51(1978) 305; 53(1980) 
52-53], as pointed out by M. Vowe and A. Jayakrishna. I. Paasche commented that 
the problem is solved in a book by K. Herterich, Dreieckskonstruktionen (Ernst 
Klett Verlag, Stuttgart, 1966), 157-158, problem No. 177. I. A. Sakmar provided 
two solutions, noting that one of them appears in Exercices de Trigonométrie, by 
Frére Gabriel Marie, p. 330. The solutions given above are generally distinct from 
these. 

Also solved by A. Bondesen (Denmark), E. Braune (Austria), L. D. Goldstone, H. Guggenheimer, 
P. L. Hon (Hong Kong), A. Jayakrishna, O. P. Lossers (Netherlands), L. Kuipers (Switzerland), E. 


Morgantini (Italy), I. Paasche, I. A. Sakmar (Canada). I. A. Sakmar and O. Yumia (Turkey), S. Selvaraj 
(student), B. A. Troesch, M. Vowe (Switzerland), and the proposer. 


Coverings by a Convex Set 


E 3149 [1986, 400]. Proposed by Louis Funar, University of Craiova, Craiova, 
Romania. 


Let K be a plane convex set with area a, perimeter p, and diameter d, and let 
A = a/(a + pd+ md’). 

(a) Prove that any finite family of congruent copies of K that covers area A must 
have a subfamily with pairwise disjoint interiors that covers area at least A. 

(b)* Is this also true with A = a/(ad’)? 


Solution to (a) by William A. Newcomb, Lawrence Livermore Laboratory, 
Livermore, CA. If K’ is any congruent copy of K, let E(K’) denote the set of all 
points within distance d from K’. The area of E(K’) is a/A =a + pd + nd’. (CE. 
Theorem 14.12 and Exercise 19.9 of Russell V. Benson, Euclidean Geometry and 
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Convexity, McGraw-Hill, 1966.) If F is a finite family of congruent copies of K 
which covers A, let {K,,..., K,} be a maximal subfamily of F with pairwise 
disjoint interiors. Every member of F intersects some K, and is, therefore, com- 
pletely contained in E(K,). Therefore, the area A covered by F is at most n times 
the area of E(K ), so the subfamily covers area na > AA. 


Partial solution of (b) by David B. Secrest (student), University of Illinois, 
Urbana. Let S be a finite family of congruent copies of K whose union has area A. 
In the special case in which every pair of sets in S intersect, the diameter of U,..B 
is less than 2d. Now the area of any figure with diameter at most 2d is at most wd? 
(a small generalization of problem A5 of the 1967 W. L. Putnam Math. Competi- 
tion). Thus, if \ = a/(1d’), we have AA < a. 


Editorial comment. Note that in fact the members of the family need not be 
congruent in part (a). The argument works as well if all members of the family are 
convex and have area a, perimeter p, and diameter d, with no further restriction. 


Part (a) also solved by the proposer. 


ADVANCED PROBLEMS 
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6572*. Proposed by David L. Book, Naval Research Laboratory, Washington, 
DC. 


Show that 


ore) — g(t) dt 
i {1 —e lap = 7, 


where 


| 1 aa 
t)=— + st . 
at) = =f In(l + st) 
This result appeared in an article in Zhurnal Eksperimental’noi i Teoretecheskoi 
Fiziki, 90(1986) 330 by S. E. Esipov and I. B. Levinson, who stated that they had 
obtained it numerically and verified it to fourteen significant figures. (At the request 
of the editors, the Symbolic Computation-Group of the University of Waterloo has 
verified the result to over forty significant figures.) 


6573. Proposed by Gérard Letac, Université Paul Sabatier, Toulouse, France. 


Suppose X is a real random variable which is normally distributed with mean 0 
and variance 1. Suppose f and g are functions from R to R such that, for each b in 
R, the random variable F(X + b) — g(b) is also normally distributed with mean 0 
and variance 1. Prove that there exists e in {—1,1} and c in R such that 
f(x) = ex + c almost everywhere. 
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SOLUTIONS OF ADVANCED PROBLEMS 
A Characterization of Holomorphic Functions 


6525 [1968, 574]. Proposed by Andrew Lenard, Indiana University. 


Let S be an open subset of the complex plane. Prove that a function f is 
holomorphic on S if and only if 


¥ (Z544 —~ z,)f(z;) 


for every closed disc D C S. Here the star indicates that the supremum is taken over 
all finite sequences Z,, 2),...,2Z,;2Z,4, Of poimts of D (n arbitrary) such that 


Zn41 = 2, and at least two of the z, are distinct. 


Solution by Thomas L. McCoy, Michigan State University, East Lansing, MI. 
First let f be holomorphic in the closure of the disc D, and M an upper bound for 
its derivative there. Let 


n 


c= UL, 
j=l 


where L, is the line segment from z, to z,,,. By convexity, L,; C D. We have then 


Yi (zjar > a) Az) = ff) — Lf (Se) - f(z) & 


=-ZLf (f2) -f@,)) &, 


whence 


Le (Z541 — z;)f(z;) 


< Lilzj.1- z,| sup | f(z) - f(z,)| 
j zEL, 


= Dlg. — zl sup | f7'(w) aw 
J zE L, Z) 
Ss eZ 44 —~ z,|M|z,4 — z,|. 
j 
Thus holomorphy of f in S does indeed imply an inequality of the desired kind for 


any Closed disc lying in S. 
Conversely, assume 


E (20 ~z)f(z) 


n 
2 
< M » IZ44 _ Zi 
j=l 
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is valid for the closed disc D. Then n = 2 yields 
| f(22) — flay) | < 2M|z, - 21], 


so f is Continuous in D. 

Now let I’ be any simple closed rectifiable curve lying in D. Then the integral of 
f over T is the limit of sums 2(z,,, — 2,)f(z;) as sup|z;,; — 2z,| > 9, the z, 
traversing I’. Let s(I°) be the length of I. Choose the z; on Iso that |z,,, — z,| < 6 
for all j, and L|z,,, — z,| < s(1). Then 


2 
Of(z)(z 44 a z,)| < MY \2541 _ z,| 
< MB) 2541 a z,| 
< M8 - s(T), 


and since 6 is arbitrarily small we conclude that 


| faz =0. 


By Morera’s Theorem f is holomorphic in D. Since D is an arbitrary disc contained 
in S, f is holomorphic in S. 


Remarks by the proposer. If the supremum of the problem is taken only over 
sequences of length n = 2, one obtains the class of Lipschitz-continuous functions 
on D, as mentioned in the course of the above solution. If n is unrestricted, one 
obtains the class of holomorphic functions. But what if we take the supremum only 
over sequences of length n = 2,3,..., N for some fixed N? This defines a certain 
class Cy, of functions on D, so that C, is the set of Lipschitz-continuous functions 
and 


TDs 
O 


is the set of holomorphic functions. Can one determine in some effective sense what 
Cy 18? 
Editorial Comment. All solutions received were essentially the same as McCoy’s. 


Solved also by O. P. Lossers (The Netherlands), Chr. A. Meyer (Switzerland), Victor Manjarrez, and 
the proposer. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 


Department of Mathematics, Macalester College, St. Paul, MN 55105 


Mathematics as Problem Solving. By Alexander Soifer. Center for Excellence in 
Mathematical Education, Colorado Springs, CO, 1987. viii + 117 pp. 


CECIL ROUSSEAU 
Department of Mathematical Sciences, Memphis State University, Memphis, TN 38152 


In the terminology of Paul Erdés, The Book is that transfinite collection of all 
theorems and their best possible proofs. Perhaps the essence of The Book is to be 
found in its section entitled Elementary Problems and Solutions (or words to that 
effect). Here are those problems which require no elaborate formal background but 
whose solutions provide glimpses of mathematics at its finest. Here, to be sure, is 
L. M. Kelly’s solution of the problem posed by J. J. Sylvester and first solved by 
T. Gallai. 


Prove that if n points in the plane are not on one straight line, then there exists a 
straight line containing exactly two of the points. 


Here are examples from George Polya and other masters of the art of problem 
solving. Candidates are to be found in the Elementary Problems section of the 
American Mathematical Monthly and in the various problem-solving competitions 
sponsored by the MAA. The best of these are recognized as coming “straight from 
The Book.” 

Elementary problem solving at its best plays the distinctive role of uniting 
generations of mathematicians. Young Olympiad and Putnam caliber students share 
with “Uncle Paul,” the world’s most famous 25-billion-year-old mathematician, 
common appreciation for problems in elementary number theory and combina- 
torics. Mathematical fashions change, but The Book’s geometry problems are 
timeless. The best problem solvers of a new generation will find in them the same 
satisfaction as did the best of their predecessors. 

What makes a solution suitable for The Book? What is it that we find so 
attractive? Mathematicians often speak of “beauty”, “elegance’’, etc. in reference to 
theorems and their proofs. Generally speaking, such talk leaves the nonmathemati- 
cal fellow citizen totally bewildered. There is perhaps something more elemental to a 
solution frem The Book. That is the element of surprise. Our sense of elegance may 
be so pronounced because we are caught off guard by the proof’s crucial idea and 
are thus surprised by its “elegant” twist. In this respect, there is much to connect 
mathematics and humor. A good solution is like a good joke; the right touch 
produces a classic. The quintessential introduction of a new problem or proof to a 
colleague may be “Have you heard the one about... ?” 

An illustration is in order. The following sleeping mathematicians problem was on 
the 1986 USA Mathematical Olympiad. 
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During a certain lecture, each of five mathematicians fell asleep exactly twice. 
For each pair of these mathematicians, there was some moment when both were 
sleeping simultaneously. Prove that, at some moment, some three of them were 
sleeping simultaneously. 


Members of the committee which constructed the Olympiad had in mind a 
“nice” solution using interval graphs. However, several of the students taking the 
exam were more insightful than the professors. Their solution goes like this. 


Suppose that no three mathematicians were ever asleep at the same time. Then 
there are (5) = 10 disjoint time intervals in which two mathematicians are 


simultaneously asleep. These intervals are each initiated by the event of a 
mathematician falling asleep and there are exactly 5 x 2 = 10 such events. 
Hence, there are 10 distinct times at which a mathematician falls asleep, each 
such time initiating a different period of common dozing. This is absurd; just 
consider the first time a mathematician falls asleep. 


No fancy formulas, no interval graphs, just a surprising and witty solution—and, of 
course, a good joke on the professors. 

Earlier, we mentioned L. M. Kelly’s solution of Sylvester’s problem. The proof is 
like a classic story. It may have been told many times in the past, but deserves to be 
retold. 


Out of all such choices, choose a point P, and a line L not containing P, but 
containing at least two of the remaining  — 1 points so that the distance from 
point to line is minimized. (Justification: not all of the points are on the same 
straight line and the sets involved are finite.) Let Q denote the point on L 
which is nearest P,. If L contained three or more of the n given points, then 
two of these points would lie in one of the two rays obtained by dividing L at 
Q. Suppose that this is the case and let P, and P, denote two such points, with 
P, being the nearer of the two to Q. Since the distance from P, to the line 
through P, and P, would then be less than the distance from P, to L, this is 
impossible. Thus, L cannot contain three or more of the points. It contains 
precisely two. 


~ 


Retelling the best solutions and sharing the secrets of discovery are part of the 
process of teaching problem solving. Ideally, this process is characterized by 
mathematical skill, good taste and wit. It is a characteristically personal process and 
the best such teachers have surely left their personal marks on students and readers. 
Alexander Soifer is a teacher of problem solving and his book, Mathematics as 
Problem Solving, is designed to introduce problem solving to the next generation. By 
external examination, it is a fairly modest book (117 pp) covering standard 
elementary topics and containing about 200 problems. However, it is somewhat 
more than this. It is the reflection of the author’s personal experiences and it is an 
expression of his desire to share what he has learned from The Book. 
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Toward a Lean and Lively Calculus: Report of the Conference / Workshop to Develop 
Curriculum and Teaching Methods for Calculus at the College Level. Donald G. 
Douglas, Editor. The Mathematical Association of America, 1987. xxi + 249 pp. 


RICHARD W. HAMMING 
Department of Computer Science, Naval Postgraduate School, Monterey, California 93943 


There has recently been a lot of noise about revising the calculus course to meet 
modern needs, and it is natural that the needs of society change. But it is also all too 
common that the changes are not recognized by those in charge until many years 
later. For example, E. E. David once remarked to me that our society has passed 
from a manufacturing society to a service oriented one, but the appropriations for 
R&D, especially those from Washington, show no recognition of this fact. I 
similarly observe that we have moved from a society dominated by mechanics and 
electrical circuits in engineering to one dominated by probability and statistics, but 
that the mathematical curriculum has not changed accordingly. And, I would say, 
for about the same reasons. 

The students have also changed. One change that has often been noted is that as 
we have broadened the base of enrollment we have lowered the average level of 
competence. The recent burst of the “New Mathematics” has not helped in their 
earlier education as both they and their teachers remain, to a great extent, demoral- 
ized. But the students have changed in other ways—they recognize that they do not 
live in a world of mechanical things (as I did in my youth) but rather a very 
different world to which almost all that they read in the current calculus books 
appears to be irrelevant. The artificial problems that appeal to us generally do not 
appeal to them with their sense of “relevance.” They remain passively engaged in 
the courses we currently teach. There is little identification of the content with their 
possible lives, but rather mathematics is a chore, a hurdle to be got by as gracefully 
as possible. We must find applications that appeal to the students as they are at the 
time they are in class. I expect that the economics problems I see in the calculus 
books seem to them (as they do to me) little more than empty stuff, and the 
biological examples are not much better! The problems must have some real 
connection with life as they are leading it, involve things that they have heard about 
and are interested in. 

There is another fundamental difficulty with the teaching of mathematics as it is 
presently done. Any systems engineer knows that if you optimize the parts of a 
system then almost surely the system performance will dégrade. We have finally 
managed to hone the individual courses like calculus and linear algebra so that they 
are optimum for themselves—and in the process the teaching of mathematics has 
degraded. Until each course in mathematics is designed to support the whole system 
we will have this counterproductive result. I occasionally teach, for the mathematics 
department, a classic (static) linear algebra course while for the electrical engineer- 
ing department I teach dynamic linear algebra under the name “Digital Filters.” 
Generally speaking, the static course is completely unaware of the dynamic side of 
linear mathematics. 

We now turn to the basis for complaining about the current calculus courses as 
taught. There are two sources possible: those from within and those from without 
the system. Taking first those from within, we see that there is in fact just one 


1988] REVIEWS 467 


standard calculus book with but very minor differences (including the proofs given!) 
in wide use, while excellent books written by first class mathematicians that deviate 
very far are simply ignored. Since it is the professors who choose the texts we have 
to assume, no matter what they claim, that the books being used represent what they 
want. With the power of choice rests the responsibility. There are other books 
around, but the professors will not choose them! From this you conclude that there 
is, in fact, no large scale complaint from within. 

We are a democracy, and this, along with academic freedom, means that any 
college that wants to try new things has the power to do so in spite of screams of not 
being compatible with the rest of the system. Reform can be accomplished in this 
country by the democratic process of the individuals voting as they believe and not 
by central dictatorship (as was once tried by a few pure mathematicians with great 
financial backing to force the New Mathematics on the world). Until the professors 
(who have the power to select) change, there will be little change no matter how 
much noise is made or how many committees and reports are produced. 

The complaints from the outside have been mainly from the computer science 
departments who fancy that they must have a course in discrete mathematics— but 
there is little agreement on what is wanted. Most of the books so far produced look, 
at least to this reviewer, like a hodgepodge of isolated topics that are often only 
loosely connected. Indeed, often a notation that is introduced early in the book is 
simply abandoned later! 

Some of the loudest advocates in the past for abandoning calculus for a discrete 
mathematics course have gradually toned down their complaints as they have 
paused to listen to the simple fact that discrete mathematics carried very far usually 
leads to generating functions (which imply series, integration and differentiation), 
and the generation of a new identity from an old one uses the method of calculus 
(or else a completely contrived derivation that is meaningless to the student). These 
complainers seem now to be more in agreement that what they want is the calculus 
with more discrete mathematics incorporated, like generating functions and dif- 
ference equations. But, Apostol’s magnificent books have long had a lot of discrete 
mathematics in them, so the texts are there (including one by this reviewer) but are 
seldom used. 

I doubt that the computer science departments will get the mathematics depart- 
ments to change much. The American Mathematical Society forced the formation of 
the Association for Computing Machinery long ago by not even allowing a single 
session on numerical methods. The computer science departments may very well 
take up teaching discrete math courses, but they also very likely will not be satisfied 
with what they do in the long run, unless they include a lot of the continuous 
calculus. . 

Any competent mathematician knows that the use of complex variables in 
number theory has been very fruitful—the extremes of discrete and continuous 
(analytic in fact) meet here successfully to the advantage of both. Indeed, there is a 
whole field known as analytic number theory that blends the two. Similarly, as 
noted above, combinatorics in any depth rests on generating functions. It appears to 
this reviewer that any attempt to enforce a strict separation is damaging to both. 

We have yet to discuss what the calculus is. To pure mathematicians it is an 
interesting exercise in the real line and mappings. To scientists and engineers it is a 
powerful tool kit of methods (why else the name?) to be applied in many situations. 
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But here we run into trouble. The slant given by the current mathematical texts does 
not fit what is wanted and in fact leads to ridiculous results. For example, the 
popular (Bourbaki?) definition of a function as a set of ordered pairs runs right into 
the simple desire to count multiple zeros as multiple zeros (as needed, for example, 
in linear differential equations with constant coefficients). The static definition is 
also inappropriate to the calculus which is the study of change, (dynamics). A 
function in the calculus is more like a curve being traced by a moving point than it 
is a set of pairs of numbers. Newton used the word “fluxions” to describe his 
dynamic picture of what is going on. From Newton through Euler, it was the 
dynamic view of function that gave them inspiration. The static definitions may 
have provided some rigor, but at the cost of a meaningful treatment as far as the 
engineers and scientists are concerned, let alone the students! The current books are 
a poorly digested mixture of the static New Math and the dynamic calculus of 
change—and you see it in most of the books when the author forgets what he 
purports to believe and lapses into the dynamic view. 

But there is another aspect of the calculus that is sometimes recognized. The 
expression “mathematical maturity,” whatever it means to you, is probably achieved 
first, if at all, in the calculus course where there is so much generality that the 
student cannot get by with blind memorization, but must come to terms with the 
manipulation of symbols obeying given relationships—one important aspect of 
mathematics! For example, 


ft) dx = [1 dt 


and the variable x or ¢t does not matter. Any extensive simplification of the contents 
of the calculus course is in grave danger of losing this essential step in the 
mathematical development of the student. We can change the contents, to be sure, 
but we need to be very wary of making it “easy.” 

This aspect of the calculus can be stated in another way. I once went to my 
favorite dean who was very concerned with teaching and told him the following 
story. I was, so I said, teaching a weight-lifting class, and the final test was to lift 
250 pounds. I saw that many students had to repeat the course, that many got 
discouraged and dropped out, and that few passed. One night, worrying about this, 
I decided that I should cut the weights in half, and the final test would then be to 
lift one set of 125 pounds and then the second set, thus in the end lifting the 250 
pounds. I went on to say, “When I make a presentation in class of some material 
significantly easier, am I not cutting the weight in half?” We are trying in the long 
run to develop the “mental muscles” of the students, and it is not the particular 
content and its presentation that is the long-range goal. Thus, making the course 
easier and easier for the student to learn is perhaps counterproductive to the 
development of the student’s mind. This is one of the many reasons why I doubt the 
wisdom of trying to teach the calculus to the brighter high school students—lI fear 
that the weights are indeed lifted, but that there is deception in the process. There is 
not the development of the mind which is the goal. I am well aware that it is 
difficult to measure this elusive development I am speaking of, but to turn around 
and measure irrelevant things and base one’s judgment on them seems to me to be 
sheer folly—though that is what is done in most educational circles! 

It is not easy to decide if the textbooks are thicker because of more new and 
essential material or because of more worked examples, more pictures, more 
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problems, more introductions and summaries, and more background reviews of 
needed mathematics. I recall that my college text had the topics of envelopes, 
evolutes, etc. Envelopes are essential to the understanding of the solution of first 
order nonlinear ordinary differential equations because it is the envelope, which is 
not in the so-called “general solution,” that often is wanted in practice. The idea is 
thus a fundamental one, but has been dropped from the texts. I cannot do a simple 
objective survey of the books by counting lines devoted to the various aspects 
because this method founders on the simple fact that an author may use what 
appears to be a worked example and follow with a few lines of text saying “From 
the example follows the general case of... .” 

This reviewer, based on his 30 years as an industrial mathematician, tends to feel 
strongly that the ability to juggle symbols as the pure mathematician does without 
regard to the immediate meaning of the symbols is but half of being a mathemati- 
cian. The other half is the ability to apply the mathematics to the real world. The 
students also seem to feel that playing the mathematical game (as many mathemati- 
cians will claim they are doing when pushed to the wall about the meaning of 
mathematics) is no more meaningful than playing bridge and chess as a lifetime 
career. The calculus is one of the great fields of mathematics where applications are 
easy to find that can be meaningful to the students as they are. 

I need to discuss another well known aspect of mathematics— that it is the art of 
abstraction, generalization, and extension. It seems to me that this is not best taught 
by starting with abstract postulates, but rather by beginning with the concrete and 
teaching the process of abstraction. Thus in class just a few days ago we had the 
problem of finding the maximum area of a rectangle that can be put inside a certain 
ellipse. I got the answer to it, and of course it was merely the answer. Then I turned 
to the general ellipse, found the solution for it and showed that: (1) it matched one’s 
ideas for a circle, (2) it gave the earlier specific case, (3) it was much more 
understandable than the specific case, and (4) it had all the symmetry and dimen- 
sional analysis it should have. They could then see that it was simply the case of a 
square in the circle tilted until it looked like the ellipse they originally had been 
given. The abstraction shed light on the specific case that was obscured by its 
particular details. We need to do such abstractions regularly if we are to teach 
mathematics and not just cover the ground. 

There is yet one final aspect of what calculus is and is not. It is in many respects 
the systematic evasion of hard computation: how to go right to the answer without 
fumbling around! Often the calculus gives you the method. Thus trying to blend 
numerical analysis and the calculus is a lot like mixing water and oil. A small 
amount of computing is useful, though trying to claim that computing an infinite 
series to show its convergence is plain dangerous— the harmonic series will converge 
(slowly to be sure) on a computer to a finite value, but in mathematics the limit is 
infinite. Furthermore, any serious numerical computation uses floating point num- 
bers (von Neumann to the contrary!) and most mathematicians are very ill prepared 
to understand them. Floating point numbers are tricky, very unlike the standard 
number system of mathematics, and the teachers who do not master floating point 
arithmetic will simply look like fools to knowledgeable students who have done 
extensive computation on their computers. 

Let me now turn to the book being reviewed, Toward a Lean and Lively Calculus, 
which is the result of a Symposium held January 2-6, 1986, at Tulane University 
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with the 25 distinguished attendees, listed in the front. A statistician friend of mine 
used to say, “Let me pick the sample, and I will let you do the analysis.” So I 
looked closely at the names and where they were from. There were 16 from 
mathematics departments, one I recognized as an engineer, a lot of professional 
educators at various levels, a professor from Courant Institute, and one high school 
teacher—but none from industry that I could recognize. Well, given the sample, it is 
not hard to deduce, without reading much, what the book will contain. The reports 
and papers are good as far as they go. For example, Stein is clearly right when he 
tries to find out where the complaints are coming from, and finds none in the 
standard places, and Renz agrees that the discrete is being oversold. 

Being in a rather protected place in the educational system, I had not known that 
classes of 500 or more calculus students are widespread. I was appalled at first, but 
when I thought it over, there is no absolute reason against classes of this size, 
provided they are well done. But the fact that graduate students manage most of the 
quiz sections, and probably not the best mathematical and teaching professors are 
used to give the lectures, makes one worry. One proposal in the report that the 
graduate students be given courses in teaching will have no effect unless the 
graduate students perceive that it will count toward getting their Ph.D. degrees—and 
I cannot believe that mathematics departments will do this in the near future. What 
is seldom thought about is this; only one faculty person sees the class size of 500 or 
more, but 500 students see that size, which is quite a different matter! I have had 
some time to think the idea over and to talk to friends about it, and I cannot believe 
that there are many places (there may be a few) where it is well done. I have a sick 
feeling when I consider how such courses probably work out in practice. 

If you think that the calculus is merely technique and the course is a drill place, 
then maybe you can defend that size, but if you think that it is an educational 
experience then you will have to reconsider matters. Of course it can be defended on 
the grounds that it makes the students learn for themselves and that this is good, but 
I still have a very heavy heart when I think that this is what we are doing to eager, 
expectant beginners in the field of mathematics. No wonder we have so few math 
majors! 

The failure rate of 50% in many places also makes me wonder. To the outsider it 
appears to be a heartless method without any attempt to weed out the losers before 
starting. One would think that a good precalculus course, or else a stiff qualifying 
exam in order to be excused, would be widespread, and once a student was in a 
calculus course there would be a reasonable probability of passing. But according to 
the report, failure rates of 50% seem to be regular—and there seems to be little 
apology for it. It is foolish for the mathematics departments to accept the onus of 
chminaling the less able students from the university! 

I suspect that the complaints about the calculus course arise from these monster 
classes and the way they are handled—and from the professors who do not want to 
teach the calculus course, though I myself clearly believe the calculus course should 
be changed. 

As I said, there is much wisdom in the report; Renz for example seems to 
recognize that we are at a local optimum of teaching the calculus, and that small 
reforms will not be effective—but I doubt that the installed professors are willing to 
go for a big change; it is too much trouble! 
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Lax is enthusiastic about the importance of the continuous mathematics, and he 
is right; still there is an increasing amount of discrete mathematics, probability, and 
statistics in science and engineering, and it will not go away, let alone go away from 
the coming fields that will increasingly need a good, solid knowledge of mathe- 
matics. One cannot, in my opinion, do serious statistics using only the discrete, since 
the “statistics of a statistic” is a fundamental idea that is essential to the under- 
standing of statistics, and this involves the continuous! 

Maurer is also very good; but it becomes invidious to name some and not others, 
while a complete listing of all the names seems foolish. Even the preface is full of 
wisdom. Yet nowhere did I see the recognition that a calculus course is only part of 
a system of mathematical education and that the optimization of it is counterpro- 
ductive to the mathematical education of society since it is well known in the field 
of systems engineering that optimizing a component generally degrades the system’s 
performance. Calculus is needed both broadly, and in much greater depth, than was 
necessary in the past if people are to participate in the evolution of our society at 
other than a superficial-level. We indeed live in an increasingly technical society in 
all fields, from science and engineering through the hard sciences and into the softer 
sciences, hence mathematics, along with probability and statistics, is now entering, 
inevitably, into even the humanities and arts. Mathematics is the language of clear 
thinking, and where clarity is wanted there is the need for mathematics. Calculus, 
being the basic language for describing change, will remain the basic intellectual 
tool for understanding our increasingly complex, changing society. Reform of the 
calculus course is necessary but I believe it must come from within the mathematical 
community and cannot be imposed from without. Will the mathematicians respond? 
I want to believe that they will, but I doubt it! 


Decompositions of Manifolds. By Robert J. Daverman. Academic Press, 1986. xi + 
317 pp. 


JAMES W. CANNON 
Department of Mathematics, Brigham Young University, Provo, Utah 84602 


Applications 


At least three famous problems in geometric topology have been solved in recent 
years by arguments involving decompositions of manifolds. These problems are the 
double-suspension problem for homology spheres, the characterization problem for 
topological manifolds, and the 4-dimensional Poincaré conjecture. Daverman’s book 
includes a proof of the first of these and gives a beautiful exposition of the ideas, the 
céntral examples, and the types of argtiments involved in the decomposition theory 
related to all of these problems. As a highlight, Daverman proves the cell-like 
approximation theorem of Edwards which has been known for many years now but 
which has never before appeared in print. 


Elementary Examples 


Consider a dozen eggs. Their union might be considered an infinite topological 
space X with twelve components. But it is equally natural to consider each of the 
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twelve eggs as a single point which together form a space Y having exactly twelve 
points and the discrete topology. The space Y is formed from the space X by 
identification or decomposition. 

The general construction is this. Let X be any space, and let ~ be any 
equivalence relation on X. Let Y= X/~ denote a new space whose points are 
equivalence classes of points of X. Declare a subset U of Y to be open in Y if the 
union of the equivalence classes of U form an open subset of X. Then Y is called a 
decomposition space of X. Decomposition theory compares the properties of X and 
Y. There is, of course, a natural continuous surjection 7: X — Y which takes each 
point of X to the equivalence class which contains it. 

The projection map a: X — Y of the preceding paragraph suggests a standard 
way of obtaining a decomposition space. Let X be any topological space, Y a set, 
and f: X — Y a surjection. Declare x, y © X to be equivalent if and only if 
f(x) = f(y). We may then consider Y as the set of equivalence classes and endow Y 
with the identification topology. Then the map f becomes the projection map. 

As an example of a decomposition consider the case where X is a plane and the 
only equivalence class having more than one point is a (closed) arc in X. Then Y is 
also a plane, as the reader may prove for himself. On the other hand, if the only 
equivalence class having more than one point is a simple closed curve in X, then Y 
is homeomorphic with a plane to which a tangent bubble has been attached. 

Daverman’s book considers the case where X is a topological manifold and ~ 
has especially nice properties. There are two typical hypotheses on ~ . The first is 
that ~ should be upper semicontinuous (usc). One way of stating this is that Y 
should be Hausdorff. Another is that the image of a closed subset of X should be 
closed in Y. The second typical hypothesis on ~ is that it should be cell-like. This 
means that each equivalence class should be a compact set having the Cech-homo- 
topy properties of a point; that is, this compact set should be contractible to a point 
in each neighborhood of itself. These two hypotheses taken together ensure that X 
and Y are hereditarily of the same homotopy type; that is, if U is an open subset of 
Y, then the inverse image of U in X is homotopy equivalent with U. 

The theory is full of beautiful and surprising results. 


Sources of Decomposition Theory 


Here are three of the beautiful results that led to the development of the theory 
of decompositions of manifolds. 


R. L. Moore’s theorem. If f: S* > X is a surjection from the 2-sphere S* onto a 
Hausdorff space X such that, for each x © X, S?\f~'(x) is nonempty and 
connected, then X is also a 2-sphere. Moore’s students asked what the appropriate 
generalization would be in higher dimensions. 


R. L. Wilder’s study of manifolds. J. W. Alexander defined a topological 2-sphere 
S in Euclidean 3-space E? in such a way that the union B of S and its interior Int S$ 
in E° was not a 3-ball. We call the set B an Alexander horned cube. R. L. Wilder 
noted that, if one were to sew two copies of B together along their 2-sphere 
boundaries by a homeomorphism of those boundaries, then the resulting union had 
all of the standard algebraic properties of the 3-sphere, just as if one had sewn 
together two copies of the 3-ball. Wilder asked whether the sewing was in fact the 
3-sphere. 
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P. A. Smith’s theory of periodic mappings. P. A. Smith proved that if one were to 
consider a homeomorphism 7: S? > S? from the 3-sphere S$? to itself whose square 
T oT was the identity, then the fixed point set of T, that is, the set of points x such 
that T(x) = x, was also a topological sphere, possibly of lower dimension. Smith 
asked whether every such periodic map was conjugate to an orthogonal map. 

Notice that none of these three results mentions decompositions of manifolds. 
But each of them suggested an important problem or field of study that led to the 
study of decomposition spaces. This transition to decomposition spaces occurred in 
large measure because of the work of R. H. Bing. 


Tribute to R. H. Bing 


It was R. H. Bing who outlined all of the standard problems, examples, and 
techniques in the study of decompositions of manifolds. 


Pointlike decompositions. R. L. Moore’s theorem can easily be restated as a 
decomposition space theorem: if X is a cell-like usc decomposition of the 2-sphere, 
then X is a 2-sphere. The students of R. L. Moore asked what the appropriate 
generalization of this remarkable theorem would be in higher dimensions. G. T. 
Whyburn suggested a condition on the point preimages of a map and their 
embedding in the domain of the map in order that the domain and range be 
homeomorphic. He called a set pointlike in a space if its complement was homeo- 
morphic with the complement of each and every point of the space. Two related 
conditions are cellularity and cell-likeness. A set in an n-manifold is cellular if it is 
the intersection of a nested sequence of n-cells. In Euclidean space a set is point-like 
if and only if it is cellular. A set in an n-manifold is cell-like if it contracts in each 
neighborhood of itself. Every cellular set in a manifold is cell-like. Bing was the first 
to give an example of a point-like (cellular) decomposition of a Euclidean space 
such that the decomposition space was not a manifold. This example was named the 
dogbone space. 


The Alexander horned sphere, Wilder’s problem, Bing, and the rubber bands. Bing 
was interested in the question implicit in P. A. Smith’s theory of periodic maps: is 
every involution of the 3-sphere conjugate to an orthogonal involution? Bing 
realized that if Wilder’s sewing of two Alexander horned cubes were S?, then there 
would be a nonorthogonal involution of S? which interchanged the two cubes. Bing 
then proceeded to realize Wilder’s sewing as a point-like decomposition space of the 
3-sphere S?. In turn, Bing hit upon an idea for showing that the decomposition 
space was in fact S°; the nondegenerate elements of the decomposition were 
described as infinite intersections of solid tori; he realized that it would suffice to 
arrange the solid tori in such a manner that they were near the union of their 
original positions yet had small diameter. He took rubber bands as models of solid 
tori, arranged these rubber bands in the desired configuration, and then proceeded 
to play with the rubber bands in order to see whether they could be “shrunk.” His 
clever solution to the shrinking problem added a multitude of new ideas to the 
theory of manifolds. In the process, Bing answered the Wilder question in the 
affirmative and demonstrated the existence of nonorthogonal involutions of S°. 


The dogbone space and minimal counterexamples. Bing typically worked by 
induction on the complexity of the examples his techniques wou!d handle. He would 
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try to prove a theorem for two hours, develop a counterexample for the next two 
hours, and cycle the procedure. His techniques often resulted not only in the correct 
theorem but in the best possible example to show the limits of that theorem. “A 
good theorem,” he delighted in saying, “is one that is almost false.” In a sense, 
Bing’s dogbone space was a minimal counterexample to the conjecture that cellular 
decompositions always had manifolds as decomposition spaces. 


Manifold factors. Shapiro showed that the contractible 3-manifold defined by 
Whitehead which was not Euclidean 3-space was nevertheless a factor of Euclidean 
4-space. Bing, who had not seen the proof, discovered a proof for himself and 
generalized his own proof to show that the dogbone space was also a factor of E+. 
Bing’s procedure was very powerful and eventually led people to the realization that 
manifold factors and generalized manifolds were essentially the same class of 
spaces. 


Highlights of Decomposition Theory 


We here describe three results of considerable interest that were solved by use of 
the mature theory of decomposition spaces of manifolds. 


The double-suspension theorem. A finite complex that has the local and global 
homological properties of the n-sphere is called a simplicial homology sphere. 
Poincaré defined the first homology sphere D which was a true topological manifold 
but which was not a true topological sphere. His homology sphere was of dimension 
3 and was clearly not a sphere because it was not even simply connected. If D is 
embedded in a Euclidean space E“*, if E* is considered as a coordinate hyperplane 
of a still higher dimensional Euclidean space E”**, and if S denotes the standard 
unit sphere in the orthogonal complement E” of E*, then one may form the join 
S*D of D with S. The join S* D is simply the union of all straight line intervals 
joining a point of S with a point of D in E"**. The join S* D is called the nth 
suspension of D. If D were a true sphere, then S * D would be a true sphere of 
higher dimension. However, since D is not simply connected, for n= 1, S*D 
cannot possibly be a true sphere, not even a manifold, for it is not like a manifold at 
the two points of the 0-sphere S. People noted, however, that for n = 2 the join 
S * D, which was called the double-suspension of D, had all of the standard local 
and global invariants of the k + 2-sphere. Milnor asked, as one of his famous 
problems, whether it could possibly be a true topological sphere. It was clear that if 
it were a sphere, then it would be impossible to define a piecewise linear homeomor- 
phism from the finite complex S* D to the standard sphere. R. D. Edwards and 
J. W. Cannon finally resolved the problem by decomposition space techniques by 
showing that the double suspension of any homology sphere is indeed a topological 
sphere. The homeomorphisms they described did indeed involve infinite processes, 
indeed shrinking techniques of the type suggested by Bing. 


The 4-dimensional Poincaré conjecture. The most famous problem in geometric 
topology is clearly the Poincaré conjecture: a compact, topological manifold without 
boundary is a sphere if it has the homotopy type of a sphere. This conjecture was 
confirmed for manifolds of dimension > 5 during the sixties by Smale, Zeeman, 
Stallings, and others. Freedman confirmed the conjecture for dimension 4 in the 
early eighties. Freedman’s work depended heavily on decomposition space 
techniques. 
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Characterization of manifolds. Examples of decompositions of manifolds, par- 
ticularly examples due to Bing, showed the existence of spaces that in all of their 
standard algebraic properties were like manifolds but which were not topological 
manifolds. The examples made it apparent that there would be no simple characteri- 
zation of the topological manifold in terms of simple, checkable topological proper- 
ties short of the local coordinate definition. It came as a great surprise, therefore, 
when decomposition studies indicated the probability of the following conjecture: a 
generalized n-manifold having the disjoint disks property, n > 5, is a topological 
n-manifold. A generalized n-manifold M is a Euclidean neighborhood retract whose 
local homology coincides at each point with the local homology of Euclidean 
n-space. The space M has the disjoint disks property if maps f, g: B* > M can be 
approximated by maps /’, g’: B* > M having disjoint images. Cannon stated the 
conjecture early in 1977 and proved many special cases. The general case is still 
partially in question, though major steps presented by R. D. Edwards and F. Quinn 
which almost complete the proof seem firmly established. 


Daverman’s Book and the Three Periods of Decomposition Theory 


Daverman himself is one of the modern masters of decomposition theory. He 
divides the history of this subject into three rough areas. 


The Moore period. The first period dealt primarily with decompositions of 
2-manifolds. It was typified by the theorem of R. L. Moore mentioned above. The 
main technique centered in an axiomatic characterization of 2-manifolds. 


The Bing period. The second period dealt primarily with 3-manifolds. The 
central technique was the shrinking of decompositions. The principal practitioner 
was R. H. Bing. The most beautiful results were a catalogue of beautiful examples 
that were not shrinkable but which were factors of 4-manifolds. Daverman states 
the following positive theorem as prototypical. 


THEOREM (Bing and Armentrout). Let G be an upper semicontinuous decomposi- 
tion of a 3-manifold M into cellular sets. Then M/G is homeomorphic to M if and only 
if G is shrinkable. 


The Edwards period. Daverman associates the third period with the work of 
Edwards. The central result is Edwards’ cell-like approximation theorem. 


THEOREM (CELL-LIKE APPROXIMATION). Let G denote an upper semicontinuous 
decomposition of an n-manifold M, n > 5, into cell-like sets. Then the decomposition 
map 7: M — M/G can be approximated by homeomorphisms if and only if M/G is 
finite-dimensional and satisfies the following disjoint disks property: any two maps of 
the 2-cell B* to M/G can be approximated by maps having disjoint images. 


The bibliography of this book reads like a listing of my mathematical friends and neighbors during a 
period of my life when I lived in a different mathematical neighborhood, city, and state. I thank those 
friends and neighbors for years of friendship and pleasure. I thank Bob Daverman for recording the 
family history. 
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General, P, L**. Encyclopaedia of Mathemat- 
ics, Volume 1: A-B. Ed: M. Hazewinkel. D Reidel 
(US Distr: Kluwer Academic), 1988, ix + 488 pp, 
$149. [ISBN: 1-55608-000-X] First of ten volumes 
of a translation with editorial updates of the Soviet 
Mathematical Encyclopaedia, written in the 1970's 
and originally published during 1977-1985. (Orig- 
inal Russian edition of 150,000 is sold out.) Con- 
tains surveys for non-specialists (e.g., analytic num- 
ber theory, axiomatic method, basis), short arti- 
cles on specific results or problems (e.g., Balayage 
method), and definitions, all arranged alphabeti- 
cally with extensive cross referencing to MR num- 
bers. Most items contain references, frequently sup- 
plemented by the editors to include current Western 
sources. Final volume will be devoted to extensive 
indices cross-referencing terms and MR classification 
numbers. LAS 


General, S(13). Inequalities. P.P. Korovkin. 
Transl: Sergei Vrubel. Little Math. Lib. MIR (US 
Distr: Imported Pub), 1986, 72 pp, $1.95 (P). [ISBN: 
0-8285-3221-4] Cute little book that explores various 
inequalities that pop up in mathematics. The format 
is to introduce a definition or a certain inequality (for 
example, the Bernoulli inequality), state and prove a 
fact or two, then follow with some solved problems. 
This would be a nice book to use in a high school or 
freshman problem solving class. (First Edition, TR, 
October 1977.) LC 


General, §(13). Dictionary of Mathematics Terms. 
Douglas Downing. Barron’s, 1987, xiv + 241 
pp, $8.95 (P). [ISBN: 0-8120-2641-1] Definitions for 
over 600 terms from abacus to zero. Some cross- 
referencing. Uneven in accuracy, clarity, complete- 
ness, and utility. Symbol list for algebra, calculus, 
geometry, vectors, sets, and logic. Brief logarithms, 
trigonometry, and statistics tables. Handy, but far 
from the last word. A compact 4 3/16” x 7”. Good 
buy for high school libraries and college students. JK 


General, P. Fourteen Papers Translated from the 
Russtan. V.I. Arnol’d, et al. AMS Transl. Ser. 2, 
V. 137. AMS, 1987, v + 129 pp, $46. [ISBN: 0-8218- 
3113-5] 


Elementary, T(11: 1). Intermediate Algebra, Sec- 
ond Edition. Daniel L. Auvil. Addison-Wesley, 1987, 
xi + 593 pp, $31.95. [ISBN: 0-201-11046-6] Clearly 
written in a conversational and unitimidating man- 
ner. Presents practical uses of the material. Topics 
include: real numbers, polynomials, inequalities, ra- 
tional expressions, rational exponents, complex num- 
bers, graphing, systems of linear equations, conic sec- 
tions, functions, logarithms, sequences, and series. 
Plenty of problems follow each section. LW 
Mathematics Appreciation, T(14-16), S**, 
L**, The Problems of Mathematics. Ian Stew- 
art. Oxford U Pr, 1987, 257 pp, (P). (ISBN: 0-19- 
289182-0] A lively, non-technical portrait of impor- 
tant themes in mathematics, each traced from an- 
cient roots to yesterday’s research: primes (“equa- 
tions with holes”), geometry, fractals, groups, knots, 
computability (“the used axiom salesman”), and 
much more. Witty and creative, Stewart writes with 
authority in the manner of a well-researched travel 
guide: he conveys enthusiasm and insight on every 
page. LAS 


Finite Mathematics, T(13: 1). Elements of Data 
Processing Mathematics, Third Edition. Wilson T. 
Price, Charles Peselnick. Holt, Rinehart & Winston, 
1987, xi + 670 pp, $26.50. [ISBN: 0-03-000178-1] 
A text for business students, it introduces them to 
such topics as modular arithmetic, bases 2, 8, and 
16, flowcharts and sets, logical forms, basic algebra, 
simultaneous equations, and some linear program- 
ming techniques. It is clearly written and has lots of 
problems that relate the material to computers. MZ 
Education, P. Cognitive Processes in Mathematics. 
Ed: John A. Sloboda, Don Rogers. Keele Cogni- 
tion Seminars, V. 1. Oxford U Pr, 1987, x + 208 
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pp, $57.50. [ISBN: 0-19-852163-4] Ten papers from a 
1985 conference on cognition. Contributors are psy- 
chologists, and mathematics is used as the medium 
in which to represent issues in cognition. Deals pri- 
marily with early competence in counting, computa- 
tion, and simple algebra, and with representation of 
mathematical ideas. Note the price. MW 


Education, P**, L**. Twice As Less: Black 
English and the Performance of Black Students in 
Mathematics and Science. Eleanor Wilson Orr. WW 
Norton, 1987, 240 pp, $15.95. [ISBN: 0-393-02392-3] 
A detailed analysis of how Black vernacular clashes 
with mathematical convention leading to incompre- 
hensible confusion in which “half as large as” be- 
comes “half as small as,” and “one-third of” mas- 
querades as “three times less.” Performance of Black 
students in school mathematics is thus diminished 
by linguistic dissonance, with tragic consequences. 
Full of verbative examples; original; controversial; 
immensely important. LAS ‘ 


Education, P, L. Improving Indicators of the Qual- 
ity of Science dnd Mathematics Education in Grades 
K-12. Ed: Richard J. Murnane, Senta A. Raizen. 
National Academy Pr, 1988, ix + 220 pp, (P). [ISBN: 
0-309-03740-9] An “indicator” is a statistic (e.g., 
test scores, enrollment data) used to monitor qual- 
ity. This report from the National Research Council 
analyzes indicators in science and mathematics ed- 
ucation of learning, of student behavior, of teaching 
quality, of curriculum, and of financial support, and 
makes recommendations concerning improvement in 
the reliability, quality, and usefulness of such indica- 
tors. LAS 


Combinatorics, P, L. Sphere Packings, Lattices 
and Groups. J.H. Conway, N.J.A. Sloane. Grund. 
der math. Wissenschaften, V. 290. Springer-Verlag, 
1988, xxvii + 663 pp, $87. [ISBN: 0-387-96617-X] 
What arrangement allows the largest number of base- 
balls to be packed in a box? This is the problem 
which motivates this book. Lattices and their au- 
tomorphism groups enter the picture by considering 
the centers of the spheres in an arrangement. This 
book is a good comprehensive introduction to the 
sphere-packing problem and its relation to lattices, 
coding theory, and number theory. Many of the ta- 
bles and constructions have not appeared before in 
print. LC 


Combinatorics, P. Lecture Notes in Mathematics- 
1278: Invariant Theory. Ed: S.S. Koh. Springer- 
Verlag’ 1987, 102 pp, $13.90 (P). [ISBN: 0-387-18360- 
4] A coliection of seven papers authored by Pom- 
merening, l‘ormanek, Lascoux, Kung, Olver, Kempf, 
and Grosshans. SG 


Combinatorics, T(16-17: 2), L. Combinatorics 
of Ezperimental Design. Anne Penfold Street, Deb- 
orah J. Street. Clarendon Pr, 1987, xiv + 400 pp, 
$27.50 (P); $55. [ISBN: 0-19-853255-5; 0-19-853256- 
3] Introduction to theory of designs with emphasis 
on aspects of interest to both combinatorialists and 
statisticians. The unifying concept is the pairwise 
balanced design. The main prerequisite is linear al- 
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gebra; results from statistics, number theory, and fi- 
nite field theory are introduced when needed. LC 


Combinatorics, P. Irregularities of Distribution. 
Jézsef Beck; William W.L. Chen. Tracts in Math., 
V. 89. Cambridge U Pr, 1987, xiv + 294 pp, $54.50. 
(ISBN: 0-521-30792-9] A self-contained treatment of 
refinements and generalizations of the following the- 
orem, first conjectured in 1935 by van der Corput: 
For any infinite sequence of real numbers in the unit 
interval, one can choose n and two subintervals of 
equal length such that the difference between the 
number of elements less than n in the subintervals 
is arbitrarily large. GG 


Combinatorics, S(17), P. Lecture Notes in Con- 
trol and Information Sciences-94: Computational 
Complezity of Bilinear Forms. Hari Krishna. 
Springer-Verlag, 1987, xv + 166 pp, $24.40 (P). 
[ISBN: 0-387-17661-6] A study of the relationship 
between the multiplicative complexity associated 
with the computation of a class of algebraic func- 
tions called bilinear forms and linear error-correcting 
codes. A new class of linear error-correcting codes is 
derived from the bilinear algorithms used for aperi- 
odic convolution of certain sequences. CEC 


Number Theory, L. Geometry in Practice and 
Numbers in Theory. Peter Hilton, Jean Pedersen. 
Mono. in Undergrad. Math., V. 16. Journal of Un- 
dergrad. Math. (Dept. of Math., Guilford Coll., 
Greensboro, NC 27410), 37 pp, $8.50 (P). Describes 
an algorithm for folding paper strips into certain 
polygons and investigates number theoretical ques- 
tions which arise from this algorithm. Includes re- 
sults from earlier papers along with some new results. 
An interesting exposition which could easily be used 
as the basis of an undergraduate independent study 
project. RH 

Number Theory, S(15-16). Fascinating Frac- 
tions. N.M. Beskin. Transl: V.I. Kisin. Little Math. 
Lib. MIR (US Distr: Imported Pub), 1986, 87 pp, 
$2.95 (P). [ISBN: 0-8285-3195-1] A clearly-written 
exposition of the basic properties of continued frac- 
tions motivated by two interesting problems: approx- 
imations to # and the establishment of the Gregarian 
calendar. At this price a great bargain. SG 


Linear Algebra, S(14). Linear Algebra. V.A. 
Ilyin, E.G. Poznyak. Transl: Irene Aleksanova. 
MIR (US Distr: Imported Pub), 1986, 285 pp, 
$10.95. [ISBN: 0-8285-3340-7] Very classical, theo- 
retical treatment at higher than average level. Nine 


‘chapters cover matrices and determinants, linear 


spaces, systems of linear equations, Euclidean spaces, 
linear operators, iterative methods, bilinear and 
quadratic forms, tensors, elements of group theory. 
No exercises. GG 


Group Theory, P. Lecture Notes in Mathematics- 
1291: Correspondances de Howe sur un corps p- 
adique. Colette Moeglin, Marie-France Vignéras, 
Jean-Loup Waldspurger. Springer-Verlag, 1987, vii 
+ 163 pp, $17.30 (P). [ISBN: 0-387-18699-9] 

Algebra, T(18). Lectures on Artinian Rings. An- 
dor Kertész. Akademiai Kiado, 1987, 427 pp, $48. 
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[ISBN: 963-05-4309-5] A thorough treatment of the 
topic beginning with set theory to ring theory and 
moving through such topics as semi-simple rings, 
rings of quotients, group rings, and quasi-Frobenius 
rings. A modest number of exercises are included. SG 


Algebra, T(12-13), P. Algebra—aller Anfang ist 
leicht, 9. Auflage. Herbert Kastner, Peter Goth- 
ner. BG Teubner, 1987, 155 pp, 8,40M (P). [ISBN: 
3-222-00382-5] A gentle but careful introduction to 
abstract algebra for use in German high schools. JDB 


Algebra, S(18), P. Lecture Notes in Mathematics- 
1284: Konstruktive Galotstheorie. B. Heinrich Mat- 
zat. Springer-Verlag, 1987, x + 286 pp, $25.80 (P). 
[ISBN: 0-387-18444-9] On the inverse problem of Ga- 
lois theory for a field K, i.e., is every finite group the 
Galois group of some polynomial in K[z]? JD-B 


Algebra, P. Lecture Notes in Mathematics-1289: 
K-Theory, Arithmetic and Geometry. Ed: Yu. I. 
Manin. Springer-Verlag, 1987, 399 pp, $39.40 (P). 
[ISBN: 0-387-18571-2] Nine articles originating in lec- 
tures given at the Manin Seminar at Moscow Univer- 
sity over the years 1984-1986. GG 


Calculus, C(13: 2), S*. Computer Ezplorations in 
Calculus. IBM PC or Apple II. K.D. Stroyan. Har- 
court Brace Jovanovich, 1986, x + 278 pp, $13.95 
(P). [ISBN: 0-15-512636-9] An introduction to nu- 
merical and graphical programming and a supple- 
ment to the first two semesters of calculus. Does not 
require previous experience with programming and 
programming technicalities are kept to a minimum. 
To be used concurrently with the appropriate topic 
in calculus. Programming is done in Basic. Lots of 
exercises. CEC 


Calculus, T*(14: 1), L. Mathematical Analysis: A 
Fundamental and Straightforward Approach. David 
S.G. Stirling. Math. & Its Applic. Halsted Pr, 1987, 
172 pp, $39.95. [ISBN: 0-470-20903-8] Contents of- 
ten termed advanced calculus—lots of limits, some 
differentiation, series, and integration. Reasonable 
number of exercises, often with solution or hint in 
back. Frequently uses mathematical shorthand. Al- 
though it covers no material beyond first-year calcu- 
lus, a good book for introducing rigor and proof. GG 


Calculus, T(13: 1). Calculus for the Managerial, 
Iife, and Social Sciences. S.T. Tan. Prindle, Weber 
& Schmidt, 1986, x + 525 pp. [ISBN: 0-87150-003- 
5] Suitable for one semester class. The text is very 
applications oriented; in fact, it has a list of appli- 
cations from business and economics, social sciences, 
life sciences, and other areas. One quibble is that 
integration is not ernphasized as area under curve. 
Nicely written. MZ 


Real Analysis, S(18), P*. Regular Variation. 
N.H. Bingham, C.M. Goldie, J.L. Teugels. Encycl. of 
Math. & Its Applic., V. 27. Cambridge U Pr, 1987, 
xix + 491 pp, $75. (ISBN: 0-521-30787-2] “Com- 
prehensive account of the theory and applications of 
regular variation. Concerned with the asymptotic 
behavior of a real function of a real variable z which 
is ‘close’ to a power of z.” Well-motivated, clearly 
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written, extensive bibliography, exercises at end of 
each chapter. BH 


Complex Analysis, T(16: 1, 2), L. Basic Com- 
plez Analysts, Second Edition. Jerrold E. Marsden, 
Michael J. Hoffman. WH Freeman, 1987, xii + 604 
pp. (ISBN: 0-7167-1814-6] “Substantial revision of 
the First Edition in detail, but not in spirit. Many 
passages have been rewritten for clarity.” Added 
topics include integrals along continuous curves, nor- 
mal families, Riemann mapping theorem, and func- 
tions of bounded variation, resulting in 130 page fat- 
tening of original (TR, April 1974). Main attrac- 
tion remains numerous diagrams, examples, and ex- 
ercises. BH 


Differential Equations, T(15: 1), L. Ordinary 
Differential Equations. M.L. Krasnov. Transl: Irene 
Aleksanova. MIR (US Distr: Imported Pub), 1987, 
150 pp, $4.95 (P). [ISBN: 0-8285-3444-6] This slim 
but packed volume is a translation of M.L. Kras- 
nov’s Russian book. There are five chapters: first- 
order differential equations, higher-order differential 
equations, systems of differential equations, stability 
theory, and some additional problems. There is a 
mix of theory (e.g., statements of unique existence 
of solutions to the Cauchy problem) and approxi- 
mation concepts; major emphasis is on systems of 
differential equations. Includes a solid treatment of 
Lyapunov stability. Chapter exercises; suitable for a 
second course in differential equations. RSF 


Differential Equations, P. Lecture Notes in 
Mathematics-1285: Differential Equations and Math- 
ematical Physics. Ed: I.W. Knowles, Y. Saito. 
Springer-Verlag, 1987, xvi + 499 pp, $48.50 (P). 
[ISBN: 0-387-18479-1] Papers from an international 
conference on differential equations and mathemati- 
cal physics held at the University of Alabama, March 
1986. MR 


Partial Differential Equations, P. Stochastiche 
Evolutionsgleichungen und deren Steuerung. Wil- 
fried Grecksch. Teubner-Texte zur Math., B. 98. BG 
Teubner, 1987, 212 pp, 22M (P). [ISBN: 3-322-00406- 
6] On stochastic evolution equations in the sense of 
Ito, and their control theory. JD-B 


Numerical Analysis, P. The State of the Art in 
Numerical Analysis. Ed: A. Iserles, M.J.D. Powell. 
Inst. of Math. & Its Applic. Conf. Ser., V. 9. 
Clarendon Pr, 1987, xiv + 719 pp, $98. [ISBN: 0-19- 
853614-3] The proceedings of the joint IMA/SIAM 
conference held in April 1986 at the University of 


Birmingham. A collection of 23 papers covering the 


major developments of the last decade in numerical 
analysis. RH 


Operator Theory, P. Lecture Notes in Mathe- 
matics-1272: Commuting Nonselfadjoint Operators 
in Hilbert Space: Two Independent Studies. Moshe S. 
Livsic, Leonid L. Waksman. Springer-Verlag, 1987, 
115 pp, $13.10 (P). [ISBN: 0-387-18316-7] Two ad- 
vanced discussions intended for specialists. AWR 

Analysis, P, L. N. Bourbaki, Elements of Math- 
ematics: Topological Vector Spaces, Chapters 1-5. 
Transl: H.G. Eggleston, S. Madan. Springer-Verlag, 
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1987, vii + 364 pp, $79. [ISBN: 0-387-13627-4] An 
English translation of the fifth book in the famous se- 
ries. For those who have only heard about but never 
studied the Bourbaki style, this is as good a place 
as any to start. “In principle, it requires no partic- 
ular knowledge of mathematics on the reader’s part, 
but only a certain familiarity with mathematical rea- 
soning and a certain capacity for abstract thought.” 
Keep that in mind. AWR 


Analysis, T(15-16: 1), L. Methods of Summation. 
Bertram Ross. Descartes Pr (2-13-10 Kaguike, Ko- 
riyama, Japan 963), 1987, ix + 127 pp, 4,000 yen 
(P). This little book is a collection of worked prob- 
lems illustrating techniques in summing infinite se- 
ries in closed form. Topics include the calculus of 
finite differences, the Psi function, fractional calcu- 
lus and Fourier series. The prerequisites are some 
advanced calculus and knowledge of the Gamma and 
Psi functions. This should be of interest to applied 
mathematicians and engineers. LC - 


Analysis, P. Seminar Analysis of the Karl- 
Weterstraf-Institute of Mathematics Academy of 
Sciences of the GDR 1985/86. Ed: Bert-Wolfgang 
Schulze, Hans Triebel. Teubner-Texte zur Math., B. 
96. BG Teubner, 1987, 322 pp, 33.50 DM (P). [ISBN: 
3-322-00429-5] Collection of five papers in analy- 
sis touching topics such as vector-valued Sobolev 
spaces, multipliers in Besov-Triebel-Lizorkin spaces, 
non-linear partial differential equations, and scatter- 
ing theory. LC 


Differential Geometry, S(18), P. Lecture Notes 
in Mathematics-1288: Generalized Analytic Func- 
tions on Riemann Surfaces. Yuri L. Rodin. Springer- 
Verlag, 1987, v + 128 pp, $13.90 (P). [ISBN: 0- 
387-18572-0] Generalized analytic (pseudo-analytic) 
functions on Riemann surfaces arise as solutions 
to elliptic differential equations represented by the 
Carleman-Bers-Vekua equation du + au + bu = 0. 
Properties such as the argument principle and the 
Liouville theorem hold for these functions. The au- 
thor applies the theory to the Riemann-Roch theo- 
rem, to Abel’s problem on functions with prescribed 
zeros and poles, and to some physical problems. GG 


Geometry, T(15-17: 1), S, P, L. Geometries and 
Groups. V.V. Nikulin, I.R. Shafarevich. Transl: M. 
Reid. Universitext. Springer-Verlag, 1987, viii + 
251 pp, $32.50 (P). [ISBN: 0-387-15281-4] A study 
of geometries that includes a classification based on 
the relation between their groups and the discrete 
groups of Euclidean space. The study reaches to the 
crystallographic groups, Lobachevskian geometry, as 
well as to interesting geometries on the torus. SS 


Operations Research, S(18), P. Theory of Lin- 
ear and Integer Programming. Alexander Schrijver. 
Ser. in Disc. Math. Wiley, 1986, xi + 471 pp, $71.95. 
[ISBN: 0-471-90854-1] A highly theoretical treatment 
of linear and integer programming suitable for grad- 
uate work. Discusses complexity theory, lattices and 
linear Diophantine equations. Extensive treatment 
of polyhedra, Khachiyan’s method, and a brief dis- 
cussion of Karmarkar’s method. No exercises. SM 
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Operations Research, S(18), P. Surveys in Game 
Theory and Related Topics. Ed: H.J.M. Peters, O.J. 
Vrieze. CWI Tract, V. 39. Math Centrum, 1987, xii 
+ 330 pp, Dfl. 50.40 (P). [ISBN: 90-6196-322-2] A 
set of thirteen survey papers concerning the current 
state-of-the-art in cooperative and non-cooperative 
game theory. SM 


Operations Research, S(17), P. Mathematical 
Modelling Methodology, Models and Micros. J.S. 
Berry, et al. Math. & Its Applic. Halsted Pr, 
1986, 318 pp, $89.95. [ISBN: 0-470-20717-5] A col- 
lection of 23 short articles on mathematical model- 
ing. Directed at undergraduate and graduate school 
instructors. Includes a short chapter on using model- 
ing techniques in calculus instruction. Emphasis on 
models and software suitable for implementation on 
microcomputers. SM 


Operations Research, T(17-18: 1), P. Team 
Theory. K.H. Kim, F.W. Roush. Math. & Its Ap- 
plic. Halsted Pr, 1987, ix + 246 pp, $54.95. [ISBN: 
0-470-20848-1] A “team” is a group of “persons” with 
identical interests but independent information and 
actions. The field has similarities to game theory. 
Cooperation is the underlying behavioral assump- 
tion. Many exercises. Extensive bibliography. SM 


Dynamical Systems, P. Periodic Solutions of 
Hamiltonian Systems and Related Topics. Ed: P.H. 
Rabinowitz, et al. NATO ASI Ser. C, V. 209. D 
Reidel (US Distr: Kluwer Academic), 1987, xi + 
284 pp, $59. [ISBN: 90-277-2553-5] Proceedings of 
a NATO Advanced Research Workshop held in Il 
Ciccco, Italy, in October 1986. AWR 


Control Theory, P. Flow Control of Congested 
Networks. Ed: Amedeo R. Odoni, Lucio Bianco, 
Giorgio Szego. NATO ASI Ser. F, V. 38. Springer- 
Verlag, 1987, xii + 355 pp, $69.50. [ISBN: 0-387- 
18398-1] A collection of papers presented during the 
NATO Advanced Research Workshop on flow con- 
trol of congested networks, the case of data process- 
ing and transportation, held in Capri, Italy, October 
12-18, 1986. Papers focus on current research in flow 
control methodologies and their application to con- 
gestion reduction within data communication net- 
works, transportation networks, and air traffic con- 
trol systems. PS 


Systems Theory, P. Information Systems: Fail- 
ure Analysis. Ed: John A. Wise, Anthony 
Debons. NATO ASI Ser. F, V. 32. Springer-Verlag, 
1987, xv + 338 pp, $59.50. [ISBN: 0-387-17800-7] 
Twenty-nine papers concerning the failure of infor- 
mation systems. The material is intended to pro- 
vide guidelines for both the design of information 
systems and the prevention of future failures. Dis- 
cusses methodologies for the investigation of infor- 
mation system failures. Examples of well-known in- 
formation systems failures are the accident at Three 
Mile Island and the space shuttle Challenger explo- 
sion. SM 


Systems Theory, T(17: 1), P. Introduction 
to Queueing Networks. E. Gelenbe, G. Pujolle. 
Transl: J.C.C. Nelson. Wiley, 1987, xiv + 177 pp, 
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$29.95. [ISBN: 0-471-90464-3] A textbook introduc- 
ing queueing networks. Begins with standard simple 
queues, gradually increasing generality. Examples 
from data transmission networks. A few problems, 
most with solutions. SM 


Probability, P. Lecture Notes in Mathematics- 
1247: Séminaire de Probabilités XXI. Ed: J. Azéma, 
P.A. Meyer, M. Yor. Springer-Verlag, 1987, iv + 579 
pp, $54.40 (P). [ISBN: 0-387-17768-X] Papers col- 
lected from Institut de Mathematiques, Universite de 
Strasbourg. Over half of the papers are in French. 
Topics covered include homogeneous chaos, construc- 
tion of the Malliavin operator on Poisson space, sta- 
tionary Markov spaces, approximation of predictable 
characteristics of processes with filtration. MZ 


Stochastic Processes, P. Stochastic Differential 
Systems, Stochastic Control Theory and Applica- 
ttons. Ed: Wendell Fleming, Pierre-Louis Li- 
ons. IMA, V. 10. Springer-Verlag, 1988, xiii + 609 
pp, $49.80. [ISBN: 0-387-96641-2] Proceedings of 
a workshop held at the Institute for Mathematics 
and Its Applications (University of Minnesota), June 
1986. Topics addressed include stochastic schedul- 
ing, applications to problems of computer networks, 
and stochastic gradient algorithms. MR 


Stochastic Processes, T(17: 1), P. Correlation 
Theory of Stationary and Related Random Func- 
tions. A.M. Yaglom. Ser. in Stat. Springer- 
Verlag. Volume I: Basic Results, 1986, xiv + 526 
pp, $58 [ISBN: 0-387-96268-9]; Volume II: Supple- 
mentary Notes and References, 1987, vii + 258 pp, 
$56.50. (ISBN: 0-387-96331-6] Volume I presents the 
“most important part of the theory of stationary and 
related random functions dealing only with first and 
second moments of these functions,” emphasizing ap- 
plied aspects of the theory rather than proofs. No 
exercises. Volume II (note price) contains a diverse 
set of notes, ranging from historical comments to de- 
tailed proofs, together with a more complete bibliog- 
raphy of both theoretical and applied references. No 
exercises. RSK 


Stochastic Processes, T(17-18: 1). Introduction 
to Random Processes. Yurii A. Rozanov. Transl: 
Birgit Rothinger. Springer-Verlag, 1987, viii + 117 
pp, $35. (ISBN: 0-387-17874-0] Begins with a study 
of homogeneous Markov processes with a countable 
number of states, followed by an outline of the foun- 
dations of stochastic analysis. Studies the problem: 
“Given some relatively simple characteristics of a 
processy compute the probability of another event 
which may be very complicated; or estimate a ran- 
dom variable which is related to the behavior of the 
process.” Introduces concepts by referring to math- 
ematical models that are simple, but important in 
applications. RH 


Statistics, P. Modern Statistical Selection, Part 1. 
Ed: M. Haseeb Rizvi, et al. Amer. J. of Math. & 
Manag. Sci., V. 5, Nos. 3 & 4. American Sciences 
Pr, 1985, 201 pp, $49.75 (P). A collection of papers 
discussing the development of statistical ranking and 
selection. SM 
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Statistics, T(17-18), P. Tensor Methods in Statis- 
tics. Peter McCullagh. Chapman & Hall, 1987, xv 
+ 285 pp, $35. [ISBN: 0-412-27480-9] Text explains 
how index notation and tensors can be utilized to 
simplify calculations in statistics. Topics include cu- 
mulants, Edgeworth series, saddle-point approxima- 
tion, and likelihood functions. LC 


Computer Literacy, S*, L**. Portraits in Sili- 
con. Robert Slater. MIT Pr, 1987, xiv + 374 pp, 
$24.95. [ISBN: 0-262-19262-4] 31 biographical vi- 
gnettes tracing the opportunities, frustrations, moti- 
vations, and accomplishments of some of the pioneers 
of the computer age (Turing, Perot, Noyce, Cray, 
Hopper, Kemeny, Jobs, Knuth, ...). Engagingly 
written in clear yet authentic terms; based largely 
on personal interviews by the author, a professional 
reporter and biographer. LAS 


Programming, T*(13: 1), S, L. Pascal: An In- 
troduction to the Art and Science of Programming, 
Second Edition. Walter J. Savitch. Struct. Prog. 
Benjamin/Cummings, 1987, xxv + 769 pp, $27.95 
(P). (ISBN: 0-8053-8388-3] This book has been 
thoroughly rewritten (First Edition, TR, November 
1986). The basic topic groupings and the early and 
continued use of procedures remain even though en- 
tire chapters have disappeared and have been re- 
placed with new ones. Specifically, expanded topics 
include problem solving, debugging, programming 
techniques, arrays, and recursion. CEC 


Languages, T(17-18: 1), S, P, L. Understand- 
ing Language: Man or Machine. John A. Moyne. 
Found. of Comput. Sci. Plenum Pr, 1985, xvi + 357 
pp, $49.50. [ISBN: 0-306-41970-X] A graduate text 
for computer scientists and linguists interested in 
natural language processing in artificial intelligence. 
Focus on linguistic, psycholinguistic prerequisites, is- 
sues in human linguistic behavior, rather than on 
extant models and algorithms. Preliminaries; gram- 
mars and parsing; semantics; comments on artificial 
intelligence approaches; lexical processing; syntactic 
processing. Self-contained, no specific prerequisites; 
exercises, attention to references. RB 


Algorithms, S. Algorithms for Games. G.M. 
Adelson-Velsky, V.L. Arlazarov, M.V. Donskoy. 
Springer-Verlag, 1988, x + 197 pp, $59.50. [ISBN: 0- 
387-96629-3] Two-person, zero-sum games with com- 
plete information; chess is the model game; a-@ prun- 
ing, heuristics, analogical reasoning. Preliminary re- 
sults on a probabilistic approach. Translation of a 
1978 Russian publication. RWN 


Computer Systems, P. Distributed Operating Sys- 
tems: Theory and Practice. Ed: Yakup Paker, 
Jean-Pierre Banatre, Muslim Bozyigit. NATO ASI 
Ser. F, V. 28. Springer-Verlag, 1987, x + 379 pp, 
$77.50. [ISBN: 0-387-17699-3] Proceedings of NATO 
Advanced Study Institute on distributed operating 
systems, Altinyunus, Turkey, August 1986. Fourteen 
papers concerning formal aspects of concurrent sys- 
tems; design issues for distributed operating systems; 
hardware support for distributed operating systems; 
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case studies of existing research and commercial im- 
plementations. RB 


Computer Systems, S(14-15). Introduction to 
Real-time Software Design. S.T. Allworth, R.N. Zo- 
bel. Springer-Verlag, 1987, xiii + 287 pp, $28 (P). 
[ISBN: 0-387-91307-6] This Second Edition updates 
and expands on the First Edition, adding interfaces, 
real-time languages, distributed systems, and signal 
processing. It is divided into three parts: the real- 
time virtual machine, designing the processes, and 
designing and measuring the system. Each chapter 
has a summary and a list of concepts. It presents a 
structured design of software, concentrating on basic 
principles, and is language independent. Includes a 
list of references, a bibliography, and a comprehen- 
sive glossary. RSF 


Computer Science, P. Lecture Notes in Computer 
Science-289: ESEC ’87. Ed: H.K. Nichols, D. Simp- 
son. Springer-Verlag, 1987, xii + 404 pp, $33.30 (P). 
[ISBN: 0-387-18712-X] Proceedings of the first Euro- 
pean software engineering conference held at Stras- 
bourg in September 1987. Forty papers covering a 
wide range of topics from formal methods to appli- 
cation systems. PS 


Applications, T??, L??. Modern Methods of Mu- 
sic Analysis Using Computers. R.M. Mason. School- 
house Pr (46 Mountain View Dr., Peterborough, NH 
03458), 1985, 299 pp, $44.35. [ISBN: 0-9615669-0-6] 
“Using essentially only the elementary mathematics 
of vectors and complex numbers,” this publication is 
offered as a text for a course in music analysis by 
computers or a graduate seminar in “analytic tonal- 
ity,” simultaneously as a profesional-level reference 
for computer scientists interested in musical applica- 
tions, and as a tutorial-level exposition for the ad- 
vanced music student and the serious computer hob- 
byist. Algorithms expressed in Reverse Polish Nota- 
tion. RB 


Applications, P. Modelling and Analysis in Arms 
Control. Ed: Rudolf Avenhaus, Reiner K. Huber, 
John D. Kettelle. NATO ASI Ser. F, V. 26. Springer- 
Verlag, 1986, viii + 488 pp, $96.50. [ISBN: 0-387- 
17174-6] Analyzes arms control from a game theo- 
retic/dynamical systems point of view. Four parts: 
history, pre-arms control assessment of options, ne- 
gotiations, verification. SM 


Applications (Biological Science), P. Lecture 
Notes tn Biomathematics-73: Applications of Con- 
trol Theory in Ecology. Ed: Y. Cohen. Springer- 
Verlag, 1987, vii + 101 pp, $16.30 (P). [ISBN: 0-387- 
18104-0] Proceedings of the symposium on optimal 
control theory held at the State University of New 
York, Syracuse, on August 10-16, 1986. Five papers 
using the techniques of control theory to study bio- 
logical systems. SM 


Applications (Cognitive Science), P. Computa- 
ttonal Models of Learning. Ed: Leonard Bolc. Sym- 
bolic Computat. Springer-Verlag, 1987, ix + 208 
pp, $45. [ISBN: 0-387-16318-2] A collection of essays 
on selected problems in machine learning. Includes 
learning strategies for expert systems, empirical dis- 
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Growth in Gaussian Elimination 


JANE DAY AND BRIAN PETERSON, San Jose State University 
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and Ph.D. in 1964. After two postdoctoral years at Florida, she visited the 
Institute for Advanced Study in 1966-67, as an AAUW Fellow; from 
1967-81, she taught at the College of Notre Dame in Belmont, CA, and has 
been at San Jose State University since. 


BRIAN PETERSON received his B.A. at San Jose State University in 1972 and 
his Ph.D. at UC Berkeley in 1976. He taught for two years at Rutgers 
University and has been at San Jose State University since 1978. 


Dedicated to the memory of James H. Wilkinson, 1919-1986 


1. Introduction. This article is about a conjecture made by J. H. Wilkinson 
during some of his early work in numerical analysis (see Wilkinson [1961]). We will 
discuss what has been proved about the problem, extend some of those results, and 
provide examples that destroy some seemingly natural conjectures about his asser- 
tion and raise some interesting new questions. 

Wilkinson’s conjecture is very intriguing—easy to state, soon believed, and 
apparently very difficult to resolve. Fix a positive integer n, let A =[a,,] be an 
n X n matrix and reduce A to upper triangular form by Gaussian elimination with 
complete pivoting (precisely defined in (2.1)). Let A“ = [a{] denote the matrix 
obtained after the first k pivoting operations, so A‘"~) is the final upper triangular 
matrix. A diagonal entry of that final matrix will be called a pivot. Wilkinson 
conjectured that g(A) <n for all real n X n matrices A, where 


g(A) = ( max |a{|} /( maxja,,), 


It follows easily from the definition of complete pivoting that g(A4) always occurs at 
some pivot. 

We are interested here only in real matrices; in fact, for an n X n complex matrix 
A, g(A) can exceed n (Tornheim [1965], [1970]). Assume henceforth that all 
matrices are real. 

It is known that the conjectured bound for growth will be sharp if indeed the 
conjecture is true, because an n X n Hadamard matrix A has its last pivot equal to 
+n (see Section 5). Furthermore, there is considerable evidence that, when an 
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n X n Hadamard matrix is reduced by Gaussian elimination with complete pivoting, 
no pivot before the last one can have magnitude exceeding n: earlier authors showed 
that the magnitudes of the last three pivots are n/2, n/2 and n; and we extend that 
in Section 5, proving that none of the last six pivots can have magnitude greater 
than n. However, unlikely as it seems, so far as is known now some still earlier pivot 
might have magnitude greater than n. 

In Section 5 we will also establish values for the first five pivots of any 
Hadamard matrix reduced by complete pivoting, and will show that g(A) does 
equal n for a certain class of n X n Hadamard matrices A. 

It will be useful to have the following notation: 


g(n) = sup{ g(A)|A an n X n matrix}. 


In this notation, Wilkinson’s conjecture says that g(n) <n for all n. We will call 
the problem of determining g(7) the growth problem. 

Section 2 contains essential classical background material, various algebraic and 
geometric interpretations of Gaussian Elimination with complete pivoting, and 
results of ours which justify use of the nonlinear optimization software package 
NPSOL [Gill et al., 1983]. Our theorems include showing that the bound g(n) is 
achieved for some matrix in a certain set of nm X n matrices and that it occurs at the 
last pivot of that matrix. 

Section 6 is a report on the examples we obtained using NPSOL, and the new 
questions they suggest. We attempted to achieve the bound g(n) for n < 7. We also 
used NPSOL to test the conjecture that g(n) will occur in an orthogonal matrix for 
all n. This seems plausible at first because g(n) does appear to occur in Hadamard 
matrices in the sizes where they exist, and the natural analogue of Hadamard 
matrices in other sizes would seem to be orthogonal matrices. However, the results 
indicate that conjecture is almost certainly false for n = 5 and n = 7. 

In Section 3 we will discuss the best general bound for g(n) known so far, 


g(n)< [n 2 31/2 tae n/n 


This bound is due to Wilkinson [1961] and is much larger than his conjectured n; we 
will explain why his method of proof could not yield a considerably better bound. 

To evaluate g(n) directly is a constrained optimization problem with a nonlinear 
objective function and many nonlinear constraints. (See Section 6.) While it is easy 
to see that g(1) = 1 and g(2) = 2, for all n > 2, even n = 3 or 4, it seems quite 
difficult to establish a value or close bound. Cryer [1968], Cohen [1974], and 
Tornheim [1964], [1965], [1969], [1970] proved that g(3) = 2.25 and g9(4) = 4, and 
Tornheim also showed that 9(5) < 417/18. Their methods are algebraic, with many 
cases, and would be prohibitively tedious for large n. Tornheim invokes the 
Kuhri-Tucker theory for nonlinear constrained optimization in one proof, but the 
argument is still very involved. 

In Section 4, we give our own proof that g(3) = 2.25, which uses geometry to 
make various cases and inequalities easier to see. Unfortunately this geometric 
viewpoint does not appear adaptable to larger n either. Apparently the value of 
g(n) has not been shown to be bounded by n for any single value of n > 6. 

One of the curious frustrations of the growth problem is that it is quite difficult 
to construct any examples of n X n matrices A other than Hadamards for which 
g(A) is even close to n. Wilkinson has remarked that in real-world problems, g(A) 
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has never been observed to be very large (p. 214, Wilkinson [1965]). C. W. Cryer 
[1968] did numerical experiments in which he computed g(A), doing complete 
pivoting on n Xn matrices A with entries chosen randomly from the interval 
[—1,1] and for sizes up to n = 8. He had to generate over 50,000 3 x 3 examples 
before finding one with g(A) > 2; and although it is true that ¢(5) > 4, 2(8) > 8, 
and g(n) is nondecreasing, every example he obtained through n = 8 had g(A) < 
2.8348. 

To complete the introduction, we will indicate briefly why complete pivoting 
matters in numerical analysis, and how g( A) gets involved. To solve a linear system 
Ax = b using the usual Gaussian elimination algorithm (not doing complete pivot- 
ing), one simply chooses the first available nonzero entry in rows and columns 
k,...,n for the kth pivot, exchanging rows and columns if necessary to put that 
entry into the (k, k) position. When Wilkinson invented backward error analysis for 
this algorithm, g(A) appeared in the bound for relative error. To sketch how, 
suppose that t-digit rounding arithmetic in base B is used, and assume that A is 
invertible. Let x denote the theoretically correct solution and y the calculated 
solution. Wilkinson proved that there is an error matrix E for which (A + E)y = b 
+ e, and each |e; ,| and |e,| is bounded above by cg(A) where c is about 8'~'. Thus 
y is the exact solution to a nearby problem if g(A) is not large. Using any matrix 
norm consistent with matrix product, such as the oo-norm, it is possible to show 
that the relative error in y is bounded thus: 

lly — xl 4 
<A MIMAIIEI 4, 
III 
where d is not alarmingly large and ||E|| depends of course on the size of the entries 
in E, hence on g(A). 

Now complete pivoting means this: after k — 1 pivot steps, choose the kth pivot 
to have maximum magnitude among all entires in the last n — k + 1 rows and 
columns. Such a choice minimizes round-off error at the kth pivot step, which 
seems sensible. Thus the question of how large g(A) can be when complete pivoting 
is done became of interest. Details can be found in (Sec. 24-27, Wilkinson [1965]) 
or (Sec. 4.2—4.4, Golub and Van Loan [1985]). 

Many decades of computation have passed since that error analysis was first 
done; all experience has shown that || || 1s not large, and that when relative error is 
unacceptable for a problem Ax = b, it is not because of growth but because the 
condition number ||A~1|| ||.A|| of A is large. In fact, in practice this is true even if 
only partial pivoting is done (see Definition (2.1)). Thus the question of finding a 
better bound for g(n) has not been an issue in numerical analysis for many years.* 
Yet the mathematical problem remains tantalizing. It led these authors to explore a 
number of interesting ideas, including some lovely old determinant theory which is 
not widely known today. What follows is a report of what we learned, and some 
new results about g(n). 


2. Notation and basic results. The following establishes precise notation for the 
algorithm commonly called complete pivoting, and for the growth associated with it. 


*However, see Trefethen [1985] for some provocative questions about this and other matters long 
considered settled. 
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(2.1) Definitions. Let B[k] denote the lower right k X k principal submatrix of a 
matrix B. | 

Let A =[a,,] denote a nonzero n X n real matrix. To reduce A by Gaussian 
Elimination with complete pivoting (GECP) is to perform steps (a)-(c): 

(a) Search column 1 from row 1 to n, then columns 2,..., in the same way, for 
an entry with maximum magnitude. Using the first such a,, found, exchange rows 1 
and i and columns 1 and /; if necessary to put it into the (1,1) position. Let 
A® = [a{] denote the new matrix. 

(b) Pivot on a())—that is, add multiples of row 1 to the rows below it to create 
zeroes in their column 1 positions. Let A = [a{?] be the new matrix obtained. 

(c) In the same way for each k = 2,...,n — 1, search A“%*~[n — k + 1] to find 
the maximum magnitude of an entry, and exchange rows and columns of A if 
necessary to put the first such entry found into the (k, k) position. If it is not zero, 
pivot on it, to obtain A“; otherwise, let A“ = A“), Thus the final upper 
triangular matrix can be denoted thus: 


0 0 0 
asp ayy + ay, 
1 1 
Aad = 0 as) an aS? 
0 0 
0 0 eee gi") 


As mentioned in Section 1, the diagonal entries a{*~” are called the pivots. It is 
true that A is invertible if and only if every pivot is nonzero.” 
Using this notation, 


g(A) = max |a{?|/la? 
i,j,k 


We will call g(A) the growth associated with GECP on A. 

By the definition of GECP, the magnitude of some pivot will be g(A)—but it 
need not be the last pivot. However, we will prove below that there is an n X n 
matrix on which g(A) has the maximum value g(n) and for which g(A) does occur 
at the last pivot, a‘”~”. 

If one knew ahead of time what row and column exchanges would occur during 
GECP and did them to A first, in the same order, then it is true that GECP on this 
altered matrix would not actually require any exchanges, and the final matrix would 
be the same A“"~. So to discuss growth we can restrict attention to matrices with 
the property that no exchanges are actually needed during GECP. We call such 
matrices completely pivoted (CP). 

The following notation and proposition will be used throughout. 


(2.2) Definition. Let A(i,:--i,|j, --+ j,) denote the determinant of the p xX p 
submatrix of A obtained from the intersection of rows i),...,7, with columns 
Jiy+++> Jp» When the two sets of indices are the same, A(i, --- i,) will abbreviate. 


*If in step (c) one searches only the kth column from row k to row n for the first entry of maximum 
magnitude to use as the kth pivot, that is called Gaussian Elimination with partial pivoting. If one uses the 
first nonzero entry found instead of the first one of maximum magnitude, that is, usual Gaussian 
Elimination. 
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Let adj A denote the matrix with (—1)'/A(1---7--- njl---J +--+ n) as its (J, i) 
entry, where i means “omit i.” As is well known, A~! = (1/det A)(adj A) if A is 
invertible. 


(2.3) PROPOSITION (p. 26, Gantmacher [1959]). Suppose A is CP, invertible, 
GECP is done on A, and 1 <k <n. Then ifk < i, j, 


a) = A(L +++ kil» kjf)/A(L «+> k). 


Thus the kth pivot a\k— is 1/x where x is the (k, k) entry of the inverse of the k X k 
leading principal submatrix of A; in particular, the last pivot a‘"~) is the reciprocal of 
an entry of A~}. 


Proof. The determinant of a matrix is unchanged by adding multiples of a row to 
other rows. Because of this, and because no exchanges will be done during GECP, 
the k X k leading principal submatrices of A and A‘~» have the same determi- 
nant; that is, A(1--- k) =afPa --- al¥-. Since A is invertible and CP, 
A(1...k) # 0 and this is the denominator of the quotient in the theorem. 

Now adjoin k + 1 entries of row i and column j to get the submatrix of A 
whose determinant is A(1--- kijl--- kj). Just as above, this equals aa{) --- 
ati a. This is the numerator of the quotient in the theorem, hence the equality 
is proved. 

Finally, if B denotes the k X k leading principal submatrix of A, then the (k, k) 
entry of B7! is B(1---(k—1))/B(.-:- k) =A(l---(k-1))/A(-:: k) and 
this equals 1/a(*~» by the preceding. O 

The above result shows that GECP can be interpreted thus: having chosen the 
first k — 1 rows and columns, so that the (k — 1) X (k — 1) leading principal 
subdeterminant is determined, choose the kth row and column so that the k x k 
leading principal subdeterminant will have maximum possible magnitude. 

An alternative interpretation is this: having chosen the first k — 1 rows and 
columns, choose the kth pivot so that |det A“ [n — k]| is minimal—for |a(‘~| is 
maximal among the entries of A“*~?[n — k + 1] and 


k-1 
AC-DIn—-k 41] = ee ran aan 
0 | AM[n-k] 

For geometric interpretations of GECP, recall that the magnitude of the determi- 
nant of a k X k matrix is the k-volume of the parallelepiped in R* which is 
spanned by its rows. The obvious geometric interpretation of GECP then is that it 
means “after the (k — 1)st pivot step, choose the kth row and column so as to 
maximize the k-volume of the parallelepiped spanned by the rows of the k x k 
leading principal submatrix.” 

Alternatively, GECP also means “minimize the (n — k)-volume of the projected 
(n — k)-dimensional solid at the k th pivot step”: for pivoting on a(*~ amounts to 
projecting the parallelepiped spanned by the rows of A“~)[n — k + 1], by vectors 
parallel to its first row, into the parallelepiped spanned by the rows of A‘ [n — k], 
which is of one lower dimension. 

In Proposition (2.5) below, we give what appears to be a new algebraic interpreta- 
tion of GECP. That theorem, and much else that follows in this paper, depends on a 
formula from Gantmacher [1959], which we state in the next Proposition. While the 
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formula is important in what follows here, its proof is somewhat lengthy and would 
not add insight into our applications. Briefly, the formula follows from two classic 
theorems, one due to A. Cauchy and J. Binet about compound matrices, and the 
general Laplace Expansion for det A. Both of these classic results date from the 
early 19th century, and they are nicely presented in Sec. I.4, Gantmacher [1959]; 
also see Marshall and Olkin [1979]. 


(2.4) PROPOSITION (p. 21, Gantmacher [1959]). Let A be an n Xn invertible 


matrix and letk <n. If i, <i, < +++ <i, is an ordered subset of {1,2,...,n}, let 
iy <i, < +++ <i/_, denote its ordered complement. Then for any k rows and 
columns, 
AV ili cc i) = airy, feces ft lit +++ i7_,) 
1 kl/1 Jk det A Ji In-KN1 n—-k)> 
k 
wherem= ), (i,+j,). O 
r=1 ‘ 


Note (2.4) is a strong generalization of the much better known fact that 
A7} = (1/det A)(adj A). 


(2.5) PROPOSITION. Let A be an n Xn invertible matrix and 1<k<n. Dok 
steps of usual Gaussian Elimination with no row or column exchanges. Then the 
unreduced part of A“ can be described thus: 


AM[n —k] =(A7}[n- k])~. 


So having the rows and columns of A arranged to make it CP means this: for each 
k =1,...,n — 1 the magnitude of the determinant of A~'[n — k] cannot be increased 
by rearranging the last n — k + 1 rows and columns of A. 


Proof. Let i, j > k so that a‘ is an arbitrary entry of A“ [n — k]. Then by 
(2.3) and (2.4) we have 


a) = A(1-++ kifl-++ kj) /A(L +: k) 


: (-1)' (det AVA“'"(K +1) +++ fees nik + 1)--- 7+) n) 
A(1--- k) 
Also by (2.4), 
det(A~4[n —k]) =A71((k +1) +++ n) 
= A(1--- k)/(det A). 


Thus we have 


(- 1) i+] 

(kK) = — UVP 1) FG ... k+1)---Peee 
aij det (A~4[n — k]) (( ) J n\( ) l n), (2.6) 
which is exactly the (i, 7) entry of (A~'[n — k])~! when its rows and columns are 
numbered from k + 1 to n instead of 1 to n — k. This proves that A“ [n — k] has 
the form asserted. As remarked above, to do GECP is to minimize |det A“ [n — k]| 
at each step k, which is thus to maximize |det(A~4[n — k])|. O 
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Tornheim [1970] obtained the formula (2.6) for the diagonal entries only in 
A“){n — k] by direct calculation, not using Gantmacher’s formula. He did not 
observe that this entire unreduced part of A‘ is actually the inverse of a matrix 
easily described, as we have done in (2.5). 

Another idea which will be used throughout the sequel is this: if we divide A by 
|a| before beginning GECP, then GECP on the new matrix will yield the same 
growth. So to discuss growth, we can restrict attention to CP matrices for which this 
has been done. 


(2.7) Definitions. Call A normalized if max, , |a,,| = 1.* Let “CPN” abbreviate 
for “completely pivoted normalized.” Let @, denote the set of all real n X n CPN 
matrices. Note a,, = +1 fora CPN matrix. 


Clearly g(n) = sup{ g(A)|A © @,}. The rest of this Section is devoted to pro- 
ving that g(A) does achieve a maximum at some B in @,, and that g(n) = g(B) 
occurs at the final pivot of some such B. The subtlety involved is that, for some 
matrices A in @,, g(A) will occur at an earlier diagonal entry, yet we will find g(n) 
if we maximize ja" )! over G,. This is important because we needed a well-defined 
objective function for using the optimization package NPSOL to maximize g( A). 

The objective function used in NPSOL also must be smooth; although on the set 
of invertible matrices in @,, a‘”~" is a quotient whose denominator goes to zero as 
A approaches a singular matrix. we will show that |a"~| does achieve a maximum 
at some B in @, and is smooth on a neighborhood of B. 


(2.8) Definitions. Let A € @,. Let h(A) = |a“~ | when A is reduced by GECP, 
and h(n) = sup{h(A)|A € & a Identity the set of n Xn real matrices with R” 
by identifying each A with the n*-tuple obtained by concatenating the rows of A. 


(2.9) PROPOSITION. h(A) is continuous on @,, and smooth on the invertible matrices 
in €,. 6, iS compact, hence h(n) is well defined and does achieve its sup at some 
invertible B in @,. Finally, 1 < h(n) < 2"7! and h(n — 1) < h(n) = g(n). Thus 
g(n) is well defined, does occur at some B in @,, and it occurs as \b™~), 


Proof. On R”’, det A is a polynomial function, hence is smooth. Let A € ©, 
and reduce A by GECP. Then clearly 


Jae P| < 2jatt5._1| for each k > 2. (2.10) 


Hence, h(A) < 2”~"|a(| = 2"7', the last because A is CPN. Thus h(n) exists. 
Letting J denote the identity matrix in @ » h(1) = 1 so we have 1 < h(n) < 2"77. 

Because no row or column exchanges are done during GECP on any A in é,, 
when det A # 0 then a“~ is simply (det A)/A(1 ---(n — 1)). Thus on the open 
set of invertible matrices in @,, a‘”~) is the quotient of nonzero smooth functions, 
hence itself is smooth; it is not zero, hence h(A) = |a‘"~| is also smooth on the 
invertible matrices in @,. 

To show that h(A) is continuous on all of @,, it suffices to prove that h(A) > 0 
if det A > 0. To do this, let A © @, and apply (2.10) to see that |a{*~ | > 
ja") 72"-* for any k, 1 <k <n. Let r= (n— 1)(n — 2)/2. Then, remembering 


*This use of the word “normalized” is ours and not standard; for example, the word has a different 
meaning in discussions of Hadamard matrices such as in Lander [1983]. 
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that af = 1, 
\det A| = java® eee qin-b| > Ja Dyn-1 gr, 


Hence h( A) = |a‘"~ | must go to zero if det A does. 

Next observe that the set of normalized matrices in R” is closed and bounded. 
Also, the set of CP matrices is closed since det A is continuous and A is CP if and 
only if it satisfies 


la,,| > |a,,| for alli, 7 > 1 (2.11) 

and 
JA(1--- k)|>JAQ1---(k—-1)ifl---(kK-1),/)| foralli, j>k>2andk <n 
(2.12) 


Since @, is the intersection of these two sets, @, is closed and bounded, hence 
compact. 

To see that h(n — 1) < h(n), let X be an (n — 1) X (n — 1) matrix which is 
CPN and for which h(X) = h(n — 1). Let 


A= | * 
O| xX 


which is normalized because X is. Since X is CP and no actual pivoting is needed 
on a,,, A is also CP. Because h(X) = h(n — 1) > 1, h(A) = h(X). Thus h(n) > 
h(A) = h(n — 1). 

Finally, we apply the existence and monotonicity of h(n) to prove g(n) = h(n). 
Let A & @,; 9(A) = lal‘ | for some k <n, so g(A) < h(k) < h(n); thus g(A) is 
bounded above, hence g(n) exists and g(n) < h(n). Let B © @, such that h(B) = 
h(n). Then g(n) > g(B) > h(B) = h(n). Thus g(n) = h(B) = h(n) as asserted. 

O 


3. The best general bound known for g(n). By (2.9), g(n) < 2”7! is true for all 
n. A much better bound, though still far from sharp, was proved by Wilkinson in 
[1961]. We will discuss that now, and how it can be mildly improved. Wilkinson 
obtained it by applying an old classic inequality to the determinant of each 
intermediate submatrix A‘~”[k], followed by some clever algebraic manipulations. 
Let a, = max{|det Al: A an n Xn matrix with |a,,| < 1 for all i, 7}. 


(3.1) HADAMARD’S INEQUALITY [Hadamard 1893]. For each n> 1, a, <n”, 
and equality holds if and only if |a; il = 1 for all i, j and the rows of A are “mutually 
orthogonal. O 


ijl < 


It is clear geometrically that Hadamard’s Inequality is true, because |det A| is the 
n-volume of the parallelepiped in R” spanned by the row vectors of A (see Section 
IX.5, Gantmacher [1959]). Such a volume is maximized when the vectors are 
mutually orthogonal and every entry is +1, and matrices with this property have 
since been called Hadamard. 


(3.2) PROPOSITION [Wilkinson 1961]. For every n > 1, 


g(n) < [n 231/72 41/3 _ nism D172 
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Proof. Let A be n X n and reduce A by GECP. Let p,,..., p, be the absolute 
values of the pivots, labeled in reverse of the ordering of rows. Thus g(A) = p,/pP,, 
and the magnitude of every entry of A‘"~”[k] is bounded by p,, so that we have: 


det A’ [Kk] | = pePe-a 0 Pr S PEO, = KI, (3.3) 
Letting g, = log( p,), (3.3) implies 
Eq; < ng, + loe(«,) (3.4) 
i=1 
and 
k-1 
yg, < (k — 1)q, + log(a,), k=2,...,n—1. (3.5) 
i=1 


Divide (3.4) by n — 1 and, for each k = 2,..., n — 1, divide (3.5) by k(k — 1). Add 
all these to get 


n-1k-1 nq, log(«,)  "x!} log(a,) Tk 
— Lat Digs bh * ~jy* —] k(k — 1) k 
i=] k=2 i=1 " " k=2 


(3.6) 


On the left side of (3.6), collect like terms and sum the telescoping series of 
coefficients obtained to see that the left side can be written as: 


gq » 
: + » qi/1 | 
(n - 1) i=1 
Now cancel like terms in (3.6) to obtain 


cee ) 4 x wee 


If one uses k*/? as a bound for every a,, as Wilkinson did, then exponentiation 
now yields, as claimed, 


(Pi/P,) < [n23'/41/3 tee p/n ]17?, - 


It is not hard to get n@°&”)/4 as an estimate for this bound (note the log of the 
expression is a sum which can be approximated by an integral); thus this bound is 
much better than 2”~'. However, it is not sharp: for as Hadamard had observed, a, 
can only possibly equal n”/? when n is 1,2 or a multiple of 4. 

Since Hadamard’s 1893 paper, somewhat smaller bounds for a, have been 
obtained when n is not a multiple of 4. Brenner and Cummings [1972] give a survey 
of these; for example, if n mod 4 = 1, then 


a, < [(n +1)""(n- yr yn], 


There are similar expressions for n mod 4 = 2 or 3. If we use these bounds instead 
of k*/? for all k at the last step of the proof above, we obtain a much less elegant 
and only slightly smaller bound for g(n). Worse, it is still far from sharp. 

Indeed, the approach of the above proof cannot yield a very close bound for g(n) 
—recall the remarks after (2.3) which pointed out that the purpose of complete 


W- as 
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pivoting is to choose A[k] within A[k + 1], after n — k + 1 pivot steps, so as to 
minimize |det A“~[k]|; and, for many values of k, the very structure of matrices 
prevents |det A“~[k]| from having the maximum value pfa,. 

For example, let A be an n X n Hadamard matrix; then A~! = (1/n)A‘, so that 
by (2.5), A“? [k] =(1/n)A'[k])~!. Thus to minimize the magnitude of the 
determinant of the latter matrix is to maximize |det A‘[k]| = |det A[k]]. However, 
no k X k submatrix of A can be Hadamard if n/2 < k < n. For let 1 < k <n and 
let B be a k X k submatrix which is Hadamard. For simplicity, suppose B = A[k]. 
Consider the k(n — k)-vectors which are above B in the last k columns of A. 
Because both A and B have mutually orthogonal columns, those k vectors must be 
mutually orthogonal, hence independent. Thus k < n/2. So if n > 12, there are 
values of k for which n/2 < k <n and k X k Hadamard matrices exist, but A[k] 
will not be such a matrix and thus |det A“~| < p{a,. 


4. A new proof that ¢(3) = 9/4. We will establish the value of g(3) in a more 
geometric way than the proofs given by Cryer [1968], Cohen [1974], and Tornheim 
[1964], [1970]. However, our method seems no more amenable than theirs for 
extension to cases of larger n. 


(4.1) PROPOSITION. g(3) = 9/4. 
Proof. We need only show g(3) < 9/4 since g(A) = 9/4 for 


1311p 
A=|-1 122 1 
1/2 -1 1 


To begin, select any 3 X 3 CPN matrix A with g(A) = g(3). Observe that g(A) 
must occur in the (3,3) position since g(2) is clearly 2. We will make several 
modifications to A, if necessary, allowing us to assume it to have a rather special 
form that will reduce the number of cases and diagrams required for the argument. 
The ith row of A will be denoted by r,, and the ith row of each A“ by r{). 

First, we cannot have r, = (1, 0,0), for if that were so then 9(3) = g(A) < 2. Let 
j = 2 or 3 be such that |a,,| = max{|q@j|, |@,3|}. Multiply r, by 1/a,, and multiply 
the first column of A by a,,. Then A will still be CPN, g(A) will be unchanged, and 
we have that r,; = (1,c,1) or r, = (1,1, c). Make c nonnegative by negating the 
second or third column of A if necessary. Now we have 0 < c < 1 and 


1 1 ic 1 cil 
AW = f P : or AY=|10 g r| 
O s ft O ¢f s 
Néxt observe that c # 0: for if c = 0, then |p], |s| < 2 and |g|, |t| < 1; but then 
g(A) = (| pt — gs|/max{| pl, |g|}) < 2, which is false. Thus 0 < c < 1. Interchange 
columns 2 and 3 if necessary so that r, has the form (1,1, c). Then A may no longer 
be CP. To accommodate this, we will allow the possibility that the second pivot 
element in GECP on A might be g = a$} rather than p = a9}, in which case the 
second pivot step will yield r{ = (0, +9(A),0), rather than (0,0, +9(A)). Al- 
though this would differ from Definition (2.1), it should be clear that our approach 
is still general. Whether or not the difference occurs depends on the relative 
magnitudes of a$) and a). 
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Negate r, (and hence r{”) if necessary, so that the second pivot element will be 
positive, whether it be p = a9} or g = a9. 

We will use the fact that the image under a linear map of the convex hull of a 
finite set of points is just the convex hull of the images of those points. The rows r, 
and r, (as well as r,) lie in the convex hull of the eight points (+1, +1, +1), which 
we will refer to as the cube. When we pivot on aj, the transitions r,—> r™ for 
i = 2,3 are both applications of the same linear map (x, y, z) > (0, y — x, Zz — cx), 
so r{? and r{? both lie in the convex hull of the images under this map of the 
points (+1, +1, +1). 

Henceforth we identify any point (0, y,z) in R° with (y,z) in R*. Since 
0<c<1, we see that r{ and r lie in the convex hull H of the six points 
(2,-1+c), (2,1+ cc), (0,1+ cc), (—2,1—c), (—2,-1-—c), and (0,—-1-—-), 
which is the following hexagon. 

We now make the final modification to A, altering r, if necessary so that r{? lies 
on the boundary of H. To see that this is possible, since it is clear that r{? # (0,0), 
if r{ lies in the interior of H we can find A > 1 such that Arf lies on the 
boundary of H. Replace r, by a preimage in the cube of Ar{” under the above 
linear map. Then r{) will lie on the boundary of H, and all the properties we have 
already assumed for A will still hold. 

By our assumptions, r{? = (p,q) lies on one of the line segments numbered 
from 1 to 7 in FIGuRE (4.1). For each of these seven segments, if ( p, g) lies on it we 
can determine the largest possible value for h( A) in terms of p, g, and c. We will 
then maximize this over p and gq, and then finally over c. 


(1 + ¢c,1 4c) 


z-l=cey+) | 
(-2,1 —- c) 
(2,-1+c) 
(1 +c,-1+c?) 
(1, —1) 


z+l=c(y-) 


(2,1 + c) 


FIG. (4.1). 


In more detail, assume ( p, g) is a particular point on line segment number i. If 
1 <i <5 then p = a§) will be the second pivot element in GECP, but if 6 <i < 7 
then g = a) will be the second pivot element. In either case let 4 = max{| pI, la|} 
= max{ p,q} and let S=S, = {(y,z) © R*| |yh |z|< mw}. Then rf? = (s, 2) 
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must lie in HOS. Now HO S is itself the convex hull of its finite set of extreme 
points, which may be given in terms of p,q, and c. This is illustrated in FIGURE 
(4.2) for i = 1, and tabulated for all 1 < i < 7 in the first column of TABLE (4.3). 


Fic. (4.2) 


The second pivot step, which transforms r{ into r{, is again a linear map, 
either (s,t) +> (0,¢t — sg/p) or (s,t) > (s — tp/q,0), depending respectively on 
whether 1 <i < 5 or6 <i< 7. So r@ lies in the convex hull of the images under 
this linear map of the extreme points of HM S. For each value of i, apply this 
linear map to each extreme point of H MS, and so determine the largest possible 
value of h(A) in terms of p,q, and c. The result is given in the third column of 
TABLE (4.3), and in the second column we indicate by an * one of the extreme 
points of H M S which makes this largest possible value of h(A) occur. Since each 
result is linear in p and 4g, it is itself maximized when ( p, q) is one of the endpoints 
of the ith interval. This endpoint is listed in the fourth column of TABLE (4.3), and 
the fifth column gives the corresponding maximum possible value of h(A) in terms 
of c alone. In every case, we see that h(A) < 2 + c(1 — c). Finally, since 2 + cQ — 
c) < 9/4 for all real c, we have shown that h(A) < 9/4. Thus g(3) = h(A) < 9/4 
also, and the proof is complete. O 


5. The growth conjecture for Hadamard matrices. As discussed in Section 3, 
Hadamard showed in [1893] that the maximum possible magnitude for the determi- 
nant of a normalized n X n matrix A is n"/*, and this maximum occurs if and only 
if A has mutually orthogonal rows and each entry +1. Such matrices have since 
been called Hadamard. Hadamard constructed some examples; in particular, he 
observed as Sylvester had done in [1867] that such matrices do exist when n is a 
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TABLE (4.3) 


Segment Extreme points Largest 
containing possible 
h(A) for 
fixed p,q, c¢ 


pe foes a 


p-1 
c 
(p,q),(p,1 + c)*, (0,1 + c) 
(—p,—4),(-p,-1l-o), l+c-—@q (l+c,-l+c?)|2+c(l—c) 
(2,-1 4c), (2,1 4+ .0)*, 
(0,1 + c),(—2,1 — ©), l+c-—q (2,-1 +c) 2 
(—2, -1— cc), (0,.-1 — c) 


(2,1 +a), (2,1 +c), 

(0,1 + c)*,(—2,1 — c), l+c anywhere l+ec 
(—2,-1-c),(0, -1—c) 

(p,1+c),(0,1 4+ c), 

(—p,l1 —c(p— 1))*,(-p,-1l-—c), |2+c¢-e(p-lid+t+e,14c) 2+ c(1-—c) 
(0, -1—c),(p,-1+ c(p — 1)) 

(1 + c,1 +c), (0,1 4 c), 

(-l1-—¢,1-c*),(-l-¢,-l-0o),| (+o + (l+c¢,14+c) 2+ c(1—c) 
(0, -1—c),(1 + ¢,-1 + c*)* (l—c)p 


(p, q),(—4,1 ~ c(q 7 1)), 
(— 4, —4),(-P; —), 
(q,—- 1 + e(q— 1), (4, 9)" 


power of 2. The simplest examples of Hadamard matrices are in these sizes: 


(5.1) Definition. 


1 1 A, My, . 
A, = 5 H, = 5 and in general, Ay, = 


—] 1 — A, HA, —H, H, 


Hadamard showed that, if an n X n Hadamard matrix exists for n > 2 then n is 
a multiple of 4. He constructed 12 X 12 and 20 X 20 examples, and asked what are 
all the values of n for which they exist. It seems generally believed that they do exist 
for every multiple of 4, but that is not known yet; the earliest reference in which we 
found this conjecture is Paley [1933]. The author remarked prophetically that the 
conjecture “seems probable...but the general theorem has every appearance of 
difficulty.” There is a large and growing body of literature on existence and 
applications of Hadamard matrices, and they have been constructed for many 
infinite families of values of n (see for example Lander [1983] and Hall [1986]). 

Hadamard matrices are of interest here because all the evidence indicates that 
they give maximum possible growth in the dimensions where they exist, and no 
other type matrix does. It seems reasonable that, with their many strong properties, 
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it should be possible to prove Wilkinson’s conjecture for them more easily than for 
general matrices. We have not done that, but our results below strengthen the 
evidence that the conjecture is true for them. 

We should emphasize that, even if one knew what growth is for +1 matrices, it is 
not clear how that would help determine g(n) in general: it is easily seen that 
g(3) = 2.25 does not occur in such a matrix. On the other hand, if one could prove 
that g(n) is achieved on a Hadamard matrix in dimensions n where one exists, that 
would be valuable: since they definitely exist for n any power of 2, it would follow 
easily from the monotonicity of g(n) that g(n) < 2n for all n. That is not as good 
as Wilkinson’s conjectured bound n, but much better than the bound given in 
Section 3. 

When Gaussian elimination is done on an n X n Hadamard matrix A, the last 
pivot has magnitude n. This was observed by Tornheim [1964] and Cryer [1968], 
because it is the reciprocal of an entry from A‘ and that equals (1/n)A‘’. Thus 
g(A) > n. Cryer also evaluated the two pivots preceding the last by invoking results 
of Sharpe [1907] which show that the nonzero (n — 2) X (n — 2) subdeterminants 
of an nm X n Hadamard matrix all have the same magnitude, as do the nonzero 
(n — 3) X (n — 3) subdeterminants. Cryer remarked it is unlikely that any earlier 
pivot under GECP could exceed n, hence it seems very probable that g(A) = n. 

Our Proposition (5.5) below provides a new proof and extension of these results 
of Sharpe, Tornheim and Cryer, and strengthens the plausibility of Cryer’s remark. 
The key tool is Gantmacher’s formula, stated in Proposition (2.4) above. As shown 
next, when A! is proportional to A‘, in particular when A is Hadamard, that 
formula provides a proportionality between any subdeterminant of A and its 
complementary subdeterminant in A. Thus the possible values of the largest 
subdeterminants of a Hadamard matrix can be evaluated in terms of small ones, 
whose values are easy to list. 


(5.2) PROPOSITION. Let A be an invertible n X n matrix for which AA‘ = cI. Then 
A7!=(1/c)A' and detA = c"/*. If 1<k <n, then 


A(12--- k) = +0e*°"7A((k +.1)--- n) = +c*° "det A[n — k]. 
Proof. The (1,1) entry of AA‘ is the square of the Euclidean length of row 1 of 


A, hence c > 0. Since AA‘ = cI, A~'=(1/c)A‘ and also (det A)* = c”, hence 
det A = +c"/*. Let X = (1/c) A‘, so that 


X(12--- k) =c"*A(12--- k) 
and by (2.4) we also have 
X(12--+k) = 4e°"7A((K + 1)-+- 1). 


Two more preliminary results will be useful. The next one includes a special 
interpretation of what Wilkinson’s conjecture means for Hadamard matrices. 


(5.3) PROPOSITION. Let A be an n Xn Hadamard matrix and reduce A by 
Gaussian Elimination. No matter how pivots are chosen, after k pivot steps, the 
unreduced submatrix of A‘ satisfies 


AW ln —k] =n(A‘[n- k])~’. 
Thus, when GECP is done, Wilkinson’s conjecture that g(A) <n is true if and only if, 
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for every p X p lower right submatrix A{ p], the magnitude of every entry of (A[ p])~' 
is less than or equal tol, p =1,..., Nn. 


Proof. Since A~! = (1/n)A', A7'[n — k] = 1/n)A'[n — k]; this with (2.5) im- 
plies A“ [n — k] =(A ln -— k))7! = n( An — k))7t. 


We need the following facts. The first two are simple and known; see, for 
example, Brenner and Cummings [1972] or MacWilliams and Sloane [1977]. 


(5.4) PROPOSITION. Let B be m X mand b,, = +1 for alli, j. Then 

(i) det B is an integer and 2™~' divides det B. 

(ii) When m < 6, the only possible values for |det B| are these, and they do all 
occur: 


1 | 0,2 | 0,4 | 0,8, 16 | 0,16, 32, 48 | 0, 32, 64, 96, 128, 160 


ldet B| 


(111) Suppose that det B # 0 and every (m — 1) X (m — 1) subdeterminant of B is 
zero or has the minimum possible nonzero magnitude, 2” ~*. Then every nonzero 
(m — 2) X (m — 2) subdeterminant of B has magnitude 2™~? or 2”~?. 


Proof. To prove (i) and (11), do one step of Gaussian Elimination on B. The new 
matrix B® has only +1 in row 1 and only 0 or +2 in rows 2 through m. Thus 
det B = 2”~ ‘(det C), and C is a matrix having only 0 and +1 entries so det C is an 
integer. The values listed for |det B| in each category m < 5 above are the multiples 
of 2”~! less than or equal to m™/?, that being an upper bound by Hadamard’s 
Inequality (3.1). For m = 6, every such value except 192 is listed, which is known 
not to occur (see Brenner and Cummings [1972]). It is not difficult to construct 
examples of each size listed. 

For (iii), suppose that det B # 0 and every nonzero (m — 1) X (m — 1) subde- 
terminant of B has magnitude 2”~?. Then B™! = (2"~7/(det B))C, where every 
entry of C is 0 or +1. Choose any i,, i,. Then clearly |C(i,i,)| = 0,1 or 2 so 


\B-(i,i,)| = (2"-2/(det B))'d, d= 0,1or2. 
By (2.4), we also have 
BoN(iqi) = (1/(det B)) B(is +++ i). 


So B(ij--: i/,) is an arbitrary (m — 2) X (m — 2) subdeterminant of B, and if it is 
not 0, then 


J2m-4 
ldet B| = ————__d, d=1or2. 
B(is +++ in) 
Since det B must be a multiple of 2”~', and B(ij --- i’,) is a multiple of 2”~°, 
it follows that |B(if --- i/,)]| can only equal 2”~? or 2”~?. 


The next result provides a new proof and extends to more cases the results of 
Sharpe [1907] and Cryer [1986] discussed above. 


(5.5) PROPOSITION. Let A be an n X n Hadamard, and reduce A by Gaussian 
Elimination, not necessarily with complete pivoting. Let D = n"/*, For simplicity of 
notation, suppose no row or column exchanges are needed. Then the possible values for 
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|A(1--+ k)| and jaS*~?| when kk > n — 6 are: 


TABLE (5.6) 


n 
(1/n)D n/2 
(2/n*)D n/2 
(4/n?)D n/2orn/4 


(q/n*D, q = 8,16 np, p = 1,1/2,1/4,1/3, 1/6 
(q/n>)D, q = 16,32, 48 np, p = 3/2,1,3/4,1/2, 3/8, 1/3, 3/10, 
1/4, 1/5,1/8,1/10 


(q/n°)D, q = 32,64, 96, 128, 160 


In fact, the above values for |AQ.-:- k)| are all Possibie values for the magnitude 
of any nonzero k X k subdeterminant of A,k =n — 6,...,N. 
If complete pivoting is done, then la 3-5 | cannot be (3/2)n and thus the 


magnitudes of the six last pivots for GECP are at most n, n, n/2, n/2, n/2, and n, 
respectively. 


Proof. Note that A(1-:- k) #0 since Gaussian Elimination can be completed 
without row or column exchanges. Then det A[n — k] = A((k + 1)--+ n) cannot 
be zero for any k because A is Hadamard, by (5.2). We will use these facts without 
further comment. 

Since |A(1--- k)| =n*~"/*\det A[n — k]]| = Dn*~"\det A[n — k]|, the values for 
determinants of 6 X 6 + 1 matrices given in Proposition (5.3) imply that 
|A(1 ---(n — 6))| = Dn °p, where p = 32, 64, 96, 128, or 160. In the same way, we 
can obtain all the sossible values for |A(V_:-: k)|, kK =n —5,:--,n, and these are 
all listed in Table ©.6). 

Now af’) = A(1--- k)/A(1--+(k — 1)) so the only possible values for that 
are the quotients obtained from the previous lists; for cxampr ja(k% _ | could 
only equal ng/p where gq = 16,32 or 48 and p = 32,64,...,160. In the same way 
we can calculate all possible values for |a{k~)|, k =n — 4 ...,”. All such quo- 
tients are listed in TABLE (5.6). The above values apply to any nonzero k X k 
subdeterminant of A, because any one could be put into the leading principal k X k 
position originally; while some exchanges might be needed within the first k rows 
during Gaussian Elimination, this would not change the value of this subdetermi- 
nant. 

Suppose now that complete pivoting is done, i.e., that A is CP. We will show that 
ja("9 _ 5] cannot equal n(48/32), which is the only one of all the possible values 
for the last six pivots which would violate Wilkinson’s conjecture. 

By way of contradiction, suppose that |det A[6]| = 32 and |det A[5]| = 48. There 
is actually a 6 X 6 +1 matrix like that, but we will see that it cannot be the lower 
right principal 6 X 6 submatrix of a CP Hadamard matrix. 

Consider A[7], the lower right 7 < 7 principal submatrix of A. We claim that A 
being CP and |det A[6]| = 32 implies that every nonzero 6 X 6 subdeterminant of . 
A[7| has the minimum possible magnitude 32. For if any one were larger, we could 
rearrange only the last seven rows and columns of A so the new lower right 
principal 6 X 6 subdeterminant of A is larger than 32; then the complementary 
subdeterminant of this in A is larger than the one originally in that position, by 
(5.2). Thus the quotient of subdeterminants of A which yields a(”3°_. is larger 
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than the original such quotient, which is contradictory since A was assumed to be 
CP. This allows us to apply (5.4) (iii) to conclude that each nonzero 5 x 5 
subdeterminant of A[7] has magnitude 16 or 32. Thus |det A[5]| cannot be 48. 

We believe that more of the values listed above for pivots are impossible when 
GECP is done on a Hadamard. In particular, we conjecture that every pivot before 
|a‘"—| will have magnitude at most n/2. We also believe that the fourth from the 
last pivot can only have magnitude n/4, not n/2. The latter conjecture developed as 
we experimented to create different pivot patterns with Hadamard matrices. We 
always saw +n/4 as the fourth from last pivot, even when we tried to force +n/2 
there. Also, we suspect that there is some connection between the possible values for 
the kth and kth from last pivots when GECP is done; if so, since the fourth pivot 
has to be +4 by Proposition (5.8) below, it does not seem unreasonable that the 
fourth from last should also have a unique possible magnitude. 

The next well known result is usually proved by describing the sign patterns in 
rows, much as we do in the first part of the proof of (5.8) below. However, it is an 
easy consequence of Propositions (5.4) and (3.1). 


(5.7) PROPOSITION (Hadamard [1893]). If A is an n X n Hadamard matrix and 
n> 2, then n is a multiple of 4. 


Proof. Since n>2, n is even by (5.4). Let n=2k, k>1. By (3.1), 
det A = (2k)* and this is divisible by 2?*~! by (5.4) (i). The quotient is k*/2(*~, 
which can be integral only if k is even. 


(5.8) PROPOSITION. Let n> 4 and A be an n Xn Hadamard. Reduce A by 
GECP. Then the magnitudes of the first four pivots are 1,2,2 and 4; and ifn > 4, 
|aQ2| is 2 or 3. 


Proof. We may suppose that A is CP. Clearly |a{?| = 1, and every entry of AW 
is O or +2. Thus a$} = +2 and every entry of A® must be 0, +2 or +4. Because 
2(3) < 4, there cannot be any 4’s in A®, Hence every entry of A® is 0 or +2 so 
a must be +2, and then every entry of A® is 0, +2 or +4. 

We will now show that there is an entry +4 in the last n — 3 rows and columns 
of A®), hence |a¥)| has to be 4 because A is completely pivoted. 

By (5.7), n = 4p for some p > 1. Without destroying the CP property of A, we 
can change signs of rows and columns to make the first row and column positive. 
Then each column of A has one of the following four sign patterns in its first three 
entries: 


I I Wl IV 
; 2 gf f (5.9) 
+ + = = 


Further, the mutual orthogonality of the first three rows of A implies that there 
are exactly p columns of each type. Since A(123) is not zero, the first three columns 
of A must be of three different column types. 

Choose any column / of the fourth type not represented among columns 1, 2 and 
3. There are p of these, so at least one exists. For the purpose of finding a row i so 
that A(1237|1237) = +16, it does not matter if columns 2,3 and j are rearranged; 
thus we may assume that A has the pattern (5.9) in its first three rows and the four 
columns 1, 2, 3, /. 
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Rows can be divided into the same four types of groups of p each as columns 
can, so there are p rows having pattern + — — in the first three entries, and all 
these lie below the first three rows. At least one of these rows must have a,,= +1: 
for otherwise column ; of A has more than p negative entries, which contradicts its 
being orthogonal to column 1. Thus A(123i|1237) is the determinant of the 
following matrix, and that is 16: 


1 1 1 1 
1 -1 1 —-1 
1 1 -1 -l 


1 -1 -l 1 
Then af) = A(123j(123i)/A(123) = 416/4 = +4 as desired. 

Now let n > 4 and let us examine the fifth pivot. Because every entry in A®) is 
of magnitude 0,2 or 4, pivoting on a§) will only involve adding +1 or +(1/2) 
times the fourth row of A®) to the rows below, and this will create only integer 
entries in A, Thus |a{2j must be 1,2 or 3 in order to be an integer and for 
|A(12345)| = 16|a§2| to be less than 5°/*. It will be 2 for the 8 x 8 Hadamard H, 
defined in (5.1); and it will be 3 if GECP is done on a 12 X 12 Hadamard such as 
the one in [Paley 1933]. To see that it cannot be 1 is to show that one could not have 


0 B 


where every entry of B is zero or +1: for if that were true, then B would be a 
normalized (n — 4) X (n — 4) matrix, so |det B] < (n — 4)"~%” by (3.1). But 
ldet B| = (n"/*)/16, and it is easily checked that these cannot both hold when 
n> 4. 

We will now show that Wilkinson’s conjecture does hold for one class of 
Hadamard matrices, those defined in (5.1). Tornheim proved this in [1970] and we 
rediscovered it. Proposition (5.12) is a mild generalization of his theorem and stated 
in our notation. A definition is needed first. 


(5.11) Definition.Let A be n X p and let B be any rectangular matrix. The 
Kronecker product of A and B is 


ay,Bo a)B... 4a,B 

AyBiodadyB... a ,B 
A®B= 

A,jBioa,,B oe... a,,B 


Let ®*A denote the Kronecker product of k copies of A, which is well defined 
since the Kronecker product is associative (see Lancaster and Tismenetsky [1985]). 
Thus the Hadamard matrix H,, defined in (5.1) is @"A. 


(5.12) PROPOSITION [Tornheim 1970]. Suppose A isn X n and CP. Then A ® H, 
is CP and its pivots are the Kronecker product of the pivots of A with those of H,—that 
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is, the entries of 
[a --- af] @ [12]. 
Therefore, A ® H, is CP for all n and the pivots are the Kronecker product of the 
pivots of A with ®” [12] = [12242448...]. 
Proof. Suppose that two steps of GECP are done on 


A®H,= 


a, Hy a, oH, eee an nty 
If no row or column exchanges have been done, then for i > 1, the block a, jy 


must have been changed into a,,Hy — (@1;4;;/4,,)H, and the block a,,H, into 
a,,® Hf”. Thus 


| (5.13) 


(A ® H,)® nn nen 


To see that no exchanges will be done in those first two steps, observe that the 
entries of A @ H, are +a,,, hence |a,,| dominates each because A is CP. So the 
(1,1) entry will be the first pivot. After the first pivot step the upper left 2 x 2 block 
and the block a,,H, for i > 1 have become, respectively, 


“ ayy | and a ai 

0 2ay —af? aj, + (a, ,4;)/ay 

Since A is CP, each entry of the second type block is dominated by |2a,,|, so no 
exchange is done at the second step of GECP either. Therefore, two steps of GECP 
do produce (A ® h,)® as shown in (5.12). The rest of the proof that A @ H, is CP 
now follows by induction. The final upper triangular matrix is (A ® H,)@"~) = 
A‘"—) @ HY, hence the pivots are as claimed. For n>1, A ®H, equals 
(A ® H,_,) @H, so by induction, A ® H,, is CP for all n; and finally, H, is CP 
and H, = H, ® H,,_,, hence H, is CP for all n > 1. 


Curiously enough, it is false that H, ® A need be CP if A is—just consider A 


1 1 


=], ‘| This example also shows of course that having A and B completely 


pivoted is not enough to imply that A ® B is also. 

The above results show that when GECP is done on Hadamard matrices, the 
magnitudes of a few of the first and last pivot elements are determined. However, 
we obtained examples which show that the entire sequence of magnitudes of pivot 
elements in the row reduction of an n X n CP Hadamard matrix is not determined 
by n, nor even by the Hadamard class of that matrix, when n > 12. This says that 
there is a rich variety of values among the various intermediate size subdeterminants 
in such a matrix. 


(5.14) Definition. Two matrices are called Hadamard equivalent if one can be 
obtained from the other by a sequence of row exchanges, column exchanges, row 
negations, and column negations. 
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H,, and H, are representatives for n = 2,4, and 8; and an example in Paley [1933] 
is a representative for n = 12. For n = 16 it is proved in Hall [1961] that there are 5 
classes, and examples of each are given. For n = 20 there are 3 classes, with proof 
and examples in Hall [1965]. Also see Hall [1986]. 

When an n X n Hadamard matrix A is reduced by GECP, let p = (p,,..., p,) 
where p, = lat |. (Note this is the reverse of how p, was defined in Section 3.) If 
n = 2, 4, or 8, then necessarily p = (1,2), p = (1, 2,2, 4), or p = (1,2, 2, 4, 2, 4, 4, 8) 
by the results in (5.5) and (5.8) above. For n= 12, we performed Gaussian 
Elimination on many different 12 x 12 CP Hadamard matrices (all Hadamard 
equivalent to one another). We always obtained p = (1,2,2,4,3,10/3, 18/5, 
4, 3, 6, 6,12), and conjecture that this is the only pivot pattern possible when n = 12. 

For 16 X 16 Hadamard matrices, in Hall [1961] the 5 classes are numbered 
I,..., V. Classes IV and V are one another’s transpose, and so are identical for the 
purpose of GECP (since A is CP if and only if A’ is, by the discussion following 
Proposition (2.3), in which case both give the same p). Within each of classes I-IV 
we row reduced many CP representatives and obtained various results for p, as 
indicated below. 7 


TABLE (5.15) 
Pivot Patterns for 16 X 16 Hadamard Matrices 
Reduced by GECP 
Class P 
(1, 2, 2, 4, 2, 4, 4,8,2, 4,4,8, 4,8, 8, 16) 
I (1, 2,2,4,3,8/3, 4,6,8/3, 4, 4, 8, 4, 8, 8, 16) 


(1,2, 2,4,3,8/3, 4,6,8/3,4,6, 16/3, 4,8, 8,16) 
(1,2,2,4,2,4, 4, 8,2, 4,48, 4,8,8,16) 
II (1,2,2,4,2,4, 4, 6,8/3, 4,6,16/3, 4,8, 8, 16) 
(1, 2,2, 4, 3,10/3, 12/5, 4,16/3, 4,4,8,  4,8,8,16) 
(1, 2, 2, 4, 2,4, 4, 8,2, 4, 4, 8,  4,8,8,16) 
III (1, 2, 2, 4, 2, 4, 4, 6,8/3, 4, 4, 8, 4,8, 8,16) 
(1,2, 2,4, 3,10/3, 18/5, 4,4, 40/9, 24/5, 16/3, 4,8,8,16) 
(1, 2, 2, 4, 2, 4, 4, 44,4, 4, 8,  4,8,8,16) 
IV (1, 2, 2, 4, 2, 4, 4, 4,4,5, 24/5, 16/3, 4,8, 8,16) 
(1,2,2,4,3,10/3, 18/5, 4,4,40/9, 24/5, 16/3, 4, 8, 8, 16) 


Observe that p,_, = 1/4 in every case, as mentioned after Proposition (5.5) 
above. 


6. NPSOL and the growth and length problems. As observed earlier, examples of 
CPN n X n matrices A for 2 < n < 4 which satisfy g(A) = g(n) are given by 


1 1 1/72 1 1 1 1 

4=| I 1.4 = -1 122 1 }|and4=]7' 2 71! 
-1 1 , -1 -1 11 
12-1 1 -1 -1 1 


Each of these also satisfies AA’ = cI and g(A) = c. This observation led us to 
conjecture that the same phenomenon might hold for larger n. We now know that 
this conjecture is almost certainly false for n = 5 and n= 7, as we will explain 
below. However, in early attempts to gather evidence in favor of the conjecture, and 
to generate examples of matrices with large growth, we were led to obtain the 
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non-linear programming software package “NPSOL,” written by Gill, Murray, 
Saunders, and Wright of the Systems Optimization Laboratory group in the 
Operations Research Department at Stanford University [NPSOL 1983]. A detailed 
discussion of the algorithm employed in NPSOL (sequential quadratic program- 
ming) can be found in Gill, Murray, and Wright [1981]. We will describe what this 
package does and how we used it, and then discuss the rather intriguing results we 
obtained and questions that these results suggest. The following definitions will be 
useful; they are ours, not standard ones. 


(6.1) Definition. We call a normalized n X n matrix A normalized orthogonal 
(NO) if AA‘ = c(A)I for some constant c(A). Let c(n) = sup{c(A)|A an n Xn 
NO matrix}. 


If we identify the set of n X n real matrices with R” as in Section 2, it is clear 
that the NO matrices form a compact subset of R” on which the function c(A) is 
continuous. Thus c(7) exists and there is an n X n NO matrix A with c(A) = c(n). 
Also, it is clear that c(n) <n, and c(n) = n iff there exists an n X n Hadamard 
matrix. So the determination of c(n), which we call the length problem because 
yc(A) is the usual Euclidean length of each row of an NO matrix 4A, is at least as 
difficult as the determination of those n for which an n X n Hadamard matrix 
exists. 

For any n X n NO matrix A, A~! = (1/c(A))A’‘; hence by Proposition (2.3) we 
have h(A) = c(A)/|a;,| where a;; is some non-zero entry of A. Since g(n) = h(n) 
by Proposition (2.9), it follows that g(n) > c(n). If it were true that g(n) = c(n) 
for all n, Wilkinson’s conjecture would follow at once. However, as mentioned 
above and explained below, this is almost certainly false for n = 5 and n = 7. 

The NPSOL package addresses the problem of finding min f(x) where x = 
(X,,---)X,,) = R™ is subject to the constraints 


k;<x,;<u,; for l1<j<m and 


kintj < 8X) <S Uns; for 1<j<p. 


The functions f, g),..., g, must be continuously differentiable and the bounds k, 
and u, may be equal, or may be infinite. The user of the package must write a 
Fortran program to call the main subroutine of the package, and must also write 
subroutines to be called from within the package to evaluate f, g,,..., g, and all of 
their first partial derivatives at a specific point x. An initial point x must be 
supplied which satisfies all the constraints, and the package moves x to a local 
constrained minimum for f. Of course if one wants to maximize f, one minimizes 
—f. The package returns an integer variable “INFORM” which, if equal to 0, 
indicates that the authors of the package consider the result returned to be a reliable 
local minimum. We will refer to a run as successful if it returns with INFORM = 0. 

For the length problem, that is, to determine c(n), we used NPSOL in the 
following way. Let x,, be the n? variables (1 < i, j < n) representing the entries of 
ann Xn matrix, with variable bounds —1 < x;,;< 1. The objective function is 


f(x) = — ) x},, and the constraints are given by 
j=l 


n n 
O0< Vxi- Vxz1;<0 (<i<n—-}) 
j=l j=l 
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and 
O< LU XinXjn <0 (l<i<j<n). 
k=1 
It was necessary to reformulate all of these in terms of variables x,,..., x,,2. 


Analytic formulae for all the relevant partial derivatives are easily obtained, and 
writing the required subroutines was tedious but straightforward. 


For the growth problem, that is to find h(n) = g(n), again we let x,,(1 <i, j < 
n) represent entries of an n X n matrix x, with variable bounds —1 < x;; < 1. We 
required x,, = 1. The objective function is —(x‘"~))?, and the constraints, which 
amount to the requirement that x be CP, are given by 


0 < (x{e-P)? — (xe) < +0 (2<k<n-1,k<i,j<n 
and (i, j) # (k,k)). 


Because any x? is a quotient of subdeterminants of x, analytic formulae for the 
relevant partial derivatives are forbiddingly complicated for arbitrary n, although 
we did use them for n = 3. For larger n we used forward difference quotients to 
approximate partial derivatives, with a step size equal to the square root of machine 
precision. Our subroutines were written in terms of variables x,,...,x,2_,, and 
evaluating the objective and constraint functions and their partial derivatives at one 


value for x meant row reducing n? different n < n matrices. 


Here is a summary of the examples we obtained from NPSOL, and their 
implications. We were able to gather evidence relevant to the determination of c(n) 
and g(n) for n =5,6, and 7, as well as the verification of the known values 
c(3) = g(3) = 9/4 and c(4) = 2(4) = 4. Since NPSOL produces approximations to 
local constrained optima and we are interested in global constrained optima, it was 
appropriate that we use the package repeatedly with many and various initial points. 
So we used a random number generator to produce matrices with random entries 
between —1 and 1, which were then modified so as to satisfy the constraints of the 
problem at hand. For the length problem we used the Gram-Schmidt procedure, 
- followed by normalization. For the growth problem we performed GECP on a copy 
of the random matrix to determine the necessary row and column exchanges to 
make the matrix CP, did those exchanges to the original and then normalized it. 

We will call two matrices “essentially the same” if after permuting the rows of 
one, each pair of corresponding rows are Hadamard equivalent, and the same is true 
of their transposes. This is weaker than the relation of Hadamard equivalence, 
defined in (5.13). 

For the length problem the behavior of the package was very stable. Every 
successful run resulted in a matrix which was essentially the same as one of the 
following: 


1 1 1/2 1 1 1 1 
L,=|-1 172 1]44,=|7' } 71 ! 
-1 -l 1 1 
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1 1 1/3 1/2 -1 1-1 21 #21 #21 = 0 
—] 1 1 1/3 1/2 Pod -t ot O01 

L;=|1/2 -1 1 #1 173}u=]2 $F PF Ff -1 =i 
1/3 1/2 -1 1 1 be 0 et ToT et 
f3 Ve — 1 1 0-1 1 =-1 
1 1/73 1/2 —-1 1 1 1 1 +0 1 «21 
a -1 -1 -b -1 1 =-1 
—1 1 -b 1 -ce -d -1 
1 -1l Cc 1 a —-c -c¢ 

L,=|-1 -1 d -1 -c -1 ~~ bi, 
—-b -a 1 1 -1 #1 ~~2 


1 1 1 -l -e b -—-d 
—-1 -b 1 -a 1 1 -l 
where (a, b,c, d) = ((—2 + V7)/3, (3 — V7)/2,(—1 + V7)/2,(1 + V7)/6). These 
values of a, b,c, and d were determined by the requirement that L, have the above 
form and be NO. Since these values occurred on every successful run, we strongly 
suspect that c(5) = 121/36 = 3.36111, c(6) = 5, and c(7) = (184 — 3577) /18 = 
5.07771. If this value of c(5) is correct, it has the surprising consequence that 
c(5) < c(4). We have examples which show that g(5) > 4 and g(7) > 6, so if the 
above values for c(5) and c(7) are correct then c(5) < g(5) and c(7) < g(7), hence 
c(n) # g(n) in general. 

For the growth problem the behavior of the package was much less stable than 
for the length problem, and the authors are not certain how to interpret this 
observed instability. We applied the package with 3 < n < 7 and only with n = 4 
were the results absolutely uniform: on every successful run we obtained a 4 x 4 
Hadamard matrix. For n = 3, most successful runs yielded a matrix essentially the 
same as L, above, but occasionally we obtained a matrix A with g(A) = 2. We 
speculate that this indicates that for n = 4 all local extrema for the growth problem 
are in fact global extrema, but that for n = 3 there are several local extrema which 
are not global. 

For n = 5, 6 and 7 our successful runs produced matrices A with many different 
values of g( A). For n = 6 the largest value was g( A) = 5, and this occurred with A 
essentially the same as L, above. So we are inclined to conjecture that g(6) = c(6) 
= 5. 

For n = 5, the value g(A) = 4 was the smallest produced by a successful run, 
and it occurred quite often. The largest value we saw occurred with a matrix having 
growth approximately 4.1325, but the results from all the runs were sufficiently 
erratic that we decline to conjecture a value for g(5). The example below has growth 
approximately 4.1296. It was obtained by rounding the entries of one example to 3 
digits and adjusting a few to restore the CP property. There are several entries in 
this example which, if altered slightly, will cause a row exchange to occur during 
GECP and considerably smaller growth as a result. 


1 617 1 178 = —.452 

1 — 836 —.452 —.635 999 
A,;=j}-—.581 —.997 854 1 1 
A451 —1 —1 1 —1 

— 195 —1 .999 —1 — .999 
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For n = 7, successful runs produced values of g(A) ranging from approximately 
5.428 to 6. The matrix which gave growth = 6 did not have entries which were 
recognizably simple. Rounding and adjusting them as we did for A,, we obtain the 
following, which has growth approximately 5.996: 


1 ~] -l1 -1 1 1 ~] 
1 1 -1 -1 -1 -1 -] 
1 —.373 1 -1 797 -228 1 
A,=] 1 4196 -013 1 1 ~] -] 
-1 576 -1 -1 1 ~] 026 
1 1 -1 1 1 999 999 
-1 1 1 -1 999 1 — 999 


These results suggest several questions, among them the following: 

(1) Are our values correct for c(n), n < 7? In particular, is c(5) < c(4)? 

(2) Is c(n) increasing for n > 5? 

(3) Is it true for the length problem that every local extreme point is in fact a 
global extreme point, for all n? 

(4) For what values of n are all local extrema for the growth problem in fact 
global extrema? 

(5) Is h(A) = c(A) = g(A) when A is a normalized orthogonal matrix? 

(6) For what values of n does g(n) = c(n)? 

(7) What is the explanation for the instability we observed in connection with the 
growth problem? 
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The Mobius Group and Invariant Spaces of Analytic Functions* 


STEPHEN D. FISHER, Northwestern University 


STEPHEN D. FISHER: I received my bachelor’s degree from MIT in 1963. My 
graduate work was done at the University of Wisconsin, Madison, where I 
was a Student of Frank Forelli. I was an instructor at MIT from 1967 to 
1969, at which time I joined the faculty at Northwestern University. My 
research focuses on the interaction of complex analysis with functional 
analysis and approximation theory. In addition to a number of research 
papers, I have written a graduate-level book, Function Theory on Planar 
Domains, John Wiley and Sons, 1983, and an undergraduate text, Complex 
Variables, Wadsworth, 1986. pe 


Introduction. One of the most interesting themes of modern, functional analytic, 
analysis is the continual recurrence of theorems rooted in classical analysis. By this I 
mean that what theorems proved some 60, 80, or even 100 or more years ago, say 
about the growth or range of analytic functions are found to be not only relevant 
but actually profoundly connected with, and instrumental in, the development of 
modern Banach-space-oriented investigations. This paper illustrates this principle 
anew, this time in the investigation of the nature of linear spaces of analytic 
functions on the unit disc in the complex plane which are invariant, in a sense to be 
made precise later, under composition with a certain group of linear fractional 
transformations. The basic results in this area are accessible to anyone who has 
studied elementary complex variables and who has a smattering of knowledge of 
what I would term “modern” analysis, that is to say, that portion of analysis that 
takes as its main interest the study of normed linear spaces, their dualities, and the 
operators that act on them. 


1. The Mobius group and some of its properties. Throughout this paper A will 
denote the open unit disc in the complex plane, A = {z: |z| < 1}. The collection of 
those analytic functions on A that are one-to-one mappings of A onto itself form a 
group, with the group operation of composition. We shall call this group the Mébius 
group and denote it by G. It is elementary that each element of G is a linear 
fractional transformation g of the special form 


p(z) =A(a— z)/(1 - az), (1) 
where a is a point of A and A is a constant of absolute value 1; see, for example [8, 
page 198]. Notice, too, that each y © G gives a one-to-one orientation-preserving 
mapping of the unit circle T = {z: |z| = 1} onto itself. Define 


g,(z) =(a-—z)/(0 - @), aed, (2) 
g, interchanges the points a and 0 and is its own inverse: gy, ' = 9,. 


The elements of G possess a number of fascinating properties that will play 
critical roles in our story. The first is the equality 


lp(z) — p(w) |/|1 — e(z)e(v)| = |z - wl /[1 - 20], (3) 


*Research and writing supported in part by a grant from the National Science Foundation. 
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which holds for all points z, w in A; this may be established directly by computa- 
tion. Now divide both sides by |z — w| and let w — 2; the result is that 


(1 — |z/?)le(z)| = 1 -le(2) I. (4) 


In fact, more than this is true. If f is any analytic function mapping A into A, then 


, 2 
(1 — jl? F(z) 1 <1 -1f(2)| (5) 
and strict inequality holds at all points of the disc A unless f is an element of G. 
This is the invariant form of Schwarz’s lemma and its elementary proof follows. Let 
f be an analytic function mapping A into itself, § a point of A, and € = f(£). 
Define a function g by 


a(z) = 9; (f(z))/e;(z). 


g is then an analytic function on A since the zero at ¢ of gp, in the denominator is 
cancelled by the zero at ¢ of »,(f(z)) in the numerator. The maximum principle 
implies that g also maps A into itself. After untangling the relations this yields 


Lf(z) — £(Q) Wd = 28) <1 — fe Ohz - 1. (6) 


To complete the proof of (5), just divide both sides of (6) by |z — ¢| and let ¢ 
approach z. The fact that equality holds in (5) only for elements of G is again a 
consequence of the maximum principle. The expression on the left-hand side of (5): 


If(z) = f(z) — |z\|?) (7) 


is called the Mébius invariant derivative of f, or just the invariant derivative of f for 
short. To see what this means, let » € G and put w = g(z), h = fog. Then 


[A'(z) (1 — |z17) =1f'(e(z)) {le’(2) | = 1217) 
=|f’(e(z)) (1 -le(2) 1) 
=|f’(w)|(1 — |wI?) 


as a result of (4). In short, 


IJ( f° ~)(z)| =|(JF)(o(Z)) I. (8) 
This will have some interesting consequences later, as we shall see. 
A second useful property of elements of the M6bius group is their effect on 
Lebesgue area measure when the change of variables w = (z) is made. We will use 
the notation dA for normalized area measure: 


dA(z) = (1/7) dx dy, z=xt+iy. 


When we write » =u+ iv, the usual change of variables formula using the 
Jacobian gives 
dA(w) = 


u, Uy, 
det b, 2, dA(z). 


However, because » is analytic the Cauchy-Riemann equations, u, =v, and 
u,, = —v,, imply that the determinant equals u2 + v2 = |’|*. Consequently, 


dA(w) =|@'(z) |" dA(z). (9) 
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Let us now apply (4) to (9). We define a measure d= on A by 
d&(z) = (1— |z|?)7 dA(z). 
(Note that d> is o-finite but not finite on A.) With respect to each element of the 
group G, the measure d> then displays the invariance: 
d=(w) = d=(z), w= Q(z). (10) 

For this reason the measure d> will be called the (Mobius) invariant measure on A; 
the fact that we say the Mobius invariant measure is a statement about uniqueness 
which we will not prove. 

On the unit circle T = {z: |z| = 1} elements of the Mobius group also produce 


an important change of variables. If h is a function analytic on a neighborhood of 
the set {z: |z| <1}, an element of G and g(e'”) = e", then 


(n° p)'|d0 =|h'(p(e*)) ||p’(e?) | dd 
=|h'(e'’) | dt. (11) 
The M6bius group has strong connections with the non-Euclidean, hyperbolic 
geometry on the disc A. To start, (3) implies that the function 
d(z,w) = |z—w|/|1 —- zw] 
is invariant under G in the sense that for each y € G 
d((z), p(w)) = d(z, w). 
Moreover, (4) may be rewritten in differential notation as 
jdz|/(1 — |2|*) = lagi /(L— 1817), 8 = (2). 
We define the hyperbolic length of a (smooth) curve y in A by 


L(y) = fiai/( — |z|?) 


and the hyperbolic area of a (Lebesgue measurable) set E by 


A(E) = | f a3. 


It follows that each m € G is a hyperbolic length and hyperbolic area-preserving 
transformation. More generally, the inequalities (5) and (6) show that an analytic 
function f mapping A into itself reduces the function d: d( f(z), f(w)) < d(z, w); 
reduces hyperbolic length: L( f(y)) < L(y); and reduces hyperbolic area: A( f(E)) 
< A(E). Note, too, that the invariant derivative defined in (7) is nothing but the 
derivative taken with respect to hyperbolic length. 

The hyperbolic geometry in the disc A has as its “lines” arcs of circles that meet 
the unit circle T twice, each time at a right angle. These are the geodesics of this 
geometry. The hyperbolic geometry has a metric, the hyperbolic distance, defined by, 


D(z, w) = (1/2) log|(1 + d(z, w))/( — d(z, w))]. 


This is a metric and (3) implies that D(g(z), p(w)) = D(z, w) for all z, w € A and 
all elements g of G. Furthermore, equality holds in the triangle inequality if and 
only if the three points lie on the same hyperbolic line. Thus, the elements of G act 
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as rigid motions relative to this geometry. Moreover, the distance from the origin to 
the point z, |z| =r > 0, is (1/2)log{( + r)/( — r)]. It is thus infinitely far from 
the origin, or any other point in A, to the unit circle T. More on the hyperbolic 
geometry of the disc A and the group G can be found in [25]; also see [1]. 


2. Spaces of functions invariant under G. The chief concern of this paper is to 
survey some recent results on linear spaces of analytic functions that are invariant 
under composition with elements of the MObius group G. 

When I write that X is Mobius invariant I mean by this that X is a linear space of 
analytic functions on A which is complete in a norm or seminorm, || ||, and 
further, 


for each function f in X and each element of G the 


(12) 


There are two other restrictions on what precisely constitutes a MObius invariant 
space; they will be listed later when they are more relevant. It is more important at 
this point to list a number of spaces which possess the invariance property (12). The 
list will be extensive but, of course, not exhaustive. 


function f ° g also lies in X and satisfies || f° p||y = || f lx. 


Example 1. The space H® of bounded analytic functions on A with the norm 
| || = sup{|f(z)|: z € A} satisfies (12). Further, the closed subspace, A, of H™ 
consisting of those functions which are continuous on the closed disc, {z: |z| < 1} 
also satisfies (12). A is called the disc algebra and has been the subject of much 
study in other contexts; see, for instance [13]. 


Example 2. For 1 < p < o the space B? is defined to be all analytic functions g 
on A for which the invariant derivative, Jg, lies in L?(A, d=). For 1 < p < o and 
after the relation (7) and the definition of d= are taken into account this amounts 
to the requirement that the integral 


figr@PG — |z|?)?* dA(z) (13) 


be finite. For p = oo the requirement is that the quantity 


sup{|g’(z)|(1 — |z|?): ze A} (14) 


be finite. When 1 < p < oo the space B? is the Besov p-space; they are part of a 
much more general family of spaces; see [17]. For historical reasons, which I will set 
out below, the space B® 1s called the Bloch space. The seminorm on B? is the pth 
root of the integral in (13) if 1 < p < o and the quantity (14) if p = oo. Note that 
a constant, and only a constant, has seminorm zero. 

The advantage of the initial definition, that the invariant derivative of g belongs 
to L?(A, d), is that it is immediately clear, using (8) and (10), that 


igl,=llgceel, 8B’, pEeG, 1l<p<o. 


By the way, the reader who is now anticipating the definition of the space B! 
along the lines of the integral (13) will be disappointed. It is an interesting exercise 
to show that the integral in (13) does not converge if p = 1 unless g is constant. The 
space B! will be defined below in Example 4 but it will take some effort to show 
that it is connected to the spaces B? as defined here. 
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Let us spend a second now to show that H® is a subset of B™ and, indeed, that 
ISllae < |lgllaz< forall g © H™. Let g © H™, set M = ||g|| 70 = sup{|g(z)|: z € A} 
and define f = g/M. Then f maps A into A and so by (5) 


; 2 
(1 - |z7)If'(2)I<1-lf(){ <1 
for each z € A. Now just multiply through by M to obtain (1 — |z]”)|g(z)| < M, 
which is the desired inequality. 
The closed subspace, b®, of B® consisting of those functions g for which 


limit {(1 — |z|7)|g’(z)|: |z| > 1} =0 


also satisfies (12). Actually, b~® is the closure of the polynomials in z in the 
seminorm (14). b® is called, for notationally clear reasons, the little Bloch space. 


Example 3. The space B? is also known as the Dirichlet space and is usually 
denoted by D. The functions in D are those for which the area of the image of A 
under f, counting multiplicities, is finite since the integral in (13) becomes (remem- 
ber, p = 2) 


purr da = Ji 


Note also that the Dirichlet seminorm is also given by the expression 


co 1/2 ce 
Ill = | » kla,("| ,  f(z)= Li a,z* 
k=1 k=0 
The space D is a semi-Hilbert space meaning that it has an inner product 


(f,8)n = [i's aA (15) 


which satisfies all the usual requirements of an inner product except that it is 
possible that (f, f)p can be zero without f being zero; this happens, as we’ve seen, 
if and only if f is a constant. Later we shall discuss the possibility of the existence 
of other, intrinsically different M6bius invariant semi-Hilbert spaces. 


Example 4. The space B! is defined to be all those analytic functions f on A 
which have a representation as 


co ce 
f(z) = Yo c,, where »,€G and ) |c,| < o. (16) 
k=1 k=1 
Since a function f could conceivably have several such representations, we define 
the norm on B! by the rule 


If || = int y |c,|: (16) holds. (17) 


The invariance property (12) is proved by the following argument. If f € B! and 
gm & G, then fog = Lc,p,°9. Since G is a group, 9,° again lies in G so we 
obtain || f° || < X|c,|. By taking the infimum over all representations of f we find 
that || f° q|| < ||/||. However, because G is a group, the same inequality is true with 
gp! in place of ; equivalently, || /|| < || f° ||. Thus, equality holds. The functions 


~ 
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gm & G form the “atoms” out of which the space B! is built. The interested reader 
could spend a minute and prove that ||q|| = 1 for each  € G. (It’s clear that » has 
norm at most 1; the issue is to show that it cannot be less than 1. One approach is to 
show that each f € B! lies in the disc algebra A and its norm in A is no more than 
its norm in B?.) 


Remark. The familiar Hardy H? spaces are defined for 0 < p < o by the 
requirement that 


7 . Pp 1/p 
lla = sup| [a2n) i "| #(re'®) | a6 O<r< 7 


be finite. It is worth pointing out explicitly that none of these H? spaces satisfy (12). 
Indeed, it is not a difficult computation (see [7]) to show that 


~1 
sup{|lf° Palla: f © H?, II fll <1} > (1 Jal?) 
and hence (12), or anything like it, does not hold for H?. 


Example 5. Those analytic functions h on A for which h’ lies in the Hardy space 
H' also satisfy (12). The seminorm on this space is 


Ill = 1/20) f “"lh'(e') | dO 


and the invariance is a result of the change of variables formula (11). The functions 
in this space are exactly those that map the unit circle T onto a rectifiable curve of 
finite length; see [7]. 


3. Basic properties of Mobius invariant spaces. Let us begin our look at M6bius 
invariant spaces by identifying the largest and the smallest ones. We first list an 
additional, most natural, property that each MObius invariant space is required to 
possess. 


There is some continuous linear functional “on X which is also continuous in the 
topology of uniform convergence on compact subsets of A. 
(18) 


That is, if { f,,} is a sequence of elements of X which converges uniformly on 
compact subsets of A to 0, then 7( f,) — 0. For instance, ( f) = f’(0) satisfies (18) 
on each of the spaces listed in Examples 1—5. With the modest requirements, (12) 
and (18), we are now ready to prove our first theorem, a fundamental result of L. A. 
Rubel and R. M. Timoney [24]. 


THEOREM 1. Let X satisfy (12) and (18). Then X is a linear subspace of the Bloch 
space B® and the embedding is continuous: there is a constant C depending on X but 
not h such that 


All p> < CllAlly forallh € X. (19) 


Proof. The linear function ¢ has the form ¢(h) = {27h(re!®)G(re'®) d0, where 
G(z) is analytic on a neighborhood of the set |z| > r, including at oo, for some r, 
0 <r<1; see [23]. Let G(z) = L1°_,G,z~” be the power series expansion of G 
about oo and let N be any integer such that G, # 0. Replace h by e~'“‘h( ze") and 
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then integrate with respect to ¢ from 0 to 27. Using (12) we obtain the estimate 
r?%ihyGy| < C||h||x where C is the norm of fon X. This is nearly what we want. 
If N = 1, then this inequality shows that |h’'(0)| < C’||h||y for some constant C’ 
independent of h&X. Replace h by hog,; we are led to the inequality 
(1 — Ja|*)|h’(a)| < C'|JA||x, valid for each a © A. This, of course, implies that X is 
a subset B® and we are done. If N> 1, then similar, but more elaborate, 
computations show that X is continuously embedded in B®. (The reader can easily 
work out the omitted case N = 0 and the case N=2.) O 


This is a good point to explain why the functions in B® are called Bloch 
functions. The starting point is the 1924 theorem of A. Bloch [5], which states that 
an analytic function f in A, normalized by the condition f’(0) = 1, must cover, in a 
one-one fashion, a disc in its range of at least a certain minimum size. This is one of 
a number of beautiful and profound results about the range of an analytic function, 
including the theorems of Koebe and Picard, which, by the way, is a consequence of 
Bloch’s theorem. In the course of giving another proof of Bloch’s result and deriving 
a numerical lower bound for the minimum size of this disc, E. Landau [14] showed 
by a series of elementary manipulations that attention could be restricted to 
functions f and satisfied the growth condition |f’(z)|(1 — |z|?) < 1. Of course, 
mathematicians being what they are, this last condition was ultimately recognized as 
defining the unit ball of some space and “Bloch functions” as entities unto 
themselves were born. More on Bloch functions and their fascinating properties can 
be found in [2] and a proof of Bloch’s theorem in [6]. We now return to our primary 
interest, M6bius invariant spaces. 

The third and final restriction that we impose in order that a space be Mobius 
invariant requires us to topologize the M6bius group G. We do this by requiring 
that the mapping (A, a) ~ Aw, of TxA onto G is a homeomorphism. This means, 
basically, that the disc and the unit circle each have their usual topology but you 
can’t reach one from the other. In order that a space X be Mobius invariant we 
require that (12) and (18) hold and, in addition, the final restriction: 


for each f € X the mapping gy — f° of the group 


G into X is norm continuous. (20) 


By the way, for the spaces H® and B®, (20) is not correct. Luckily, all is not lost. 
Both of these spaces are dual spaces and the mapping in (20) is continuous from G 
into them when they are given the weak-* topology. So we can modify (20) for dual 
spaces to require only that the mapping in (20) be continuous from G into X with 
the weak-* topology. 

We are now ready to identify B! as the smallest Mébius invariant space. 


THEOREM 2. If X is a nontrivial Mobius invariant space, then X contains B'. 
Indeed, the containment is continuous in the sense that 


Ifllx < Cllfll for all f © B’ (21) 


where C is a constant depending on X but not on f. 


Proof. We shall first show that X contains each of the functions z”, n = 1,2,.... 
Since X is nontrivial it contains a function F which is not constant. Let F(z) = 
“L%_»b,z* be the power series expansion of F about the origin and choose a positive 
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integer N such that b, # 0. The function g defined by 
g(z) = (1/2m) [°"F (zee dd, EA 
0 


lies in X since the integrand is a continuous, X-valued function of 4 by (20). On the 
other hand, the value of the integral is most clearly b,,z. This shows that z™ lies in 
X. It follows from (12) that (g,)”* lies in X for each a € A and repeating the above 
argument with this function in place of F we see that z” lies in X. (This may involve 
choosing an appropriate a € A.) The rest is now easy. Since z lies in X so does each 
gy & G, again by (12) and 


lPllx = 12° Pllx = |IZIlx = C. 
Now let f = Lc, , be any element of B!. We then compute 


cPellx < LexlllPallx 
= Cyilexl- 
Taking the infimum over all representations of f we obtain (21). O 
COROLLARY 3. A Mobius invariant space contains the polynomials. 


Indeed, it is not hard to show that the polynomials are dense in a MObius invariant 
space. (Or weak-* dense in the case of a MObius invariant dual space.) 

There is a problem here that we cannot ignore—the space B! itself is not 
obviously MObius invariant. We’ve certainly shown that it satisfies (12). (18) is also 
easy since if f = Lic,q,, then f is the absolutely convergent sum of functions of unit 
norm in the disc algebra and so lies in the disc algebra. In particular, for any point 
Zo GA, |f(25)| < Lle,|; this gives (18). The problem is, of course, with (20). While 
it is true that B! does satisfy (20) the proof would take us too far afield so we shall 
just content ourselves with stating this as a proposition; see [4] for the details. 


PROPOSITION 4. The space B' satisfies (20) and so is the minimal Mobius invariant 
space. 


Let’s spend a paragraph or two here to show how the space B! fits into the family 
of spaces B? as defined in Example 2. First, it is not too difficult to establish that 
for 1 < p < co a function f lies in B? if and only if 


fire(@P@ = Py? da(z) (22) 


is finite. The trade-off of one derivative of f for an extra power 1 — |z|? is basically 
just a consequence of Cauchy’s formula. The disadvantage of formulating member- 
ship‘in B? this way is that it is not at all obvious from (22) that B? is Mébius 
invariant. Nonetheless, (22) does permit us to pass to the case p = 1 and connect 
the space B! defined in Example 4 with the B” spaces. 


THEOREM 5. The space B! is identical with the set of those analytic functions f on A 
for which f" lies in L(A, dA). 


Proof. One inclusion is elementary. A computation establishes the inequality 


[ee(2) |44(2) <4, aéA. 
A 


522 STEPHEN D. FISHER [June-July 


From this it is immediate that each element of B! lies in L1(A, dA) and the norm of 
its second derivative in L! is no more than 4 times its norm in B'. On the other 
hand, let h be a polynomial with a zero of order two or more at the origin. Another 
elementary computation establishes the formula 


h(z) = Jha) 90(2)/a 4A(a). 


The integrand is a continuous B'-valued function of a. The measure (1/a) dA 
simplifies to e’’ dtdr, a = re", and this can be used in an elementary fashion to 
obtain the estimate 


Alla < CA (a, aay 
where C is a constant independent of h. The polynomials in z are dense in the 


analytic functions that lie in L1(A, dA) and they are also dense in B’, since B! is 
Mobius invariant. This establishes the other inclusion and finishes the proof. O 


4. Dualities, representations, uniqueness. We begin this section with the general 
thought of finding the dual space of a Mdbius invariant space. There are two 
“principles” that should be kept in mind when searching for dualities among 
MObius invariant spaces: 

—a MObius invariant space should have a MObius invariant dual 
—the pairing used for the semi-inner product in the Dirichlet space D 


((f,8)) = [f'eaA (23) 


is Mébius invariant and hence offers the possibility of a M6bius invariant pairing 
for other pairs of spaces. One point is worth noting here about the integral in 
(23)— generally, it is not absolutely convergent. We usually need to take the pairing 
in the limiting sense: 


((f,8)) = limit{ (1/27) f a f *"#"( re!) g'(re") dtr dr: EL 0}. 


The fact that (23) is invariant under G is easy to prove, assuming say, that the 
functions f and g are analytic on a neighborhood of {z: |z| < 1}. For if gp € G, 
then 


((f° 9, §°@)) = [ (fee) (gee) a4 
= [F(o)a(@ENle'@) 442) 


= [s'(w)e"Qw) dA(w) = (89) 


by invoking (9). By the way, the pairing (23) can be realized another way, too, on 
the unit circle T by 


(Cf. 8)) = (1/2m) f 75-i9¢( e997 0) ag. 


Once again the integral is not absolutely convergent, in general, and we have to take 
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the integral in some sort of limiting sense, say 
Cf, &)) = limit (1/27) ["e-%f(re"*) (re) do:r ni}. 
0 


One other note about (23): if either of f or g is a constant, then the value of 
(<,)) is zero. Therefore, any dualities established with (¢,)) must be taken 
modulo constants. Let’s put the pairing ((, )) to work with fe B™ and g=q EG. 
Another computation produces 


Cf; Pa)) = -(1 7 la|*) f(a), aca. (24) 


(Just check this for f(z) = z™, which is very direct, to be convinced.) The right-hand 
side, of course, is just the quantity that is used in computing the B® norm. 
Meanwhile, the functions {g,: a © A} are the “atoms” which make up the space 
B'. Hence, it should be no surprise that B® and B! are in some sort of duality. The 
results are set forth here; proofs are contained in [4]. 


THEOREM 6. The dual space of B® is B' and the dual space of B' is B®. In both 
cases the pairing is effected by (23) and the correspondences are isometric isomor- 
phisms. 


Actually, the successive dualities involving, in turn, a “little oh” space, an L! space, 
and finally, a “big oh” space are a well known phenomena in functional analysis, 
the classic case being the successive dualities of c,, 7’, and “~. However, this type 
of relationship has already been identified as occurring in spaces of analytic 
functions on A; see, for instance [22] and [27]. 

Other dualities among MObius invariant spaces are also effected by (23). For 
example, the dual space of B? is B?, p>! + g~! = 1, and 1 < p < oo. However, the 
correspondence is not isometric in this case. Using the pairing (23) it is also possible 
to construct, for example, a M6bius invariant predual of H® (the conventional one 
is not MObius invariant), and MObius invariant duals of other Mobius invariant 
spaces. 

While on the subject of duality let us turn here to the topic mentioned in 
Example 3, that is, the possible existence of a Mobius invariant semi-Hilbert space 
other than the Dirichlet space D. The answer is contained in the next theorem and 
the remarks that follow it. 


THEOREM 7. Let X be a Mobius invariant semi-Hilbert space. Then the seminorm 
of X is a constant multiple of the Dirichlet seminorm. 


Proof. Let (f, g) be the semi-inner product in X. The MObius invariance of the 
seminorm of X implies that (f/f, g) =(f°9, g°¢) for all f, g © X and all po EG. 
Take p(z) = ez, f(z) = z*, and g(z) =z”, m # k, and integrate with respect to 1 
from 0 to 27. This implies that 


(z*¥,z™)=0, m#k. (25) 


Next, take f(z) = g(z) =z and p=9,, —1<r<1. Making use of (25), we 
obtain 


N= (2,2) = (9,9) = (1,1) + (L = 7?) Er? 2(24, 24), 


Expand the right-hand side in powers of r*. Each coefficient, except the Oth is zero. 
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This produces the series of equations 
(1,1)=0 and (z2*,z*) — 2(2*"1, 2*-1) + (2*-?, z*-?) =0, kk 32. 


These equations yield (z*, z*) = kA, k =1,2,... . That is, the seminorm in X is 
given by 


(ifllkP =f) =A DL kal, f(z) = YE a,z*. 
k=1 k=0 


Hence, the seminorm of X is A times the seminorm of D and the theorem is proved. 


Actually, a substantially stronger result than Theorem 7 is true. If X is a 
semi-Hilbert space for which (18) and (20) hold and if there is a constant C such 
that (f°, fon) < C(f, f) for all » € G, then the seminorm in X is equivalent to 
that of D and so X is D; see [3]. This last result tells us that there is only one 
MObius invariant semi-Hilbert space. Whether similar uniqueness theorems hold for 
other types of Mébius invariant spaces is an intriguing question. 


Representations of G. Let » be an element of G and define a mapping C, on 
analytic functions by C,(f) = f°. The connection between the group G and the 
family of mappings C, given by 


(9) =C, (26) 


is an anti-isomorphism of groups, the image being a subgroup of the group of 
continuous linear mappings of the Mobius invariant space X into itself. As a matter 
of fact, each C, is an isometry of X and, in particular, of the semi-Hilbert space D 
onto itself. Hence, +r is a representation of G as a group of isometries of a 
(semi)-Hilbert space. There are, however, many other representations of G as a 
group of isometries of some Hilbert space. Another such is as unitary mappings of 
the Hardy space H?, given by 


V(h) = (hep)(o’)”. (27) 


The subject of the representation of a group as a group of isometries of some 
Hilbert space is an important part of harmonic analysis; see, for example [10] and 
[15]. 


5. Hankel operators. A series of recent results draw further connections between 
MObius invariant spaces and other parts of analysis, more specifically portions of 
operator theory. Recall that a Hankel matrix H is a matrix, finite or infinite, whose 
entries are constant on the diagonals from upper right to lower left; that is, H;; is a 
function only of i + j. Hankel matrices arise in many contexts in analysis; see, for 
instance [18] and [20]. One of the most beautiful connections between complex 
analysis and Hankel operators is the following. 


KRONECKER’S THEOREM. Let {H,}% be a sequence of complex numbers. The 
Hankel matrix H = (H;,;) has finite rank if and only if the analytic function 
F(z) = ©%_)H,/2* is rational. If this is so, then the rank of the matrix H is equal to 
the degree of F. 


A proof of Kronecker’s theorem and more on Hankel matrices may be found in [9]. 
A Hankel operator is an operator on a Hilbert space whose matrix is a Hankel 
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matrix. One particularly well understood setting for this theory is on the Hardy 
space H’, considered as a closed subspace of L?(T, d@). Let P be the orthogonal 
projection of L?(T, d@) onto H’. P is given by 


P| » ae = ) a,ei*®. (28) 
— k=0 


Suppose that f is an analytic function on A. The Hankel operator induced by f on 
H? is defined by 


H,(g) = (I - P)fg. (29) 
Explicitly, H, is given by the formula 
H,(g)(z) = (1/2mi) f {7(2) - FED} (EE - 2)" a8 (30) 


To see why H; is a Hankel operator we compute its matrix. The functions e, = e'*", 


k = 0,1,2,... are an orthonormal basis for H? and the functions e_,, k = 1,2, 3,. 
an orthonormal basis for the range of H;. Let f= re_ybje’ be the Fourier 
expansion of f. The (k, n)th entry in the matrix of H; is given by 


(H,(e,),@n) 2 =Onae a>lk>O. (31) 


This shows that H, is indeed Hankel. 

What is the connection between these operators and Mobius invariant spaces? 
The answer to this question lies in finding the relationship between the operators H, 
and H,,,, where » is an element of G. We begin by noting that the operator V, 
defined in (27) commutes with P and, moreover, is connected with the operator M, 
of multiplication by the function u by the tule M,., = V,M,V,_; the latter 
formula is almost a tautology. Let ~ = »~+. We then compute 


Hy. = (1 - P)M;., = (VV, — P)V,MyV, = (V,M;— PV,M;)V, 
= (V,M; — V,PM;)V, = V,(1 — P)MjV,. 
That is, 
Hy. = V,H,V,". (32) 


Therefore, the operator H,,, is unitarily equivalent to H,. In particular, any 
property of H, that is preserved by unitary equivalence of operators (e.g., bounded- 
ness, compactness, Hilbert-Schmidt, etc.) will also be possessed by H;,,. Put into 
different words, if X is the space of those analytic functions f on A for which the 
operator H, possesses property P, P a unitary invariant of operators, then X is (at 
least a good candidate to be) a Mobius invariant space. We show below how this 
observation can be used together with Theorem 7 to provide a short, direct proof of 
the characterization of Hilbert-Schmidt Hankel operators. 

Let us continue our brief look at Hankel operators by characterizing those that 
are bounded (= continuous) and those that are compact. These conditions involve 
the space BMOA. This space consists of those functions f in the Hardy space H? 
which satisfy the additional condition that the quantity 


sup (1/2m) {| f(e")PPa(e) de -|f(a) |: a e A} (33) 
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is finite, where P, is the Poisson kernel for a € A: 
P(e“) = (1 — Jal?)/|1 — e*ay’. 

The space VMOA is defined as the subspace of BMOA consisting of those functions 
for which the quantity in the curly brackets in (33) goes to zero as |a| — 1. The 
reader should take note of the fact that both BMOA and VMOA are Mobius 
invariant spaces, with the square root of the quantity in (33) as the norm. The 
“classical” theorems which characterize boundedness and compactness of the Hankel 
operator H,, then, are due to Z. Nehari [16] and P. Hartman [12], respectively, and 
are simply stated. 


THEOREM 8. H, is bounded on H? if and only if f © BMOA. H, is compact on H? 
if and only if f © VMOA. 


Now we turn to Hilbert-Schmidt operators and the connection with Mdbius 
invariant spaces. A linear operator T mapping a Hilbert space K into itself is 
Hilbert-Schmidt if there is an orthonormal basis {u,} of K such that L°%,||Tu,||’ 
is finite. Further, if T is Hilbert-Schmidt, then this sum is finite for any orthonor- 
mal basis and the value of the sum is independent of the choice of the orthonormal 
basis. The quantity 


ie) 
[T, S] = L (Tu;,Su;) 
j= 
defines an inner product on the linear space of Hilbert-Schmidt operators, turning it 
into a Hilbert space; see [11]. The observations above, in particular (32), then set the 
stage for the following theorem. 


THEOREM 9. The Hankel operator H, is Hilbert-Schmidt if and only if the function f 
is in the Dirichlet space. Further, the Hilbert-Schmidt norm of H, is equal to the 
Dirichlet norm of f. 


Proof. We define X to be all those analytic functions f on A for which the 
operator H, is Hilbert-Schmidt. A seminorm is defined on X by ||f|| = ||4;||, where 
the latter norm is the Hilbert-Schmidt norm of the operator H,. It is then apparent 
from (32) that X is a semi-Hilbert space which satisfies the Mobius invariant 
property (12). Further, it is not difficult to show that properties (18) and (20) also 
hold for X. Theorem 7 then shows that X coincides with the Dirichlet space 
provided it contains a non-constant function. However, the Hankel operator in- 
duced by F(z) =z is simply H,;(g)(z) = g(0). Acting on the orthonormal basis 
{z*\° of H* it produces in turn the constant functions 1,0,0,... and so clearly it 
is Hilbert-Schmidt with Hilbert-Schmidt norm 1. (The reader will note that, in fact, 
(32) then shows that H, has rank one for allg eG.) O 


More recently, V. V. Peller [18], R. Rochberg [21], S. Semmes [26], and others 
have made further investigations of the properties of the operator H,, including 
theorems about containment of the operator H, in certain ideals of compact 
operators in the space of operators. One interesting aspect of these results is that the 
function spaces so arrived at are the B?” spaces defined in Example 2. 


I close by extending my thanks to Professors Jonathan Arazy of Haifa University, Israel and Jaak 
Peetre of the University of Lund, Sweden. They introduced me to the subject of Mébius invariant spaces 
of analytic functions and our research collaboration has been fruitful and enjoyable. This paper is due in 
no small part to the interest and knowledge of this subject which I gained from them. 
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On the Zeros of the Taylor Polynomials Associated with the 
Exponential Function 


BRIAN CONREY and AMIT GHOSH, Oklahoma State University, Stillwater 


BRIAN Conrey did his undergraduate work at Santa Clara University and 
his graduate work at the University of Michigan (with a year at Cambridge 
University). He did postdoctoral work at the University of Illinois and the 
Institute for Advanced Study, Princeton. He is currently a Sloan Fellow. 


AMIT GHOSH did his undergraduate work at Imperial College, London and 
his Ph.D. at the University of Nottingham (with a year at the University of 
Illinois, Urbana) and postdoctoral work at the Institute for Advanced Study, 
Princeton. His main interest is in the analytic theory of Z-functions. 


While investigating a certain mean-value associated with the zeros of the nth 
derivative of the Riemann zeta-function [2] we obtained for each nonnegative 
integer n a formula with a constant factor 


n 
a,i=nt+1—- Ye, 
p=] 
where the complex numbers z, are the roots of the polynomial 


z? z” 


E,(z)m=l+z+ 5 + vee +o 
Thus a, = 1, a, = 2 — e, a, = 3 — 2e cos], and if 
r=(¥2 +1) and s=(¥2 - 1)”, 


then 


3 
a,=4-e7trs— 2e8 28°92 cos (7 +s), 


The numbers z, have been well studied, and for a given n most of them have real 
part smaller than —kn for some positive constant k. Thus it is natural to expect 
that a, should grow exponentially with n. We computed the first few a, and 
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quickly changed our expectations: 


a, =1 a, = —0.0000062064... 
a, = —0.7182818284... & = —0.0000018672... 
a, = +0.0626121201... &19 = —0.0000004703 ... 
a, = +0.0120619221... a1, = —0.0000000989... 
a, = +0.0019468374... e145 = —0.0000000153... 
as = +0.0002139607... a1, = —0.0000000004... 
a, = —0.0000093400... a1, = +0.0000000009. .. 
a, = —0.0000154019... a5 = +0.0000000005... . 


It seems that a, is approaching 0 rather rapidly! Of course a, # 0 by Lindemann’s 
famous theorem [4]. 

Szeg6 [8] initiated the study of the zeros of E,,(z). It is convenient to scale down 
by a factor of n and let 


Szeg6 proved that the & cluster around the simple closed curve [ = {z: |ze!~?| = 1, 
|z| < 1} as n > oo and that the proportion which cluster along a given arc of T° is 
asymptotic to the change in 


I 1—z 
— arg Ze 
20 
as z varies along the arc. We mention that this implies that the proportion of zeros 
of E, with negative real part is asymptotically 


1 1 
5 + — = 0.617099... 


7e 


since the arc of IT which lies in the half plane Rez < 0 has endpoints z = +i/e. 

Buckholtz [1] has shown that the ¢, all lie strictly outside I and are within a 
distance 2e/n'/? of I. By the Enestrém-Kakeya theorem on polynomials with 
monotone coefficients (see Polya-Szeg6 [7, part III, problem 23]) all the §, are inside 
the unit circle |z| = 1. Moreover, Newman and Rivlin [5], [6] have established that 
the region y” < cx has no zeros z, (no scaling) if c is any positive number such that 
ce° < 7/2; their paper [5] also contains a figure showing the location of the zeros of 
E.,(z) for n < 47. The regular spacing of the z, is quite striking as is the parabolic 
region free of zeros. Saff and Varga considered the existence in general of a 
“parabolic” region free of zeros of the sections of power series of entire functions 
and have conjectured a precise relationship between the “width” of such a region 
and the order of the entire function. In [3], with Edrei, they prove the conjecture for 
a class of functions; this work also has an extensive bibliography on this and related 
problems. 

The following indicates another aspect of the interesting geometry of the z,. 


THEOREM. If 8 is any positive number for which B < 1 — log 2 = 0.3068..., then 
ja, < e7*” 


for all sufficiently large n. 
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Thus, a, is an exponentially small sum of terms, most of which are exponentially 
large as functions of n. As a contrast we mention that it is not difficult to prove that 
v7 _,e7” does increase exponentially with n. 

The proof of the theorem is not difficult. We write 


e° = E,(z) + RK, (2), (1) 
where 
R,(z) = se 7: (2) 


The idea of the proof is roughly as follows. Let z, = x, + iy, bea zero of E,,(z) and 
consider Le *”. If x, > n(1 — log2) then e~” is small. If x, < n(1 — log2) we use 
e *»=1/R,(z,). In this case |z,| is not too large because ¢, is near I’ as above. 
Then 1/R,, (2, ) can be expanded into an absolutely convergent series of increasing 
powers of z,; the first term is (n + 1)!z>"~1. Now using the Lagrange interpolation 
formula we can show that 


h l/n! ifm=n+1 

yz" = 60 if2<m<n (3) 

=1 -1l ifm=1. 
Thus the 1 + 1 in the definition of a, arises from m =n + 1 here. Then we show 
that the contribution of terms with “large” x, and m > —1 is small and similarly 
for terms with “small” x, and m < —1. These estimations require a bound for n!, a 


bound for the coefficients in the expansion of R,(z)~! and the fact that the £) are 
near I’. Note that the points z of [ for which x = 1 — log2 satisfy |z| = 1/2. 


LEMMA. With R,(z) as above, 
1 Nh 


R,(z) 


+ Ee" 


where 


1/ 2 \ 
<= | 
ex (5) 


The series is absolutely convergent for |z| < (n + 2)/2. 

We will prove this lemma and (3) later. Now we give the proof of the theorem. 
Fix positive numbers B < y  < y < y*< 1 —- log2. Define a partition of 
{1,2,...,.n} intoSUL by vE Sif x, < ny and v€ L if x, > ny. Note that by 
Buckholtz’s results, 


=e" 40(1) (l<v<n). (4) 


Trivially 

le*|<e"™ (veL), (5) 
and by (4), 
|>ney “' (veEL) (6) 
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and 
: n 
l<nev' <> (ve S) (7) 
for sufficiently large n. From (5) we see that 


1 
<ne"Y< 5 7B (8) 


for large n. By (1), (7), and the lemma, 


Kem = LR, (z,) =(n+ IPL Yo eget} (9) 


ves ves ves k=0 
where cy = 1. By (3), 


n n 


1 
» » c,.zh — nt — Cy » » c,.zp* (10) 


k=0 ves , k=0 vEL 
Then by (9) and (10) 


Ye *=(n+1)+ Ye *—-(n+1)! 
p=] ve L 


. (11) 
x {eu 4 E Eq 5 F each) 


vel k=0 veS k=n+4+1 


Now by (6), (7), and the lemma, we can bound 


n o @) 
C, + » ae a » » cC,.zp 


vel k=0 vES k=n+1 
from above by 
Q\n n Q\k 7 ne co Q\k . ne 
(=| +ny (=) (nev ~1)" ‘hn (=) (nev -1)" 
n k=0\7 k=nt1\" 


o(2) + ten ter . 


- | n_” 1 

= _— + a 

n (ev -1y"" 1—2erY-!} 

since eY ~! < 1/2. It is easy to show that (n + 1)! < n?(n/e)"”. Thus by (5), (11), 
and (12), 


1 nee"? 


4\n 
< -e "B+ (=| bo (13) 
2 e eY “(1 - 2e7"-!) 
if n is sufficiently large. Since (2/e) < e~* and y~> B, the right-hand side of (13) 
is <e "9 if n is sufficiently large. 
Now we prove (3) and the lemma. For any polynomial Q(z) of degree <n — 1, 


Q(z) = E,(z) b a 


ye? — (n+ 1) 
v=] 
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since both sides are polynomials of degree <n — 1 which agree at the n points 
Z1,---5 Z,- We observe that 


gh 


Ex(z,)= -—. 
n!\ 
We obtain the first formula in (3) by taking Q(z) = 1, z = 0. The second formula 
follows from the choice Q(z) = z*, z = 0 for each of k = 1,2,...,n — 1. Finally, 
the third formula of (3) is a consequence of the fact that the numbers z;! are roots 
of z?+2z"-!/1! + ---+1/n! so that their sum is —1. 
To prove the lemma we write 
~1 


n+l zk 


(n+ 1)IR,(z) 
and expand the right-hand side as a geometric series. This is legitimate if |z| < 


(n + 2)/2, since then 
k 
<E (a) 3 


(n+ 1)! x 
The power series we obtain for f(z) is majorized by 


1+ Efiney reser 


(n+1+k)! 


f(z) = 1+ (n+ 1)! x 


,(n+1+k)! 


zk 


wus 


which in turn is majorized by 


1=1\ k=1 (n + 2) 
Since 
of & 0° w \f l-w w 
1+ k) =1+4 | ) - =1+ 

py Xm py l—-—w 1 — 2w 1 —2w 

1 @ 
=1+— ) 2*w*, 

2 ea 


the lemma follows. 

We remark that R,(z) has no zeros z with |z| < n + 2. This fact can be proved 
exactly as the Enestrém-Kakeya theorem mentioned earlier. Thus the series for 
z"+!/R (z) actually converges absolutely for |z|< n+ 2. Since the z, satisfy 
a < n Wwe have the formulae 


Yet YA/Ry(s,) = m+ 1— (4 ley + (M+ IED Cosas Dat (14) 


vp=1 
and 


(15) 


Thus, it is possibly the case that a, is asymptotic to —(n + 1)! c, but it is not clear 
how to better estimate c, for k < 2n and so prove this. 


lim sup|c,,|'/" < , 
mo n+2 
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Letters to the Editor 


Editor: 


The method used in Norwegian books (e.g., [1]) to prove the reflective property 
of a parabola seems more direct than that used by Robert Williams [2]. Referring to 
Williams’s diagram, let Q by the y-intercept of /,. Then, since /,’s slope is x)/2c 
and the parabola’s equation is y = x*/4c, Q = (0, —yp). It follows that OFPD is a 
parallellogram (where D = (x,, —c)), whence its diagonal QP bisects 2FPD and 


a= p. 


REFERENCES 


l. A. Segaard and R. T. Lyche, Mathematikk III for Realgymnaset, Glydendal Norsk Forlag, Oslo, 
1948. 
2. R. Williams, A proof of the refective property of the parabola, this MONTHLY, 94 (1987) 667-68. 


Ragnar Dybvik 
N-6630 Tingvoll 
Norway 


Editor: 


With reference to your recent editorial ‘Strings, Substrings and the Nearest 
Integer Function’ (Amer. Math. Monthly, 94 (Nov. 87) 855-860), I note that in 
your second example you express the nth Fibonacci number F, using the nearest 
integer function. It may be of interest to note that F, can be expressed recursively 
using the nearest integer function; viz., F,, is the nearest integer to the geometric 
mean of F,_, and F,,,. (See my Lemma 1 on diophantine defining relationals in 
Abstracts AMS, 8 (Oct. 87) 437-438.) Indeed, there exist infinitely many recursions 
of this type; e.g., F, is the nearest integer to the geometric mean of F,_,, F,_,, and 
F 49: 


Albert A. Mullin 
506 Seaborn Drive 
Huntsville, AL 35806 
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Editor: 


I read with interest your paper in the November 1987 MONTHLY about strings. 
Concerning your quest for “a legal English word that has initial and final proper 
substrings of length 5 or more that are the same,” here are some suggestions. 

1. UNDERFUNDER. With all the complaints we hear nowadays about people 
and institutions being underfunded, surely the responsibility lies with an 
underfunder or two. If you do not like this word, you certainly won’t enjoy 

2. OVERSHOVER. Haven’t we all been asked occasionally to shove over? The 
person making the request is an overshover. 

3. UPPERSUPPER. Meal served to people in the upper berths of combination 
dining /sleeping cars, perhaps. 

4. INTERSPLINTER. I give up, I can’t think of a remotely plausible definition 
for this one. 


Gerald Myerson 
Macquarie University 
New South Wales 
Australia 


[Dr. Jim Saxe supplies “UNDERGROUNDER’”, which is actually in the Oxford 
English Dictionary.— Ed.] 


Editor: 
Shanks’ Identity is Elementary 


The identity in question, discussed in [1], is 
V5 + y(22 + 2/5) = y(11 + 2/29) + (16 — 2/29 + 2/{55 + 10/29}. 
An elementary proof of the identity follows from the observation that for positive 
x, y the quantities 
A=xy + V(x? + 2xy +y*}, 
B=y+y{x?(1+y*) + 2xy-x} 


are both disguised forms of x + y + xy. Taking x, y = y(11 + 2y29)—x with 
minus, y with plus—we find that A reduces to the left member of the identity and 
B to the right. 

I discovered this in the fall of 1984 and learned six months later of a similar 
discovery by R. Bundy. The identity has been subject to a complicated treatment in 
a recent article [2]. , 
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Editor: 


In his Note “Pointwise Limits of Analytic Functions” (this MONTHLY, 90 (1983) 
391-394) Kenneth R. Davidson reviews the classical theorems needed to prove the 
basic result that if f, is a sequence of analytic functions on an open subset Q of the 
plane which converges pointwise to f on 2 then there is a dense open subset Q, of 
(2 such that the convergence is uniform on compact sets (u.c.c.), and f is analytic, in 
(9. On the relatively closed nowhere dense set C=Q\Q, f is merely the 
pointwise limit of continuous functions (that is, in Baire Class 1, or BC1). In the 
concluding section “What Limits Are Possible” he gives an argument purporting to 
prove that given any such sets 2, Q5, C = \ Q, and any f analytic on Q, and BC1 
on C, then f=lim f, for some analytic function sequence f, on Q with u.c.c. 
convergence on {2,. Unfortunately the truth is much more complex than this, and, 
despite the statement in the opening paragraph that, “The results in this paper... do 
not appear to have been collected together before,” there is a literature going back 
to 1901 and culminating 1 in M. Lavrentiev’s comprehensive treatment of the topic in 
[8] of 1936. 

W. F. Osgood [1] proved that if analytic f, — f in a region Q then / is analytic 
on a dense open subset of { and showed “by methods due to Runge, Acta 
Mathematica 6 (1885) 229” that “the regions of analyticity may be infinite.” P. 
Montel [2] gave a construction to show that f could be continuous yet fail to be 
analytic on the whole of Q. Later in [3] (Chapter V, pp. 108-122) Montel proved the 
u.c.c. convergence of f, to f on a dense open subset 2, of & and asked about the 
structure of C = \ Q, and whether any f analytic on 2, and BC1 on C could be 
represented as the limit of a sequence f, of analytic functions on 2. Here Q was 
assumed to be a simply connected domain and Montel established the simple 
necessary condition on C that the components of 0, be simply connected. Indeed, if 
a contour I’ c Q, and z © interior I’, then 


fw) 


w—z) 


1 
f(z) = lim f,(z 2) = lim 5 | Oe 2) 


uniformly in z bounded away from I, by dominated convergence of f, on [, 
whence interior [ C 2). Montel’s problem, as it came to be called, was finally 
solved in papers of F. Hartogs & A. Rosenthal [6, 7] and M. Lavrentiev [4, 5, 8]. 
Extensions to the analogous problem of sequences of harmonic functions in 3 real 
variables were given by M. Keldysch & M. Lavrentiev [9, 10]. 

The necessary and sufficient conditions on C and f are rather complicated and 
the reader is referred to the literature. For some C Davidson’s argument does work 
and f can be taken arbitrary analytic on 2 \ C and arbitrary BC1 on C. For other C 
there is a more restricted class of limit functions f which have the characteristic 
property that the values of f|. partly determine the values of f|>. The following 
counterexample (not in the literature) shows that even if Montel’s necessary 
condition is satisfied there are cases where C does not allow any limit function f at 
all. 


THEOREM. Let be a simply connected domain and C a relatively closed nowhere 
dense subset of such that Q) = Q\ C is a disjoint union of open discs. Then there 
does not exist any function f on {1 such that f = lim f,, for a sequence of analytic 
functions f,, on Q with Q, as the set of u.c.c. convergence. 
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Proof. Suppose an f and sequence f, did exist. Then f,,|C is locally bounded on 
a dense open subset of C (as in Davidson’s Proposition 1, p. 393) so there is an open 
disc D = D(z), r) C & of radius r and centre z) © C, and an M > 0 such that 
|f,(z)| < M for all n and ze DOC. Let D* = D(Zp, 5r). 

Any disc S of Q, of radius < {r which meets D* has |f,|< Mon OSE DNC 
so that |f,| < Mon S UdS by the maximum modulus principle. Such discs S exist 
because any point of C, in particular Zo, is a limit of arbitrarily small discs of Q). At 
most a finite number S,,..., S, of discs of Q, of radius > 47 meet D* and thus 
fal < Min R = D*\ S\\S)\ +++ \Sy. 

It follows that int R C Q, (e.g., by Davidson’s Proposition 3, p. 393), but it is 
clearly geometrically impossible for int R to be a disjoint union of open discs or 
parts of such discs intersected with D*. So after all f and f, do not exist. Q.E.D. 
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NOTES 


EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND ANITA E. SOLOW 


On Continued Fractions and a Certain Example of a Sequence 
of Continuous Functions 


J. FABRYKOWSKI* 
Department of Mathematics, University of Alberta, Edmonton, Alberta, Canada T6G 2G1 


In this note we shall construct an interesting example of a sequence of continu- 
ous functions f(x) © C[0,1] such that 


, (x) +oo if x 1s irrational (+) 
= * 
n> oo In O(1) if x is rational. 


We require some basic facts from the theory of continued fractions, which we will 
not prove here. There is an extensive literature where one can find more details (see, 


e.g. [1], [2], [3}). 
Define a finite continued fraction as any expression of the form 


1 
Ag + , 
a, t+ , 
a, + ‘ 

az, + 

1 

+ — 

a 


where a,’s are real numbers and a; # 0 for 1 < i < n. For case or reference we will 
designate the above continued fraction by the symbol 


(Ay, Q1,..-,4,). (1) 
The numbers do, a,, a,,... are called the quotients of the continued fraction (1) 
and any expression C, = (do, a),..., 4,),0 < t < nis said to be the ¢th convergent 


to the continued fraction. If all quotients are integers and moreover a; > 0 for 
i > 1, then (1) is called a simple finite continued fraction. 
We have the following: 


THEOREM 1. Let (1) be a simple finite continued fraction. Let the two finite 
sequences of real numbers p, and q, be defined inductively by: 


Po = 40; Pi = a,ay + 1, Pr = 4;Pt-1 + Py-2> 2< r<n, (2) 


*Supported by NSERC Grant #A3062. 
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and 
qo = 1, q, = 4), Gr = 4:9;-1 + Y-2> 2<t<n. (3) 
Then the following facts hold: The p;’s and q,’s are integers, the q,’s are positive 


and 


Pr 


Ga 7 for0O<t<n. (4) 


t 
g, >t, with inequality when t > 3. (5) 
The sequence of even convergents C,, is strictly increasing with ¢ (t >-0), while 
the odd convergents C,,,, strictly decrease. Also every odd convergent is greater 
than any even convergent, that is, 


O<O,< G,< +: <OG<O,<C,. (6) 
THEOREM 2. Let do, a), a5,..., be an infinite sequence of integers, all positive 
except possibly ay. If C, = <ao, a1, 4z,..., a,), then lim, _, ,, C, exists and is finite. 


Theorem 2 justifies the following definition: 


DEFINITION. Let do, a), a@,,... be a sequence of integers, all positive except 
possibly a,. The expression (dg, a), a>,...) is called an infinite simple continued 
fraction and is defined to be equal to the number lim, _,,,C,, where C, = 
(Ao, Ay,+.-, a,)- 


It may be proved that Theorem 1 holds as well for a simple infinite continued 
fraction. Consider an irrational number x. We now show how to find representation 
of x as a simple infinite continued fraction. We write x = a )+ &), where ay 
= [x] — the greatest integer < x and 0 < &, < 1. Let 1/&, = aj, sox =a) + 1/aj. 
Define a, = [a{], thus aj = a, + §,, where 0 < &, < 1. Similarly, let 1/&, = a4 and 
define a, = [a5], so af =a, + &, and x =a,+1/(a, + (1/a$)). Continuing this 
procedure, we obtain an infinite sequence at integers: do, a), a,,... with a; > 1 for 
i > 1 and an infinite sequence of real numbers a/ (i > 1) such that 


X = (Ao, Aj) = (Ao, 4, 43) = °° = (Ag, Ay, - 665 Ay, Angi) = 0° 


It is well known that if C, = (ao, a),..., a,,), where dg, a,,..., a, are found by 
means of the above algorithm, then x = lim, _,,, C, and that the representation of x 
as an infinite simple continued fraction is unique. The convergents to any irrational 
number have a certain characteristic property, described in the following: 


THEOREM 3. Let x be an irrational number and a/b rational with (a, b) = 1 and 
b> 0. If 
. 1 
< en 

2b?’ 


a 
b 


xX 


then a/b is one of the convergents to x. 


Now we are in a position to construct the sequence of continuous functions f,(x) 
satisfying (*). 
Let 7,, %, 73,-.. be enumeration of the rationals in [0, 1] in the following manner: 
112313 123 4415 1 


O,1, =, 5 5 Do De Do Dee: 
7" 2° 3° 3° 4° 4° 5° 5° 5° 5° 66° 7 


1988] NOTES 539 


We shall show that the sequence 


1/k 

f(x) = E {TL - , n=1,2,3,... 
k=1\ j=l 

has the desired property. It is obvious that for every n > 1, f(x) © C((0,1]). For 

any rational x there exists a number ny = n(x) that f(x) =f,,(x) for n > no; 

therefore, for rationals x, lim,,_,,, f,(x) = f,,(x), thus it is finite. 

Suppose now that x is irrational. By enumeration of r, it follows that if we write 
r,;=u,/v;, where u, and v, are positive integers such that (u,;,v;) = 1, then 
obviously v,; <j and, therefore, by (5) none of r, is a convergent to x. From 
Theorem 3 it follows that 


u, 1 1 (7) 
—prl= —-—-—|>-—> >. 
I~ 7 v,| 2ve QZ? 
We have the obvious inequality 
k 1/k 
2j|x — 1, > in 2j|x — rl, 
1 j|x a} ymin 2 j)x — 7) 
so by (7) 
k ve 1 11 
2jlx—/7r, >2j—- =-2>-7. 8 
{U jx i isp 77% (8) 


Therefore, if x is irrational, 
n 1 
f(x) > » k’ thus lim f(x) = 0. 
k=1 n— © 


Related to our example we mention a theorem that goes back to W. F. Osgood (see 
[4}). 


THEOREM. If a function f(x) can be represented as a limit of an everywhere 
convergent sequence f,(x) of continuous functions, then f(x) is continuous except at a 
set of points of first category. 


Since the set of rational numbers is of first category and the set of irrational 
numbers is then of second category (see [4]), therefore, it is not possible to have an 
example of a sequence f,(x) of continuous functions such that 


, _ { +00 if x is rational 
n> 00 I(x) = O(1) if x is irrational ° 


Another example of a sequence of continuous functions f(x) satisfying property 
(*) was found by Professor A. Meir. 
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Using Pythagorean Triangles to Approximate Angles 


W. S. ANGLIN 
Department of Mathematics and Statistics, McGill University, Montreal, PQ, Canada H3A 2K6 


A Pythagorean triangle is a right triangle with integer sides. A well-known 
example is the triangle with sides 3, 4, and 5. Unfortunately, the angles of these 
triangles are not as “nice” as the sides: whether expressed in degrees or radians, 
they are always irrational. 

If a Pythagorean triangle has an angle of A degrees and |A — 20| < 1/100, we 
shall say that the triangle “approximates 20 degrees to within one hundredth.” Let 
B be any given angle in degrees with 0 < B < 90. Let e be any real number such 
that 0 < e < l,e < B, and e < 90 — B. The question we answer in this note is: 
what is the smallest Pythagorean triangle which approximates B degrees to within 
e? 

We begin by defining “smaller.” If (a, b, c) and (a’, b’, c’) are two Pythagorean 
triangles with sides a < b < cc and sides a’ < b’ < c’, then (a, b,c) is “smaller” 
than (a’, b’, c’) just in case (i) c < c’ or (ii) c= c’ and b < b’. Roughly speaking, 
this means we order the triangles by hypotenuse. For example, the triangle (3, 4, 5) is 
the smallest of all Pythagorean triangles since it has the smallest hypotenuse. 

Pythagorean triangles whose sides are relatively prime are called “primitive.” For 
the purpose of finding a smallest triangle approximating a given angle, we need only 
consider primitive triangles. We thus confine our attention to them. Since exactly 
one of the two smaller sides of a primitive Pythagorean triangle is even, the 
following well-known theorem gives a complete characterization of these triangles. 


THEOREM 1. a? + b* =c? with a an even integer and a, b,c relatively prime 
integers iff for some relatively prime positive integers u and v with v < u and u, v not 
both odd, 


a = Qu, b =u? — v’, and c=u’+0?’. 


For a proof the reader may consult [1, pp. 138-139]. 

Roughly speaking, we shall approximate a given angle B by means of approxi- 
mating its tangent or cotangent. In this connection we need the following defini- 
tions. 


X = tan(B — e) + sec(B-e) 
Y = tan(B + e) + sec(B + e) 
X’ =cot(B +e) + csc(B +e) 
Y’ = cot(B — e) + csc(B — e). 
We now have the “Approximation Theorem.” 


THEOREM 2. Let u and v be positive integers. Then X < u/v < Y iff (2uv, u? — 
v*, u* + v?) approximates B to within e by means of the angle opposite u? — v? and 
X’ <u/v < Y’ iff (Quv, u* — v?, u? + v*) approximates B to within e by means of 


the angle opposite 2uv. 
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Proof. Suppose X < u/v < Y. The given conditions on B and e imply that 
1 < X and, hence, v < u (since X = (sin(B — e) + 1)/cos(B — e)). Since X < u/v 
< Y, it follows that —1/X < —v/u < —1/Y. Adding these inequalities, we obtain 
X — 1/X < (u? — v*)/w < Y—1/Y. Moreover, as a straightforward calculation 
will show, X — 1/X = 2 tan(B — e) and Y—1/Y = 2tan(B + e). Hence, we ob- 
tain tan(B — e) < (u? — v?)/2uv < tan(B + e). Hence, in (2uwv, u? — v*, u* + v?) 
the angle opposite u* — v is between B — e and B + e degrees. 

Suppose now that the right-hand side of the first equivalence holds. Then where 
z = u/v we have 2 tan(B — e) < z — 1/z < 2 tan(B + e). Hence, z* — (2 tan(B + 
e))z —1<0 and 0 < z* — (2tan(B — e))z — 1. The first of these inequalities 
implies that z is between the two roots of z? — (2tan(B + e))z — 1. Thus z < 
tan(B + e) + ytan’(B +e) +1 = tan(B + e) + sec(B + e) = Y. Similarly, from 
0 < z* — (2tan(B — e))z — 1 it follows that X < z. 

The proof for the second equivalence is similar. 


Finally, we have the “Ordering Theorem.” 


THEOREM 3. Let V be the smallest positive integer such that [XV| +1 < YV and 
let U=[XV]+ 1. Then QUV,U? — V?,U? + V7) is the smallest Pythagorean 
triangle approximating B degrees to within e by means of the angle opposite the odd 
side. 

Moreover, the corresponding result holds for the “even side case” (with X’, Y’ 
replacing X, Y, respectively). 


Proof. Since X < U/V < Y, the above triangle does approximate B degrees to 
within e by means of the angle opposite U? — V’. 

Let T = (2u’v’, u’? — v’”, u’* + v’*) be the smallest such triangle. Then X < 
u’/v’ < Y and, hence, [Xv’] + 1 < Yu’. Thus V < v’. If U <u’, then U? + V? < 
u’* + v’* against T’s minimality. Hence, u’ < U. Since Xv’ < u’, it follows that 
U=[XV]+1< [Xv’] +1 <u’. Thus u’ = U. By T’s minimality, v’ < V and, 
hence, v’ = V. Thus the two triangles are the same. 

The proof for the corresponding “even side result” is similar. 


For example, suppose we want the smallest Pythagorean triangle approximating 
20 degrees to within 1/100. We compute X = 1.42788278, Y = 1.42841330, X’ = 
5.66838919 and Y’ = 5.67417730. By straightforward testing (on a programmable 
calculator) we find that V = 208 is the smallest positive integer such that [XV] + 1 
< YV and V’ = 46 is the smallest positive integer such that [X’V’] +1 < Y’V’. 
Let U = [XV] + 1 = 297 and let U’ = [X’V’] + 1 = 261. The smallest triangle to 
approximate 20 degrees to within 1/100 by means of the angle opposite the odd 
side is (2UV, U? — V*, U* + V*) = (123552, 44945, 131473) and the smallest 
triangle to approximate 20 degrees to within 1/100 by means of the angle opposite 
the even side is (2U’V’, U’*? —V’?, U’? + V’) = (24012, 66005, 70237). The 
second is smaller. 


I thank Professor J. Lambek for his kind help and encouragement. 
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Convergence-Preserving Functions 


GERALD WILDENBERG* 
Dept. of Mathematics, St. John Fisher College, Rochester, NY 14618 


It is common for calculus texts to include as an exercise the question of whether 
the convergence of Xa, (with a, real) implies that La? also converges. If La, is 
absolutely convergent then the comparison test applied to the tails of La, and La? 
will show that La? is convergent. But if we do not have absolute convergence it is 
easy to come up with a counter-example (e.g., a, = (—1)”"/ vn ). 

The situation is more interesting if we replace a? with f(a,,) and then ask: Does 
the convergence of Xa, imply the convergence of Uf(a,,)? If f is x? or arcsin x, or 
x/(1 + |x|), we have questions that will make most mathematicians pause. In this 
note we answer, negatively, the question as to whether there are any nontrivial real 
functions for which Xa, converges only if Lf(a,,) converges. 


DEFINITION. A function on the reals, f, is convergence preserving (abbreviated 
CP) if for every convergent series Ya,, the series Lf(a,,) also converges. For 
example, a constant multiple function, f(x) = kx is CP. As we shall show, there are 
essentially no other CP functions. 


The following properties are immediate. 

(1) If f is CP, then f(0) = 0. (Consider the 0 sequence.) 

(2) If f and g are CP, so are f+g, kf, f°g and, therefore, h, where 
h(x) = f(kx). (Here k is a constant.) 


LEMMA 1. If f is CP, then there exists a real number M and an ¢ > 0 such that 
f(x)/x <M for0<x<e. 


Proof. If not, then for each n we can find x, with 
O<x,<1/n? and f(x,)/x,> Nn. 


Define j,, to be the least integer that is > 1/(n*x,). Then clearly j,x, < 2/n?. 
Now consider the series 


Xp PX, tte FX FX FX tee XQ Hee 
Ed a_i” 


j, times j> times 


This converges by comparison with £2/n?. 
When we apply f to each term of the series and sum the first n blocks we get 


Lint (Xn) > LX, > LAs, 
which diverges, contradicting that f is convergence preserving. 
LEMMA 2. If f is CP, then f is differentiable at 0. 


Proof. Consider D*f, Df, D~f, D_f, the upper and lower derivatives of f from 
the right and left at 0. (So, for example, D,f is the lower-right derivative of f at 0, 
that is, the lim inf of f(x)/x as x — 0°.) We claim that D_f > D*f. 


*The author thanks John Kennison for his suggestions regarding this paper. 
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Suppose instead that D_f<s<t< D*f< oo. Then let {d,} be a monotonic 
decreasing sequence with 0 < d, < 2~”, but f(d,)/d, > t for all n. 
Similarly, let {e, } be a monotonic increasing sequence with 


—2°-"<e, <0 and ff(e,)/e,<s forall n. 
We can further assume that 
d, > |e;| > 4, > |e,|> °°: 
We now construct a series whose underlying sequence is 
{a,} =d),..., d,,€),-.., €;,d5,.-., dy, €o,... . 


The number of d, in each block, say j,, is defined by letting j, be the smallest 
integer for which j,d, > 1/i. Similarly, k; is the smallest integer such that k,e, < 
—1/i, where k, is the number of e,’s. The series La, is convergent, but compare 


Lf(a;) to 
> (4,4, + sk en) = V(t _ S) indy + Ys i, + k,e,)- 
(Note that L(t —s)j,d, diverges and s(Xj,d, + k,e,) converges.) We see that 
~f(a;) is divergent. From this contradiction we conclude that D_f > D*f. Simi- 
larly, considering f(—x), D,f > Df. Therefore, 
D,f>Df>D_f>D*f>D,f. 
So f is differentiable at 0. 


LEMMA 3. Suppose that f(0) = f’(0) = 0. Let A, B,e > 0 be given. Then there 
exists e > 0 and a positive integer k such that 


lel}<e and A<kle|<A+t+e and k|f(e)|<B. 
There also exists e < 0 with the same properties. 


Proof. We may as well assume that e < 1. Choose e with | f(e)|/e < B/(A + 1) 
and 0 < e < «. Let k be the smallest integer with k > A/e. Then 


k| f(e)| < Bke/(A +1) < B(A + )/(A +1) < B. 
The case with e < 0 is essentially the same. 


THEOREM. f is CP iff there exists a constant k with f(x) = kx in a neighborhood 
of 0. 


Proof. Assume that f is CP. We may as well assume that f’(0) = 0, otherwise we 
could replace f(x) by f(x) — kx, where k = f’(0). Using the indirect method, we 
further assume that f(x) is not identically zero in any neighborhood of 0. Then 
we can obtain a sequence {d,,} with 


0 <f(d,) < d,<27". 


[Note that | f(x)| is less than |x| near 0 as f’(0) = 0. We can require d,, and f(d,,) 
to be positive, otherwise f(x) can be replaced by f(—x) or —f(x) or —f(— x).] 
Let j,, be the least integer for which j, f(d,,) > 1/n. From Lemma 3, there exists 
e, with —-2°"<e, <0 and k, with jd, <k,|e,|<Jj,d, +27” and k,|f(e,)| < 
L/n’. 
Consider now a series whose underlying sequence consists of d; and e,, inter- 
leafed where j, and k,, are the number of terms in the nth block of d, and e,, 


544 DAVID W. BOYD [June-July 


respectively. For each i we interleaf the d, and e; so that the partial sums within the 
ith block of d,; and e; do not exceed 2~”*! in absolute value. Since the ith block 
sums to j,d, — k,|e,|, the series is convergent. Applying f we get a series where the 
ith block sums to j,f(d;) + k,;f(e;). This series diverges, since k, f(e,;) is convergent 
but j,f(d;) is not. This contradicts the CP property for f, proving the proposition. 
(The converse is, of course, obvious.) 


Remark. While the main theorem is apparently a negative result, it does guaran- 
tee the existence of many series having strange properties. For example, there is a 
series {a;} for which La} diverges but © tan a, converges, otherwise (arctan x)? is 
CP. Similarly, there is a {b,} with Xb? convergent and ¥ tan b, divergent, else tan 7Vx 
is CP. 


The Diophantine Equation x? + y” = 77" 


DAVID W. BoYD 
Department of Mathematics, University of British Columbia, Vancouver, Canada V6T 1Y4 


We will show here that the equation of the title has infinitely many primitive 
solutions in integers provided m and n are relatively prime. In fact, our proof gives 
a procedure for finding all solutions. 

The result requires little more than unique factorization and thus could be 
presented at an early stage in a course in elementary number theory. It would be 
enough to present a couple of special cases, say x? + y? = z4 or x7.+ y4 = 2° Of 
course, one would have first presented the classical equation x* + y* = z*, which is 
also a special case. 

The equation x? + y? = z* has been considered by Mathieu [1, p. 664] and Kiss 
[2]. The former exhibited an infinite family of solutions by using the Pell equation 
a* — 2b* = 1. The latter found all solutions with x, y, and z relatively prime. One 


of the cases he considered required factorization in the field O(/2). Thus the tools 
used in these papers are a little more sophisticated than those needed here. In 
addition, the methods do not seem capable of treating the more general equation we 
consider. 

The word primitive used in the first paragraph requires clarification. We may 
restrict our attention to solutions with xyz # 0, x > 0, and z > 0. Note that if 
(x, y, Zz) solves x7 +y"= 27", then so does (kx, k?"y,k™z) for any integer 
k>0O. A_ primitive solution (Xo, Yo, Zo) iS one which is not of the form 
(k™"x, k?"y, kz) for any k > 1. 

Each solution is thus of the form (k”"xo, k?"yo, kz 9) for some primitive 
solutidn (Xo, Yo, Z9) and some k > 1. We say that the two solutions lie in the same 
class if they correspond to the same primitive solution. We are really interested in 
classes of solutions; the primitive solution is the representative of the class which 
has z minimal. 

Note that if (m, n) # (2,1), the equation x? + y™ = z?" is not homogeneous so 
a primitive solution need not have x, y, z relatively prime. For example, (28, 8, 6) is 
a primitive solution of x* + y? = z‘4. Thus the solutions found by Kiss are not all 
the primitive solutions of this equation, as he observed. We will assume that m > 1 
and n > 1 since the cases m = 1 or n = 1 are trivial. 
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Our method is based on the simple observation that, given m > 1, any integer 
N > 1 can be written uniquely in the form N = s,s} --- s™—ju™, where S,,..., Sj,—1 
are positive, square-free, and pairwise relatively prime. (Here we consider 1 to be 
square-free and allow s,= 1). To see this, let N =IIp%‘?) be the canonical 
decomposition of N into prime powers and define 


s, =II{ p:a(p) =k (mod m)}, 


forl <k <m-— 1, where s, = 1 if the product is empty. Clearly N/(s,s3 --- s™7}) 
is an mth power. 


For convenience, write P,, for the set of (m— 1)-tuples (s,,...,5,,_,) with 
Sj5+++5 Sm— 1 Positive, square-free, and pairwise relatively prime. Now suppose that 
(x, y, Z) is a solution of 

x2 + y™ = 77" (1) 
with xyz # 0, x > 0, and z > 0. Then one can write 
z"+x = 5,53 --- sm™tu™ (2) 
with (5,,---, 5S,,-,) in P,, and u > 0. 
Since (1) can be written as 
(z" + x)(z2"—x)=y™, (3) 
the decomposition in (2) shows that 
z"—x=gmlgm-? ... 5 vm (4) 
and 
Y= S1S8q 07° Spy U, (5) 


for some integer v with sgn v = sgn y. 

Changing (x, y, z) to (k™"x, k?"y, kz) multiplies both z” + x by the factor 
km” = (k")™. Thus s,,..., 5,,-; are determined by the class of the solution. The 
class of the solution also determines the ratio u/v = a/b, say, where a and b are 
relatively prime and a > 0. Since we have assumed x and z to be positive, we have 
z" + x > |z” — x|, so (2) and (4) give the condition 


S189 °° Sqpaia™ > Sy 'Sy* +++ Syq_ |B)”. (6) 


Conversely, given any (5S,,..., 5,,-,) in P,, and any relatively prime a and b with 
a > 0 which satisfy (6), we will show that (1) has a unique primitive solution 
satisfying (2), (4), and (5), with u/v = a/b. This will substantiate our claim that 
there are infinitely many solutions and give a method for finding all such solutions. 

Thus, write u = aw and v = bw where w > 0 is to be determined. Then (2) and 
(4) give ° 


2x = (5,52 --- smila™— si)... 5) b™)w™ = Aw”, (7) 
and 

22" = (5,57 --- s™ia™+ sm}... 5) b™)w™ = Bw”, (8) 
say. Here A and B are constants determined by s,,...,5,,-,, a, and b. The 


condition in (6) insures that A and B are positive. 
Since A and B have the same parity, if (8) has a solution (z, w) then (1) has a 
solution (x, y, z) where x is given by (7) and y by (5). 
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But the solution of (8) is straightforward. If B is even, write 


B=2c,c3 --- ct id", 
where (c,,...,¢,—,) is in P,. Then (8) becomes 
z"=¢,c5 +++ cM id"w™. (9) 
In order for the right member of (9) to be an nth power, w must be of the form 
W= CH eee Cli-1k", (10) 
where 
mi, + j = 0(mod n), (11) 


for 7 =1,2,...,n—1. Since m and fn are relatively prime, this has a unique 
solution i; with 1 < i; < n— 1 for each j. 

The choice k = 1 in (10) clearly gives a primitive solution of (9) and a corre- 
sponding primitive solution of (1). 

If B is odd, write instead 


B=c,c5 +++ ct td" (12) 
so (8) becomes 
22" =c,c5 +--+ cP id"w" (13) 
for which the general solution has 
w = loch ... Cina”, (14) 


where the i, for j > 1 satisfy (1) and ig is determined by mig = 1 (mod n) and 
1 < lo < n— 1. 


Example 1. Consider first x* + y? = z*. Then there are four parameters s,, 55, a, 
and b from which we calculate 


A, B=5,s2a? + s?s,b’, 


and then solve (7) and (8). For example, the choice s, = a=b=1 and s, = 4, 
which satisfies (6), gives A, B = 16,20. Hence (8) becomes 2z* = 20w?; that is, 
z* = 2-5w?. This has primitive solution w = 2 - 5, z = 27 - 5%. Hence, x = Aw?/2 
= 24-3.-5°% and y = s,s,abw* = 4-10? giving the primitive solution 
(24 -3- 53,24 - 57,27 - 57) = (6000, 400, 100) of (1). 


Example 2. Consider x* + y* = z®. Now there are five parameters 5,, 55, 53, 4, 
and b. From these one computes 


— 6 o2e344 1 ede2e 4 
A, B = 8,83583a" + S{S55,b". 


If, for example, s; =s,=s5s,;=b=1 and a=2, then A, B=15, 17 so (8) 
becomes 2z? = 17w*. This has primitive solution w = 2-177, z = 17? and hence 
x = 2?-15-17% y = 23-17% giving the primitive solution (2? - 15 - 178, 23 - 174, 
2-173) of (1). 

The choice s, = 2, 5, = 3,5; =5,a=b=1 gives A, B = (2- 37- 5)(5* + 27). 
So B = 2- 3?- 5-29 and hence (8) becomes z* = 37-5 - 29 - w‘. This has primi- 
tive solution w = 3- 57-29%, z = 3*- 53-293, Since A = 2-33-5-7 we have 
x = Aw4/2 = 3’-5°-7-29% and y=2-3-5w7=2-33-5°- 294 giving the 
primitive solution (37 - 59-7 - 2982 » 33 - 55- 294 32. 53. 293), 
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Concluding remarks. Mathieu’s solutions to x? + y? = z4 


well known identity 


are suggested by the 


B+ 234 --- 4+n3 = (n(n + 1/2)’. 


Thus, if one takes x = n(n — 1)/2, y = n and z* = n(n + 1)/2, then x* + y? = 274. 
In order for z to be an integer, one must solve 2z* = n(n + 1) which is essentially a 
Pell equation and thus has infinitely many solutions. 

On a recent visit to U.B.C., Edwin Hewitt showed me Mathieu’s solution and 
mentioned that a computer search had revealed many other solutions not of this 
form. An attempt to understand this led to the above method of finding all solutions 
and its generalization to x? + y™ = 22”. 

A perusal of Reviews in Number Theory unearthed the paper of Kiss which 
happens to be written in Romanian but is nevertheless rather easily read. Although 
Kiss does not obtain all solutions of x? + y? = z4, his result that there are infinitely 
many solutions with x, y and z relatively prime is stronger than the statement that 
there are infinitely many primitive solutions. Thus it is understandable that more 
difficult methods are needed. 

We do not know whether the more general equation x” + y™ = z?” has infinitely 
many solutions with x, y and z relatively prime. 


Acknowledgements. I would like to thank Edwin Hewitt for posing the problem of completely solving 
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Modular Arithmetic in the Marketplace 
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1. Introduction. It is not surprising that there are many schemes that utilize 
modular arithmetic to append a check digit to product identification numbers for 
error detection. Two of the better known schemes—the ZIP code bar code and the 
International Standard Book Number (ISBN)—have been the subjects of articles in 
the UMAP Journal [2], [7]. The schemes used for the Universal Product Code (UPC) 
and passports are described in [6] and [1]. 

What is surprising to us is the diversity of the methods in use and the fact that 
some of them are poorly conceived! In this note we examine several of these 
schemes. 


2. Check digits: from money orders to library books. We begin with the least 
effective of the methods we have found and work our way up to the best one. The 
Postal Service’s money-order identification number consists of ten digits and a 
check digit. The check digit is the remainder modulo 9 of the 10-digit number. In 
contrast to the schemes mentioned in the introduction, this method does not detect 
all single errors! Indeed, excluding the check digit, a substitution of a 9 for a 0 or 
vice versa goes undetected. All single-digit errors involving the check digit are 
detectable. Thus the single-digit error-detection rate for this method is 970/990 or 
98.0%. (We assume all errors are equally likely.) Moreover, the only transposition 
errors detected by this method are those involving the check digit. That is, an error 
resulting from the transposition of two consecutive digits such as ...53... instead 
of ...35... is undetected while ...53 instead of ...35 is detected. Because the 
digit 9 can never occur as a check digit, this method detects 9 - 10? — 8 of a total of 
90 - 10’ — 8 possible transposition errors for a rate of 10.0%. 

A similar and equally ineffective method is employed on VISA traveler’s checks. 
There the check digit is the additive inverse modulo 9 of the remainder upon 
division by 9. 

Federal Express, airline companies, and the United Parcel Service use a method 
that is slightly less effective for detecting single errors but fairly effective for 
detecting transposition errors in their identification numbers. The U.P.S. identifica- 
tion number, for instance, consists of nine digits plus a check digit. The check digit 
is the remainder modulo 7 of the 9-digit number. Of course, any substitution of b 
for a in the first nine digits where |a — b| = 7 will go undetected. The single-error 
detection rate for this method is 846/900 or 94.0%. This method detects transposi- 
tion errors at the rate of (762 - 10’ — 5)/(810 - 10’ — 5) or 94.1%. 

The Chemical Abstract Service assigns chemicals a registry number together with 
a check digit calculated in the following way. The number a,a, -:- a, (k < 7) has 
appended the check digit (a,, a5,..., a,)°(k,k —1,...,2,1) mod10. All single 
errors in positions with weighting factors 1, 3, or 7 as well as the check digit position 
are detected; errors of the form a — b where |a — b| = 5 go undetected in positions 
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with weighting factors 2, 4, or 6; errors of the form a — b where |a — b| is even go 
undetected in the position with a weighting factor of 5. For k = 7, this yields a 
single-error detection rate of 65/72 or 90.3%. On the other hand, all transposition 
errors not involving the check digit are detected. The errors of the type ... ab > 
... ba go undetected when (a,, a5, 43, a4, 45, ag) * (7, 6,5, 4, 3,2) = 5 mod 10. Since 
the probability that this dot product is any particular digit is .1, this yields a 
transposition error detection rate of 62/63 or 98.4%. 

Identification numbers for banks (as appearing on checks, for instance) have 
eight digits, aya, -+- ag, and a check digit 

c = (a, a5,..., ag) - (7, 3, 9, 7, 3, 9, 7, 3)mod 10. 


For example, the bank number 09190204 gives c= 0+ 27+9+63+0+18+0 
+ 12 = 129 = 9mod 10. This method detects 100% of all single errors and 80/90 or 
88.9% of all transposition errors. In particular, the only undetectable transpositions 
are those of the form ab — ba where |a — b| = 5. The advantage this weighting 
scheme has over one involving just two distinct weighting factors such as 
(7, 3, 7, 3,7, 3,7,3) is that the former will detect most errors of the form 
... abc... > ...cba... while the latter will detect no errors of this form except 
those involving the check digit. 

The most sophisticated method we have found in use is the “Code-a-bar’’ system 
used by many libraries. Here each thirteen digit identification number a,a, +--+ aj; 
is assigned the check digit 


—( a4, 45,..., 3) + (2,1,2,1,2,1,2,1,2,1,2,1,2) — r mod 10, 
where r is the number of the digits among aj, a3, 45, A7, Ag, 44,, 4, greater than or 
equal to 5. For example, the identification number 3125600196431 yields the check 
digit 
—(64+1+4+4+54+12+04+0+14+18+6+8+4+3+2)—2= -—68 
= 2mod 10. 

This method detects all single errors and all transposition errors except 09 = 90. 
Thus the detection rate for transposition errors is 88/90 or 97.8%. As shown by 
Gumm [5], it is not possible to improve upon these rates with any system that uses 
conventional techniques based on addition modulo 10. 


3. A foolproof method. The highly effective scheme used by libraries raises the 
interesting question of whether it is possible to devise a method that will detect all 
single errors and all transposition errors with a single check digit. Actually, the 
ISBN method achieves this [7]. But it does so in an artificial way by using the 
character X to represent the possible check number of 10, which consists of two 
characters. (The method involves modulo.11 arithmetic.) 

Recently Gumm (3], [4] has discovered a group theoretical method that uses a 
single check digit and is 100% effective in detecting single errors and transposition 
erros. To describe this method we need the dihedral group of order 10, D,, 
represented in the form shown in TABLE 1, and the permutation o = 
(0)(14)(23)(58697). To append a check digit to any string of digits we “weight” the 
digits with powers of o and, using TABLE 1, multiply them and take the inverse of 
the product. For example, consider 1793. The check digit is 


(o4(1) * 03(7) *02(9) *0(3)) = (1*645#2) '=4-1=1, 
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For 17326, we obtain the check digit 
(05(1) * 0 4(7) * o3(3) *02(2)*0(6)) = (4*9*2424%9) 1) =071=0. 
To see that this scheme detects all single-digit errors we observe that an error-free 
number a,a,_1 ‘°° @,@, (where ay is the check digit) has the property that 
o"(a,)* o"~"(a,_4)* com o(a,) * ag = 0 
and, therefore, any particular factor in this product is uniquely determined by all of 
the others. Thus a single-digit error does not result in a product of 0. That all 


transposition errors are detected can be verified by showing that for all distinct a 
and b, o(a)* b # o(b)* a. For then, for all i, 


o't!(a)*oa'(b) # o'*!(b) *0'(a) 


and consequently a transposition will not result in a product of 0. 
In addition to being foolproof in detecting single errors and transpositions, 
Gumm’s method will detect approximately 90% of all other types of errors. 


TABLE 1. The multiplication table of Dg. 


* 0 1 2 3 4 5 6 7 8 9 
0 0 1 2 3 4 5 6 7 8 9 
1 1 2 3 4 0 6 7 8 9 5 
2 2 3 4 0 1 7 8 9 5 6 
3 3 4 0 1 2 8 9 5 6 7 
4 4 0 1 2 3 9 5 6 7 8 
5 5 9 8 7 6 0 4 3 2 1 
6 6 5 9 8 7 1 0 4 3 2 
7 7 6 5 9 8 2 1 0 4 3 
8 8 7 6 5 9 3 2 1 0 4 
9 9 8 7 6 5 4 3 2 1 0 


For the purpose of comparison, we summarize the calculations given above in 
TABLE 2 and average the two rates. However, this average does not represent the 
detection rate for either one of a single-digit error or a transposition error unless 
these two types of errors are equally likely to occur. 


TABLE 2. Summary of error-detection rates. 


scheme single error rate transposition error rate average 
U.S. Postal Service, VISA 98.0% 10.0% 54.0% 
Airlines, U.P.S. 94.0% 94.1% 94.1% 
Chemical 90.3% 98.6% 94.5% 
Bank 100% 88.9% 94.5% 
Library 100% 97.8% 98.9% 
Gumm 100% 100% 100% 
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On a Property of x"e* 


GABRIEL KLAMBAUER 
Department of Mathematics, University of Ottawa, Ottawa, Ontario, Canada, KIN 6N5 


Let f(x) = x"e~*, where x > 0 and n > 2. The function f is increasing in the 
interval (0, n) and decreasing in (n, 00) and has points of inflection for x = n + Yn. 
My colleague Ian Iscoe of the University of Ottawa asked me to find a simple proof 
that f(n + Vn) > f(n — yn). I wish to show here that, more generally, f(n + h) > 
f(n —h), where 0 <h <n. 

Indeed, the inequalities 

(n+ h)"e"~* nt+h 2h 


—_—___—___- > ] d | 
(n —h)"e~"*? ee hon 


are equivalent. The latter inequality can be proved as follows. Note that 


In{(n+h)/(n —h)} 


is the area of the region H below the hyperbola y = 1/x and above the interval 
[n —h,n +h] on the x-axis. On the other hand, 2h/n can be interpreted as the 
area of the trapezoidal region T below the tangent line to y = 1/x at x =n, the 
midpoint of the interval [n — h,n + h], and above the interval [n — h,n + h] on 
the x-axis. Observing that y = 1/x is concave upward for x > 0, it is apparent that 
the region T is contained in the region H and so the area of 7 is smaller than the 
area of H. 


Noncentral Difference Quotients and the Derivative 


P. P. B. EGGERMONT 
Department of Mathematical Sciences, University of Delaware, Newark, DE 19716 


We all have proved at one time or another the result that if a function f, defined 
on a neighborhood of the origin, is differentiable at the origin, then the central 
difference quotients converge, i.e., 


h)-—f(-h 
aan Lt) — Hoh) 
h—0 2h 
exists and equals /’(0), but that the converse is not true in general, the counterexam- 
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ple being f(x) = |x| (see, e.g., [2, Problem 27.0] or [1, Problem 5.20]). What can be 
said about noncentral difference quotients? For instance, does the existence of the 
limit 

h) — f(-—2h 
im LL) a) 


h>0 

imply that f is differentiable at the origin? To avoid trivialities, we must assume 
that f is continuous at the origin. Note that f(x)'= |x| is not a counterexample in 
the noncentral case. 

The surprise is that the existence of the limit (1) does indeed guarantee the 
differentiability of f. This simple result does not appear to be well known, which is 
amazing since the proof is entirely within the scope of standard advanced calculus. 
Results in the literature mostly deal with consequences of one-sided limits, (see, e.g., 
[3], [4], [5]). As a general reference, see [6]. We prove the following. 


THEOREM. Let f, defined on a neighborhood of 0, be continuous at 0, and let 
a# +1. If the limit 


_ f(h) flan) | 
im Goan” 
exists, then f is differentiable at the origin. 
Proof. Let 
f(h) — f(ah) 
D(a, h) = ~(l—ajh 


The proof is simplified if we could take L = 0 and 0 < a < 1. Can we? Observe 
that (i) if L #0, we can consider f(x) — Lx; (ii) if a= 0, we just have the 
definition of differentiability; (ai) if |a| > 1, we can take as our starting point 

_ f(h) — flh/a) 


lim D(a, h = im —-——_.—_, 
h—»0 (4, h/a) hoo (l—a™')h 


for which the new constant a is the old a~'; (iv) if a < 0, we can consider D(a’, h) 
since 
D(a,h) aD(a, ah) 

l+a l+a 


We may thus assume that L = 0 and 0 <a <1. 
Observe that 


D(a?,h) = 


f(h) — f(a"h) 
~ (T—a\h (2) 
(1 -—a)h 
Let e« > 0. There exists a 6 > O such that for all h with 0 < |h| <6 we have 
|D(a, h)| < e. But then for all j >0 we have |D(a, a’h)| <e as well. Conse- 
quently, for all 0 < |h| < 6 and for all n > 1, using (2), 
f(h) — f(a"h) 
(1 -—a)h 


J 


n-l 
> a’D(a, ath) = 
j=0 


<(l-a")e/(1- a), 
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from which we obtain by the triangle inequality that 
f(h) — f(0) f(a"h) — f(0) 
h h , 
Since f is continuous at 0, and 0 <a<1, we can choose n so large that 


lf(a"h) — f(O)| < elh|. Note that » may depend on h. Thus, we have shown that 
for all h with 0 < |h| < 6, 


<erct 


f(h) — f(0) 
= 10), 


and we are done. Oo 


In the same vein, we may ask under which conditions on the function g(h) is the 
differentiability of f at 0 implied by the existence of the limit 


fh) = f((0)) 
hoo h—g(h) — 


A sufficient condition appears to be that |g(h)| < |A| and g(h)g(g(h)) always has 
the same sign. (Just imitate the proof of the theorem with a”h replaced by g,(h), 
the n-fold composition of g with itself.) So the anwer to the above question is yes 
for g(h) = sin(h), but what if g(h) = s(h)sin h where s(h) takes on the values +1 


and —1 in some “random” manner? 
Acknowledgements. Thanks are due to the referee for a job well done. 
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E 3269. Proposed by M. J. Pelling, University College, London, England. 


For what positive integers n does there exist a permutation (X,, xX2,...,X,,) of 
(1,2,...,) such that the differences |x, — k|, 1 < k <n, are all distinct? 


E 3270. Proposed by Berndt Lindstrom, University of Stockholm, Sweden. 


Determine those positive rational numbers m such that 
1 
—arctanym 
T 


is also rational. 


E 3271. Proposed by Walter Noll and Joseph C. Keane, Carnegie Mellon Univer- 
sity. 


For each r © Q@ (the set of all rational numbers), denote the denominator of the 
representation of r as a reduced fraction by 5(r). For each a & R (the set of all real 
numbers) consider 


a(a) = 2, (8(r)) *, 


res 
where 
={reQ|0<rsl}. 


Show that o(«a) is finite if and only if a > 2 and evaluate o(a) when a > 2. 
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E 3272. Proposed by K. I. Appel and C. G. Jockusch, Jr., University of Illinois at 
Urbana-Champaign. 


It is well known that 4(n? + 2n — 3) multiplications /divisions are required to 
find the determinant of an n X n matrix by Gaussian elimination, while obviously 
the permutation expansion requires (n — 1)n! multiplications. Obtain a formula in 
closed form for the number a, of multiplications to evaluate an n X n determinant 
by the cofactor method. The formula should not involve summations but may use 
the greatest integer function. 


E 3273. Proposed by Orrin Frink, Pennsylvania State University. 


Give appropriate conditions under which a simple closed curve in the plane 
contains three points which form the vertices of an equilateral triangle. For the 
corresponding problem for the square see Problem 4325 [1949, 39; 1950, 423] and 
Victor Klee, Some Unsolved Problems in Plane Geometry, Math. Magazine, 52 
(1979) 131-145. 


E 3274. Proposed by David Newman, Beer Sheva, Israel. 


Fix a positive integer k. Define f(n) on positive integers by f(n) = 1 for 
n<k+1 and f(n)=f(f(n —1)) 4+ f(n — f(n — 1)) for n >k +1. Define the 
sequence F by F, =1form<k and F, = F,_,+ F,_, for m>k. (Note that 
the F are powers of 2 when k = 1 and ordinary Fibonacci numbers when k = 2.) 

(a) Prove that f(n) — f(m — 1) is 0 or 1 for all 1, and that f(”) is unbounded. 


(b) Prove that f(F,,,,) = F,, for m > 1. 


E 3275. Proposed by Thomas J. Laffey, University College, Dublin and Desmond 
MacHale, University College, Cork, Ireland. 


(a) Suppose R is an associative ring, not necessarily with unity, in which every 
element is a square. If squaring is a ring homomorphism, prove that R is commuta- 
tive. 

(b) Suppose R is an associative ring and suppose there is a function f from R 
onto R which satisfies f(x + y) = f(x) + f(y) and f(xy) = (xy)? for all x, y © R. 
Must R be commutative? 


SOLUTIONS OF ELEMENTARY PROBLEMS 


A Generalized Fibonacci Recurrence 


E 3130 [1986, 131]. Proposed by Mark Kantrowitz (student), Brookline, MA. 


Let {H,,} be a generalized Fibonacci sequence, i.e. H, and H, are arbitrary 
integers and, for n > 2, H, = H,_, + H,,_>. 

(a) Find 7, in terms of H, and H,, such that H,,H,,,,.+T7, H>,H>,44+ T, 
Ay, -~Hon+1 — T, and H,,,_,H>,43 — T are all perfect squares. 

(b) Prove that T is unique. 
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Solution by Samuel W. Bent, Dartmouth College, Hanover, NH. 
(a) Applying the Fibonacci recurrence and induction, we have 
An+2  Ansi} | Hn+1 An ac i| = A, Hy, ac 1 
H,., 4, H, 4H,.,| |1 0 H, H,| |1 O}] ° 
Letting 7 = H,H, — H} and taking determinants converts this to H,,,,H, + 
(—1)"T = H?,,, which yields the first and third results for even and odd values of 
n, respectively. 
Similarly, since H,,,— H, = H,,, and 2H,.,,.—- H, = A,4.+ Hy. = Anas, 
we have 
A+ A, +2 _ A+3 A+) ; 2 1] _ AL, Al, ; 2 a 
A, +2 Hi, Any, Ay-1 -1l -!1 A, AA, -1 -1 ) 


Using the further identity 


Hs Hy) _ F 1 H; Hy, F ° 

H, 4H, 0 1)J)/H, A, il1 1 
and taking determinants, we have H,H,,,+(-—1)"T = H7,,, which yields the 
second and fourth results for even and odd values of n, respectively. 


(b) It is easy to verify by induction that H,,, = F,_,H, + F,H>, where { F,} 
are the usual Fibonacci numbers. The well-known identity 


F i n 5”) 
nn J5 (o ~~ oo) ’ 
where @ = (1 + ¥5)/2 and $ = (1 — ¥5)/2, implies 


1 A 
Ana = ye h/e + H,)¢" + O(|9|”). 


Furthermore, (H,/¢ + H,) # 0, since ¢ is irrational. 

Suppose that 7” is a solution to the problem, so that H,,,,H, + (—1)"T’ = b?,, 
for some sequence of integers {b,}. Then H7,, — b?,, = (—1)"(T — T’). The left 
side is either zero or has magnitude at least 2|H,,,,| — 1 (the distance from H7,, to 
the nearest square). But 2|H,,,,| — 1 = co” + O(1) for some nonzero constant c, 
and for large enough » this exceeds the constant |7 — T’|. Thus for large n the left 
side must be zero, which implies T — 7’ = 0. 


Editorial Comment. The problem has appeared before, as noted by several 
readers. G. Philippou cited Problem B-549 in The Fibonacci Quarterly [1985, 182; 
1986, 184] and H. V. Krishna cited a paper of A. F. Horadam, “A generalized 
Fibonacci sequence,” Amer. Math. Monthly, 68 (1961) 455-459. 


Solved also by 18 other readers and the proposer. Several incorrect or partial solutions were received. 


Trapezoids in a Polygon 


E 3157 (1986, 482]. Proposed by Liviu I. Nicoaescu, Iassy, Romania. 


How many sets of four distinct points forming the vertices of a trapezoid are 
there if the points are chosen from the vertices of a regular n-gon (n > 4)? 
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Solution by Dennis Estes and Charles Lanski, University of Southern California. 

The answer is n( a *) if n is odd and (n — 3)( " ") if n is even. Any trapezoid 
determined by four vertices of a regular n-gon is defined by two parallel chords 
joining vertices, so we count the number of such pairs of parallel chords. Any chord 
joining two vertices is parallel to a side or is perpendicular to a diameter (if 7 1s 
even); we organize the counting using this fact. 

If n is odd, then each side has (n — 1)/2 chords parallel to it (including itself), 


giving ((" maa *| pairs of chords. The sets of chords corresponding to distinct sides 


are disjoint, so there are n{ a *| pairs of parallel chords. We claim each pair 


generates a distinct trapezoid. Two pairs generate the same trapezoid if and only if 
they form a parallelogram. This requires the chords in a pair to have the same 
length, meaning their endpoints differ by the same number of vertices around the 
n-gon. Hence, vertices of an n-gon can determine a parallelogram only if n is even. 


If n is even, then there are ("/ "} pairs of chords parallel to each side. Opposite 


sides yield the same pairs, so there are (n /D( " | pairs of chords parallel to sides. 
Each diagonal has (n — 2)/2 chords perpendicular to it, so altogether there are 
(n /2( nf 7 ‘| pairs of chords perpendicular to diameters. As before, two pairs 


generate the same trapezoid if and only if they form a parallelogram. In terms of 
vertex distance around the n-gon, the lengths of these sides must be i, j,i, j in 
order, with i + j =n/2. There are n/2 —1 such parallelograms starting at any 
vertex. The n vertices give rise to (n/2 — 1)n parallelograms, each counted four 
times, once for each of its four vertices. Thus there are 


(n/2 —1)n/4 = a, 


distinct parallelograms. Eliminating the duplication, we have 
(n/2)("6) n (2) "77 ‘| _ ("4 | =(n- ("2° | 


trapezoids when n is even. 


Also solved by J. Cohen, R. B. Eggleton (Australia), N. Felsinger, W. Janous (Austria), J. M. Rojas 
(student), K. L. Stellmacher, J. T. Ward, H.-F. Yeung (Australia), and the proposer. Five incorrect 
solutions were received. 


A Natural Limit 


E 3159 [1986, 565]. Proposed by Emeric Deutsch, Polytechnic Institute of New York. 


Define a sequence ( p,) by py) = 1 and 


1 1 1 1 
Pn + Ty Pn-1 + Fy Pn-2 F ° Topi?! + Po = I. 


Show that ( p,,) is convergent and find its limit. 


Solution I by Jonathan M. Borwein, Dalhousie University, Halifax, Nova Scotia. 

Let p(x) = Lp,x” be the ordinary generating function for (p,). The recurrence 
given for p, is the nth term of the series product p(x)e* = Lx" =1/(1 — x). 
Hence, p(x) = e*/(1 — x) and p, = X%_,.(—1)*/k!, which has limit 1/e. 
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Solution IT by Paul Cull, Oregon State University, Corvallis, OR. 

Let g(x) = L?_op,_,x*/k!. The recurrence implies g,(1) = 1 for all n, and the 
definition of g, implies g/,,(x) = g,(x). The solution to this recursive differential 
equation is g,(x) = L"_)(x — 1)*/k!. Setting x = 0, we have p, = Li_9(—1)*/K!, 
which has limit 1 /e. 


Solution III by G. Behrendt, Universitat, Tiibingen, West Germany. 
Let D, be the number of derangements of n elements, i.e. the number of 
permutations of 1,..., leaving no element in its original position. There are 


,)D,—% Permutations leaving exactly k elements fixed. Since every permutation 


leaves some number of elements fixed, we have v7 o(")D,—x = n!. Dividing by n! 
shows that the ratios D,/n! satisfy the recurrence for the p,; also Dj = ky = 1. By 
the inclusion-exclusion principle we know that D,/n! = X%_,(—1)*/k!, which has 
limit 1/e. 


Editorial comment. S. H. E. Hou noted that another solution appears in D. E. 
Knuth, The Art of Computer Programming, Vol. 1, Addison-Wesley (1973), 62-63. 


Also solved by over 75 other readers and the proposer. 
A Varied Menu 


E 3169 [1986, 650]. Proposed by Dean S. Clark, University of Rhode Island. 


A gentleman, who decides randomly what to eat for dinner so long as it isn’t the 
same thing two nights in a row, will return to his favorite restaurant next Friday. Is 
it more likely that he will order (a) Chicken Mediterranean or (b) the dinner he had 
(which he cannot remember) when he first visited the restaurant? How much more 
likely? 


Solution by David Callan, University of Bridgeport. 

It depends on whether he has visited the restaurant an even or odd number of 
times. The choice of dishes on successive visits forms a finite Markov chain, since 
each choice depends only on the previous one. Assuming there are n dishes 
available, the n by n transition probability matrix P has zeroes on the diagonal and 
1/(n — 1) everywhere else. So P = (n — 1)7\(J — 1), where J is the n by n matrix 
of ones. Replacing (n —1)7! by 1/n+1/{n(n — 1)] yields P= n-V —- 
(n — 1)~*(I — n7~4J), which is the basis step for an elementary inductive proof that 


1 (-1)’ 
* (1) 


ea 


The induction step uses the fact that J? = nJ. 

From this, the probability of repeating a dish r visits later is 1/n + 
(—1)’"1/{n(n — 1)"~1]. Since he cannot remember the dish he had on his first 
visit, the probability of (a) is 1/n. The probability of (b) differs from this by 
1/[n(n — 1)"~+], and is larger on even visits and smaller on odd visits. 


(I-n-‘J). 


Editorial comment. Several solvers interpreted the statement of the problem to 
exclude the possibility that the dinner ordered on the first visit was Chicken 
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Mediterranean. This leads to the same qualitative result as above, but the difference 
in probabilities becomes 1/(n — 1)’. 


Also solved by S. F. Barger, J. W. Grossman, R. Hill (England), C. Hurd, O. P. Lossers (The 
Netherlands), J. T. Ward, and the proposer. 


ADVANCED PROBLEMS 


For instructions about submitting solutions of Problems, which should be mailed before October 31, 
1988, see the inside front cover. Please place the solver’s name and mailing address on each 
(double-spaced) sheet. Include a self-addressed card or label if an acknowledgment is desired. 


6574. Proposed by Moshe Laub, Jerusalem, Israel. 


Put M, = L'_,j? forr = 1,2,.... If n © N (the set of positive integers), define 
m(n) to be the unique positive integer r such that M. <n < M,,, 

(a) Let S, be the set of n © N such that n is expressible as a sum of m(n) 
distinct squares of positive integers. Prove that S, has asymptotic density one but 
that N \ S, is infinite. 

(b) Let S, be the set of n © N such that n? is expressible as a sum of m(n*) 
distinct squares of positive integers. Prove that S, has asymptotic density one. *Is 
N\ S, infinite? 


6575. Proposed by Richard Stanley, Massachusetts Institute of Technology. 


Let X1,...,X,, Vis-++> Y, be variables and for] <i<j <n set X;;= x; + Xj4, 
tots +X, and Yi,=Yit Via t+ ++ ty, Also set X;,1; = Yj41,; = 0, and any 
product TT?7? equal to 1. Prove the identity 

n+1~-i 


n 0 (Xi + Y, 1) ; sat (Xistonei—k + Y; nt1—k) 
) ars Sn = 
I TE (Xe + Yea) I(x i+1,n+1— st Vine :) 


For instance, when n = 2 we get 


xX1(x.+y¥, +) fi (x, +X. + yy) x2), 
Xo yy Xo t+ yy 


= XY, + XY. + X2)p. 
SOLUTIONS OF ADVANCED PROBLEMS 
Maximizing an Integral over a Diagonal Strip 


6521 [1986, 485]. Proposed by C. S. Gardner and C. Radin, University of Texas at 
Austin. 
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Let a, b be fixed, with 0 < a < b, and let f be a variable function, subject to 


f(x) > 0, —00 <x<oo,and f~ f(x) dx =1. 


What is the maximum value of 


VN) = [Of Ps») aa? 


Solution by Robert B. Israel, University of British Columbia, Vancouver. The 
maximum value of V(f) is n/(2n + 2) where n is the greatest integer strictly less 
than b/a. We first show that this value can be attained. Choose c and 6 > 0 such 
that a < c — 26 and nc + 26 < b. Then for any xo,..., x,, with |x, — kc| < 6, we 
will have a < |x; — x,| < b whenever i + j. Let f be a function supported in the 
union of the intervals (kc — 6, kc + 6) for k = 0 to n, with 


f° F(x) dx = 
kc—6 


n+1- 
Then 


VN= LY fe pe oso) aay 


i=0 j=i+1 ic—8 jce—6 


af EU a) = a5 


i=Q \*ic—6 7 2(n + 1) 


Next, we show that this value is an upper bound for V(/). It is easy to see that it 
suffices to prove this for functions f with bounded support, since, for any f 
satisfying the requirements, a suitable cutoff and rescaling produces a function g 
with bounded support and V(g) arbitrarily close to V(f). 

Consider two intervals J and J, each of length a or less, and separated by a 
distance of at least b. We claim that if a = f,f(x) dx and B = f,f(x) dx are both 
nonzero, then there is a function g that is equal to f outside J and J, zero on one of 
the intervals and a positive multiple of f on the other, so that 


[oa dx =1andV(g) >V(f). 


Let two candidates for g be 


0 xel, 
a+ B 
(x) = B f(x) xeJ, 
f(x) otherwise, 
and 
- - P Ax) xeEl, 
(x) = 0 xeJ, 


f(x) otherwise. 


1988] PROBLEMS AND SOLUTIONS 561 


Then 


= ——g, + ——-,. 
f a+ Be a+ Be? 


Note that if y+a<x<y+b then x and y arenot bohin]UJ.Ifx€IUJ 
then 


f(x)f(y) = f(x)| = a pel) t at pe? 


_ eee + cal) 


A similar argument shows that the extreme left- and right-hand sides of the above 
equality are equal when y ¢ J U J. Therefore, 


B 
VWN= 253 7. 


Thus at least one of V(g,) and V(g,) is greater than or equal to V( f), which proves 
the claim. 

Now let f be any function of bounded support. A finite number of applications 
of the result of the last paragraph produces a function g with support of diameter at 
most b, and V(g) > V(/). Since (n + 1) a > b, the support of g is contained in the 
union of n+ 1 intervals Jo,..., 7, (from left to right) of length a. Let p, = 
J, g(x) dx. Then 


Vis)< EY Pi Pj = 5{1- Yet) < sap 


i=0 j=i+1 


) + —— V(g,). 


since the minimum of L7_,p; subject to LX". )p, = 1 is attained with all p, = 
1/(n + 1). 

Editorial Comment. Most akin to Israel’s solution was that of O. P. Lossers. The 
proposers and (independently) Kenneth Schilling used the language of graph theory 
in essentially the same way to solve the problem. The relevant lemma (in Schilling’s 
notation) is the following. Let G be an undirected graph with e edges, v vertices, 
and no subgraph that is a complete graph on n + 1 vertices. Then 


n—1 
2n 


p2 


e< 


This is a weak form of a theorem of Turan, which asserts that 
n—-1 
2n 


n 
e< v? —- —6(1 — 8), 
2 
where 6 = v/n — |v/n}. 
Quotients Which Are Roots of Unity 


6523 [1986, 485]. Proposed by David G. Cantor, University of California, Los 
Angeles. 
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Suppose f(z) is an irreducible polynomial of degree d over the field of rational 
numbers, and suppose that f(z) has two roots a, 8 with a/f a primitive nth root 
of unity. Show that $(n) < d. 


Solution by I. M. Isaacs, University of Wisconsin, Madison, WI. We are given 
f(a) = 0 = f(B) where f is irreducible over Q and a/f = «, a primitive nth root of 
unity. Let E be a splitting field for f containing a and B and let G = Gal(E/Q), 
the Galois group. Let H = Gal(E/Q(a)) and K = Gal(E/Q(£)) and note that the 
subgroups H and K are conjugate in G since f is irreducible. Also 


|G: H| =d=(|G: KI, 
where d = deg(/). 

Now let N = Gal( E/Q(e)). Since ¢ is a root of unity, Q(e) is Galois over Q and 
so N is normal in G. Also, G/N is isomorphic to Gal(Q(«)/Q) and so it is abelian 
of order ¢(n). Now HN and KN are conjugate in G and since G/N is abelian, we 
conclude that HN = KN D K. We also have HN K = H (1 N, since an element of 


H fixes a = Be and so it fixes B if and only if it fixes e. 
We have 


|HN: H|-|HN: K| > |HN: HO K|=|HN: HON| =|HN: Al: |H: HOM, 


and so |HN: K| > |H: HO N| = |HN: N|. We conclude that |K| < |N| and so 
d > $(n), as required. 


Editorial Comment. Modifying the above argument by replacing group indices by 
the degrees of the corresponding field extensions leads to the following more general 
result, as was pointed out by Cantor. Let a and B be conjugate algebraic elements 
over a field k, and suppose K is an abelian extension of k contained in k(a, £); 
then [K: k] < [k(a): k]. A proof along these lines and a similar generalization was 
presented by Ze-Li Dou (student, Queens College, CUNY). 


Many readers took questionable shortcuts here, and a number of proposed solutions were judged as 
either incorrect or incomplete. 

Also solved by Ze-Li Dou, Edward H. Grossman, David R. Richman, John Henry Steelman, and the 
proposer. 


An Alternating Sum of Products of Beta Random Variables 


6524 [1986, 573]. Proposed by Gérard Letac and Guy Yéterian, Université Paul 
Sabatier, Toulouse, France. 


Let p and q be positive numbers, and { X,,}%°_) a sequence of independent 
random variables with the same distribution 


(p+ q) 
r(p)T(q) 


on [0,1]. Find the distribution of 


xP-\(1 —x)7" dx 


B,, q( 4x) = 


n 


00 
» (-1)"X)X, so X,, 
n=0 
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Solution by Western Maryland College Problems Group, Western Maryland Col- 
lege, Westminster. Let 


n 


Zm = d (-1)"XoX + X,, 
n=0 


and set Z = Z,,. We will show that Z has a £ density 
o(t) = ['(2p + q) 

I p)I(p +4) 

Observe that, since P{ X, < }} is constant, the event { X, < 5} occurs infinitely 

often (with probability one). Hence, the (m + 1)-fold product of X, contains a factor 


of an arbitrarily large power of 3 if m is large enough; hence it converges to zero. 
Now 


t?-1(1 — t)?*7* on [0,1]. 


oO 
P[X,=1forsomen] < )) P[X, =1] =0. 
n=0 
Thus Z,, converges with probability one because the series is alternating and it fails 
to decrease with probability zero. It follows that Z,, converges in distribution to Z. 
Denote by G,, the distribution of Z,, and G the distribution of Z. We can factor out 
X_ and write 


m 


Lm = wh — » (-1)" 'X, a x, — X,[1 -_ Yn—11 


n=1 
where Y,,_, is independent of X, and has the same distribution as Z,,_,. This will 
give us an integral equation satisfied by G. We write f for the density of Xp. 
Now G,,(t) = P[Z,, < ¢] satisfies 


G,,(t) = P[X(1 — ¥,-1) S$ ¢] 
=1-P{[Y,-,<1-t/X] 
=1- [G10 — t/s) f(s) ds. 

By letting m go to infinity we get | 
G(t)=1- fea — t/s) f(s) ds. 


This equation has a unique solution for G € L'[0,1]. To see this consider the 
operator defined by 


o[H](t) =1- f ‘H(1 — t/s) f(s) ds. 


By a change in the order of integration, and the substitution u = 1 — t/s, we find 
that 


lolz] - s{x]I=|f [ (HQ. = t/s) ~ K( ~ t/s)) f(s) asa 
< ff HG -#/s) — K( ~ t/s)|f(s) ads 
= [fH (w) ~ K(u)|sf(s) duds 


= || — Kllp, 
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where p = E[ X,] = p/(p + qg) < 1. By the contraction mapping theorem ¢ has a 
unique fixed point. Also, the equation for G shows that G has a continuous 
derivative on (0,1). Upon differentiating the equation for G we get an equation 
satisfied by the density of Z, namely, 


s(t) = f 'g(1 — t/s)s~ f(s) ds. 


This must also have a unique solution for g a probability density, because we can 
integrate to recover the equation for G and then differentiate to get g. It is now a 
relatively straightforward exercise in integral calculus to show (after some changes 
of variable) that the g(t) mentioned at the start, ie. the B( p, p + q) density, 
satisfies this equation. 


Editorial Comment. Specialists in probability theory might prefer a short proof 
given by Norman L. Johnson (University of North Carolina) who quickly obtains 
the result by taking rth moments of both sides of 

| LZ x — X)(1 a Y,.)s (*) 
and by using the fact that the product of two mutually independent beta variables 
with parameters (p,q) and (p+, p) has a beta distribution with parameters 
(p, p + q). O. P. Lossers (The Netherlands), whose solution is somewhat similar to 
the one from Western Maryland, also relies on moments to establish uniqueness. 
Lossers adds that the product relationship between ( p, g), (p + 4, p) and (p, p + 


q) is related to equalities for the order statistics Y,.,, from the exponential 
distribution by 


exP(— Yur n) = Xn-z, ke 


Here X, , denotes a random variable with distribution B, , and equality means 
equality in distribution. 

The proposers note that for p = g = 1, the result is essentially due to T. Ugaheri, 
On a limit distribution, Ann. Inst. Statist. Math. Tokyo, 1 (1950) 157-160. 


Also solved by Ignacy Icchak Kotlarski, Kenneth Schilling, Douglas P. Wiens, and the proposers. 
A Matrix Whose Cube Is the Identity 


6527 [1986, 659]. Proposed by Nicholas Strauss, Boston University. 

Let J(m) be the m X m matrix whose (i, /')th entry is 

i+] 
J 

Show that for all primes p and positive integers n, the matrix J( p") is a cube root 

of unity modulo p. 


Solution by Ira Gessel, Brandeis University, Waltham, MA. Let K(m) be the 
m X m matrix whose (i, /)th entry is 


(ay (" 51), 0<i,j<m-1. 


Let © be the linear transformation on polynomials in x of degree less than m 
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defined by 


a(s(x)) = (a= x)"'¥{—} 


It is easily seen that K(m) is the matrix of © with respect to the basis 1, x,..., x71. 
We find that 


er(4(x)) = (-x)" “(= =] 
and 


@°(f(x)) = (-1)" 'f(x), 
so K(m)? = (—1)"—1J, where J is the m X m identity matrix. 
If m = p”, where p 1s a prime, then we shall show that 


‘({m—i- 1 itj—-—m i+] 

—] (™ = | | | mod * 
(-1/ / ;/|(modp) —(#) 
for 0 <j < m. The equality is a simple identity. (Cf. Alan Tucker, Applied Combi- 


natorics, Wiley, 1984, p. 210.) To obtain the congruence, observe that for x an 
indeterminate and k an integer we have the power series identity 
(1+x)*=(1+x)*"(1+x)? 
=(1+x)* "(1 + x?") (mod p). 
Upon equating coefficients of x’ we obtain 


pets" 


, mod p), 

j (mod p) 

since j <p”. Setting k'=i+j gives (*). Thus K(m) =J(m) (mod p). Since 
(—1)”"—! = 1 (mod p), it follows that J(m)> = I (mod p). 


The solutions given by O. P. Lossers and the proposer involved sums of products of binomial 
coefficients. Jeffrey Mitchell Cohen showed directly that the matrix J(m) has determinant 1. 


The Normalizer Revisited 


6528 [1986, 659]. Proposed by William P. Wardlaw, United States Naval Academy, 
Annapolis, MD. 


The following is problem 11 on page 44 of C. Chevalley’s Fundamental Concepts 
of Algebra, Academic Press, 1956. 

Let H be a subgroup of a group G. Show that the elements s of G such that the 
mapping ¢ —> sts_‘ of G into itself maps H into itself form a subgroup N of G, of 
which H is a normal subgroup; show that every subgroup of G containing H and in 
which H is normal is contained in N. 

Provide an example to show that the assertion of the problem is false. 


Solution by Aage Bondesen, Sondermarken 25, 3060 Espergaerde, Denmark. Let 
G be the group of permutations of the set {1,2,...} of natural numbers, and let H 
be the subgroup consisting of those elements of G that map every odd number onto 
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itself. Define 
N = {s € G|sHs~' c BH}. 
Then define s € G by s(1) = 2 and 
s(2m) = 2m + 2, s(QQm+1)=2m-1, m=1,2,3,.... 
For every ¢ in A it follows that 
sts~*'(2m — 1) = st(2m +1) =s(2m+1)=2m-1, m=1,2,3,.... 


Thus sts~! € H, and, hence, s © N. On the other hand, let t € H map 2 onto 4 (ft 
could then, for instance, map 4 onto 2, and every other natural number onto itself). 
Then s~*ts(1) = s714(2) = s~1(4) = 2, so s-4ts € H and s-' € N. Since s & N this 
proves N is not a subgroup. Note also that sHs~! is a proper subgroup of H. 

Editorial Comment. A number of solutions used essentially this construction; 
Bondesen’s exposition seemed the most elementary. Many readers found counterex- 
amples in G = SL(2, R), while others used only slightly more esoteric G. Counter- 
examples can also be constructed using HNN extensions. The original assertion can 
be made correct either (i) by assuming all elements of H are of finite order or (ii) by 
defining N as the set of all s in G such that 


sHs~'=H 
or (iii) by defining N as the set of all s in G such that 


sHs-! c Hand s'Hs ¢ H. 


Also solved by Miroslav D. ASsi¢, Jeffrey Bergen, Jeffrey Mitchell Cohen, Jests Ferrer (Spain), 
Stephen M. Gagola, Jr., Enzo R. Gentile (Argentina), James E. Humphreys, A. A. Jagers (The 
Netherlands), Charles Lanski, O. P. Lossers (The Netherlands), Pat Morandi, Arthur Rothstein, Eugene 
Salamin, San Bernardino Problem Solving Group, John Henry Steelman, Douglas B. Tyler, Ken 
Yanosko, and the proposer. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Episodes in the Mathematics of Medieval Islam. By J. L. Berggren. Springer-Verlag, 
New York, 1986. xiv + 197 pp., $23.00. 


GERALD J. TOOMER 
History of Mathematics Department, Brown University, Providence, Rhode Island 02912 


Although the author does not say so, it is clear from the book’s tone and 
structure (“exercises” at the end of each chapter) that it is intended to be used 
primarily as a textbook in undergraduate courses in the history of mathematics. He 
tells us that “the book reflects my conviction that a proper study of the history of 
mathematics begins with a study of the texts themselves, and for this reason I have 
written the chapters not as catalogues of results but rather as expositions of portions 
of mathematical treatises.” So it is pertinent to ask, first, is it useful and practical to 
present, to an undergraduate audience, the mathematics of earlier epochs in the 
form of “the texts themselves”; and, secondly, how successful is the present 
attempt? 

As a historian of mathematics, I agree completely with Berggren’s emphasis on 
the importance of reading the original texts. But when one is dealing with mathe- 
matical texts from the ancient or medieval periods, the difficulties of presenting 
them to an undergraduate audience are formidable. Simply translating them is 
obviously insufficient, since the differences in terminology, notation, and the 
general approach to mathematical exposition are so great that a “faithful” transla- 
tion requires explanatory annotations longer than the original to aid the uninitiated, 
most of whom would quickly be deterred by this approach. Moreover, one has to 
assume that for the typical undergraduate (whether of a scientific or humanistic 
orientation), the ancient and medieval periods of human history are a vast series of 
blank pages occasionally enlivened by entertaining vignettes (such as Archimedes in 
the bath, to take a “mathematical” example). Yet the cultural background is of real 
importance in understanding why the practitioners of mathematics were interested 
in certain topics: the central role of astrology, for instance, is rightly stressed by 
Berggren in the present book. 

Necessarily, then, the writer of such a book has to supply at least a skeletal 
background of general historical information. Problems of “truth” immediately 
arise, for almost any general statement about a civilization requires a complex of 
qualifying statements to make it acceptable to the historian. This problem is 
intensified for the ancient and medieval periods by the scarcity of the evidence on 
many subjects, but, as I know by experience, undergraduates in introductory courses 
are made very uneasy by what they perceive as uncertainty in their textbooks or 
instructor (while the latter simply regards it as healthy skepticism). So I understand 
why (although I still find it difficult to approve) Berggren states as bald facts a 
number of things which are merely disputed hypotheses (to cite just one with which 
I disagree, that the application of numerical procedures to Greek planetary models 
is “at least as early as the time of Apollonius of Perga in the third century B.c.”’) 
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In the present case, the problems of the proper “historical” approach are further 
complicated by the difficulties of defining “Islamic” mathematics. One minor 
dispute is terminological: since “Islam” is primarily a religious term, it seems 
inappropriate to use it to qualify a science which had very little to do with religion 
(especially when a number of its practitioners in the period in question were not 
Muslims). I prefer “Arabic,” although that term too requires many qualifications. 
But, even when we allow “Islamic” to stand as a shorthand word for a cultural 
complex, we are still faced with the fact that the mathematics (like all the sciences) 
of that culture are simply a continuation of the Hellenistic Greek tradition. One of 
the most remarkable features of Islamic civilization was the way in which it took 
over and continued, in a different language and mostly in a different geographical 
area, the scientific heritage of antiquity, which was moribund in the contemporary 
Byzantine Empire, and in so doing breathed new life into it. There are a number of 
brilliant achievements in Arabic mathematics, but it has to be viewed as the direct 
continuation of the Greek tradition (and indeed is unintelligible without that 
background). Thus making “Islamic mathematics” a separate subject of study is 
artificial. Berggren, a scholar with a notable record of investigation of previously 
unstudied medieval mathematical texts, is of course well aware of all this, and of 
necessity allots some space to laying out the ancient Greek background to the topics 
he treats. But I should have liked to have seen in the book a more forceful 
presentation of the essential unity of Greek and Arabic mathematics. On the 
contrary, where possible, he devotes a part of each chapter to “The Islamic 
Dimension,” in which he attempts to show how the topic is related to peculiarly 
Islamic features of the civilization. These aspects are usually trivial, and in one case 
that classification is simply inappropriate. In the chapter on geometry, constructions 
with a “Rusty Compass” are attributed to the “Islamic Dimension.” There are 
indeed some interesting problems in Arabic mathematics involving the use of a 
compass restricted to a fixed opening, but there is nothing peculiarly Islamic about 
this, and the basic idea, as usual, is of Greek origin (although, quite by chance, the 
one known detailed treatment of the subject from the Hellenistic period survives 
only in the Arabic translation of a Greek work). 

How well has the author performed his admittedly difficult task? It is probably 
impossible to present the necessary historical background in a way which is at the 
same time accurate, intelligible and useful in the present context, but the effort is a 
valiant one. My principal objections to the “historical” first chapter are the use of 
photographs of buildings which illustrate nothing pertinent (apparently on the 
principle that any illustration is better than none), and exercises involving the 
Dictionary of Scientific Biography (which encourage the beginner’s notion that 
“research” consists of copying information out of reference works). For the rest of 
the book, the author has the advantage of being able to select “episodes” of intrinsic 
interest, and for the most part he has selected well. Readers of all mathematical 
levels will find things to interest and instruct them. Pedagogically, the presentation 
of the texts is most successful where the author was able to stick most closely to the 
originals. This is notably so in the chapter on geometry, which is consistently 
interesting and intelligible (not coincidentally, this is the part of mathematics where 
methods and notations have changed least in modern times). In the chapter on 
arithmetic, Berggren has done an excellent job of presenting the numeral systems 
(both decimal and abjad-sexagesimal) in such a way that the determined student can 
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actually decipher some of the Arabic texts which are presented in facsimile. 
Elsewhere, he relies heavily on modern notation, which is probably necessary in the 
treatment of algebra if the student is not to be completely put off by the Arabic 
rhetorical exposition. Trigonometry, a usually dismal subject, is enlivened here by 
the account of al-Birtini’s measurement of the earth (where, for once, the anecdotal 
approach is enlightening). I suspect that the audience to whom this book is 
primarily addressed will find considerable difficulty in the chapter on spherics, 
where too much is presented in a short space on a topic which is utterly strange to 
most undergraduates. The chapter also suffers from containing a “catalogue of 
results” (which the author had foresworn in his preface). A preferable approach 
might have been to treat the astrolabe in enough detail that the reader could 
actually construct one: this leads to numerous illuminating exercises. 

The proofreading has been sloppy, and the misprints or slips extend, unfor- 
tunately, to numerals which are significant to the course of the exposition. But the 
instructor may make a virtue of this deficiency by encouraging the students to 
“reconstruct” the correct proofs or procedures. A particularly good candidate for 
this “exercise” is the example of division of polynomials on pp. 115-117. It is 
immediately apparent that there is something wrong, but it will take an astute and 
determined student to unmask all 13 numerical errors, and in doing so he will 
certainly gain a thorough understanding of al-Samaw’al’s procedure. I have to 
confess that I did not succeed until I looked at the original Arabic text. 


Mathematics and Optimal Form. By Stefan Hildebrandt and Anthony Tromba. 
Scientific American Books, Inc., New York, 1985. xu + 215 pp., $29.50. 


FRANK MORGAN 
Department of Mathematics, Williams College, Williamstown, Massachusetts 01267 


Why is a soap bubble round? That is the question I like to ask new acquaintances 
who wonder about mathematics. After some persistent encouragement, a good 
number come up with this excellent answer: 


“Since the pressure is the same throughout the bubble, the curvature must be 
the same everywhere.” 


Well, that answer deserves commendation. But does it actually follow that a 
bubble must be a round sphere? Is the round sphere the only closed smooth surface 
of constant (mean) curvature? In 1958, A.D. Alexandrov [1] proved that if the 
surface does not cross or intersect itself (as soap films do not), then it must indeed 
be a round sphere. Good. But now suppose we would allow self-intersections. In 
that case, under the additional assumption that the surface is a (possibly deformed) 
sphere, H. Hopf [6] was still able to prove that it actually must be round. Of course, 
that left an outside possibility of a closed, constant-mean-curvature surface of 
another type. In 1986, Henry Wente [10] of the University of Toledo actually found 
one: a constant-mean-curvature, self-intersecting torus (see FIGURES 1 and 2). 

The computer graphics in this review were kindly provided by that famous team 
at the University of Massachusetts at Amherst: Professors David Hoffman and Joel 
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but not with constant mean curvature. 


’ 


Fic. 1. A torus 


CSS SISOS 


Fic. 2. Wente’s twisting constant-mean-curvature torus. Courtesy D. Hoffman, J. Spruck, J. Hoffman, 


and M. Callahan. 
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Spruck, graphics programmer Jim Hoffman (no relation), and high school senior 
Michael Callahan (now at Harvard). 

Even more recently, Nicolaos Kapouleas [7], a graduate student at the University 
of California at San Diego (and now Stanford), produced more closed, constant- 
mean-curvature surfaces of infinitely many types (of any genus greater than two). 
They can be approximated by piecing together spheres and the undulating Delaunay 
surfaces, as suggested by FIGURE 3. 


Fic. 3. One of Kapouleas’ constant-mean-curvature surfaces (schematic). 


Compound bubbles (see FIGURE 4) have more geometry than single ones. In the 
double bubble the larger bubble, the smaller bubble, and the separating surface all 
come together along a curve, always at 120° angles. In the triple bubble, four such 
singular curves meet at a point, always at the “tetrahedral” angle of about 109°. 
Remarkably, in more complicated froths of thousands of bubbles, no further types 
of singularities occur. Surfaces meet along curves only three at a time and always at 
120° angles. Such curves meet only four ata time and always at about 109° angles. 
The Belgian physicist, J. A. F. Plateau, observed and recorded these rules over a 
hundred years ago. But it was not until 1976 that Jean Taylor, now Professor of 
Mathematics at Rutgers University, succeeded in deducing them from a single 
principle of area-minimization alone ({9], nicely explained with illustrations in [2]). 

Taylor’s purely theoretical reasoning stands on its own. One might wonder how 
she ever came up with the complicated mathematical surfaces she needed. Once 
when visiting her at home I saw the answer: models of experimental soap films. 

These soap bubble singularities are the same singularities which occur in insect 
wings and radiolarian skeletons. 
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Ufo, 


FIG. 4. Compound soap bubbles. Drawings by Jim Bredt. Photograph courtesy of Gordon Gahan /Prism. 
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In contrast to the unique possible round shape for a single soap bubble, some 
physical geometric problems have more than one solution. A wire dipped in soap 
solution may support two different soap films (see FIGURE 5). 


Fic. 5. Two soap films with the same boundary. Drawings by Jim Bredt. 


It is today an open question whether a smooth boundary can support infinitely 
many stable soap films. (For an illustrated expository article see [8].) 

Alternatively consider complete embedded minimal surfaces, that is, soap-film-like 
surfaces that continue on indefinitely, without edge or boundary or self-intersection. 
Until a few years ago, only two of finite total curvature were known: the plane and 
the catenoid (see FIGURE 6). 


FIG. 6. The plane and the catenoid: the only known complete embedded minimal surfaces of 
finite total curvature until 1985. 


In the past few years, University of Massachusetts Professors David Hoffman 
and Bill Meeks ([4] and [5], nicely described and illustrated in [3]), have discovered 
infinitely many others. Their beautiful pictures have adorned the covers of mag- 
azines and the walls of offices (see FIGURE 7). 


Mathematics and Optimal Form by S. Hildebrandt and A. Tromba provides a 
magnificent, liberal essay on physical and geometric shapes. Hundreds of beautiful 
illustrations range from an electron micrograph of a radiolarian skeleton to a rich 
color reproduction of Chardin’s famous painting of the soap-bubble-blower, from 
brilliant photographs of crystals and liquids in free fall to computer graphics of 
theoretical soap films. The first simple figure of a MObius strip is supplemented by a 
more imaginative sketch by Escher and by an old photograph of A. F. M6bius 
himself. You may have to read the book to appreciate how the story of the 
“isoperimetric problem’ is illustrated by a medieval map of Cologne and a 
seventh-century painting of Dido and Aeneas. 

A vivid account of Archimedes includes his mathematics and his marvelous 
contributions to the defense of Syracuse against the Romans. In some interesting 
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Fic. 7. One of the new complete embedded minimal surfaces discovered by D. Hoffman and 
W. Meeks. (Courtesy D. Hoffman, J. Spruck, J. Hoffman, and M. Callahan) 


selections taken from Plutarch, the Roman commander Marcellus refers to Archi- 
medes as “this geometric Briareus [a hundred-armed giant of Greek legend].” 

Gradually the book proceeds to lead the general reader enchantingly from 
ancient Greece right into modern mathematics. Unfortunately, after giving such a 
convincing and sympathetic portrayal of Archimedes, it generally omits even the 
names of contemporary mathematicians. This omission seems especially sad and 
ironic in this kind of book, so long awaited, which explains the mathematician so 
well. 

From the beginning to end the book provides readable mathematics and applica- 
tions, colorful history, and even art and poetry. The opening pages give an alluring 
new translation of Morgenstern’s poem, “The Bewitchment,”’ about the magic of 
blowing bubbles. 

Anyone who likes to read and think should love this book. As a mathematician, I 
want to give it to all my friends. 
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General, S*(18), P**, L*. Proceedings of the In- 
ternational Congress of Mathematicians, August 3- 
11, 1986. Ed: Andrew M. Gleason. AMS, 1987, $195 
set (ISBN: 0-8218-0110-4]. Volume 1, ci + 872 pp; 
Volume 2, iii + 837 pp. Well-written expository ar- 
ticles by prominent researchers on central questions 
in mathematics. The two volumes contain reports 
on the work of the Fields medalists and Nevanlinna 
prize winner, fifteen plenary addresses, and over one 
hundred invited addresses grouped by field, as well 
as a list of short communications. Focusing on the 
idea rather than the technical, the articles success- 
fully give the mathematician an appreciation of top- 
ics outside his or her own specialty. The proceedings 
also include coverage of the opening and closing cer- 
emonies, and several lists of committees, donors, and 
participants. GG 


Elementary, T*((13: 1). Mathematics for Liberal 
Arts: A Problem Solving Approach. Rick Billstein, 
Johnny W. Lott. Benjamin/Cummings, 1986, xvii + 
646 pp, $29.95. [ISBN: 0-8053-0863-6] A book cover- 
ing basic algebra, but introducing it in such a way as 
to emphasize the structure behind it. After covering 
the basics, it uses them in probability, statistics, and 
linear programming. MZ 


Education, P*, L**. Science Achievement in Sev- 
enteen Countries: A Preliminary Report. IEA. Perg- 
amon Pr, 1988, xi + 125 pp, (P). [ISBN: 0-08-036563- 
9] First report of an international assessment of sci- 
ence education for students of ages 10, 14, and 18. 
U.S. results in science are comparable to those in 
mathematics (see The Underachieving Curriculum, 
TR, Match 1987): very low. U.S. performance de- 
clined as children got older, from middle for 10-year- 
olds to very low (last in biology) for students who 
are still studying science as high school seniors. One 
sub-study reports that the bottom 25% of U.S. 14- 
year-olds perform only at chance level. LAS 


Education, P, L. Perspectives on Research on Ef- 


fective Mathematics Teaching, Volume 1. Ed: Dou- 
glas A. Grouws, Thomas J. Cooney, Douglas Jones. 
NCTM, 1988, viii + 261 pp, $15 (P). [ISBN: 0-87353- 
254-6] One of a series of four reports from NCTM- 
sponsored conferences intended to establish research 
agendas in areas of mathematics education where 
conceptual consensus is feasible. Twelve survey pa- 
pers outline issues in teaching amenable to research, 
including higher-order thinking, cross-cultural stud- 
ies, content determination, knowledge construction, 
and professionalism of teachers. Proceedings of other 
conferences—on problem solving, on learning alge- 
bra, and on number concepts—will be published sub- 
sequently, as will an overview volume. LAS 


History, L*, P. I Have A Photographic Mem- 
ory. Paul R. Halmos. AMS, 1987, vii + 326 pp, 
$55. [ISBN: 0-8218-0115-5] Six-hundred snapshots 
of mathematicians culled from Halmos’s collection of 
6000, captioned with chatty, informative commen- 
tary on mathematics, meetings, and careers. Ar- 
ranged more or less chronologically from the early 
50’s to the mid 80’s. Index of names. A unique, 
visual portrait of the mathematical community. LAS 
Logic, T*(15-17: 1, 2), L*. Computability and 
Logic. Daniel E. Cohen. Math. & Its Applic. Hal- 
sted Pr, 1987, 243 pp, $64.95. [ISBN: 0-470-20847-3] 
Textbook for introductory course in logic or com- 
putability. First part covers elementary recursion 
theory. Includes chapter on Hilbert’s tenth problem. 
Second part develops propositional and predicate cal- 
culus through completeness theorems using natural 
deduction. Final two chapters cover incompleteness 
and undecidability theorems and decidability of nat- 
ural numbers under addition. KS 

Linear Algebra, T*(13-14: 1, 2), S, L. A Uni- 
fied Introduction to Linear Algebra: Models, Meth- 
ods, and Theory. Alan Tucker. Macmillan, 1988, xiii 
+ 540 pp. [ISBN: 0-02-421580-5] An introduction 
to matrix algebra via extensive models that illus- 
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trate important issues in applied linear algebra: cal- 
culation, sensitivity, appropriateness of linearity, es- 
timation, errors, etc. Applications include digital fil- 
tering, Markov processes, linear programming, adja- 
cency matrices, condition numbers, and all canonical 
examples. The text consists predominantly of page- 
long examples of models, loosely glued together with 
definitions, occasional theorems, and boxed rules of 
calculation. Good exercises, both routine and in- 
novative. An imaginative, stimulating text for “the 
most useful college mathematics course most stu- 
dents ever take.” LAS 


Calculus, T(13: 2-3). Calculus. William E. Boyce, 
Richard C. DiPrima. Wiley, 1988, xvii + 1124 pp, 
$54.82. [ISBN: 0-471-09333-5] Fairly standard set 
of topics. Good applications, mostly from mechan- 
ics and physics. Numerical methods are reasonably 
treated. Odd numbered answers included. Solution 
manual available. MR 


Calculus, T(13: 2). Calculus and Analytic Ge- 
ometry, Third Edition. Philip Gillett. DC Heath, 
xix + 1034 pp, $38. [ISBN: 0-669-13100-8] Major 
changes include addition of new exercises, moving 
the trigonometry review to Chapter 1, having lim- 
its precede derivatives, adding a new section on dif- 
ferentiation of sines and cosines, and moving limits 
involving infinity to Chapter 5. (First Edition, TR, 
August-September 1981.) LC 


Calculus, T(17: 2), S. An Introduction to the Cal- 
culus of Variations. Charles Fox. Dover, 1987, viii 
+ 271 pp, $6.95 (P). [ISBN: 0-486-65499-0] Covers 
all the important techniques including first and sec- 
ond variation, principle of least action, and Weier- 
strassian E-function. Many worked examples and 
references. Some exercises with no solutions in- 
cluded. Would serve as a good reference but not 
as a text for an undergraduate course. MR 


Calculus, T(13: 1). Calculus, Fourth Edition. 
Marvin L. Bittinger. Addison-Wesley, 1988, xviii + 
570 pp, $39.75. (ISBN: 0-201-12216-2] For a one- 
semester intuitive calculus course for non-mathemat- 
ics majors. “Applications” in a wide variety of dis- 
ciplines. Loaded with pedagogical features. Sup- 
plementary materials available include an instruc- 
tor’s manual, a test bank, videotapes, computerized 
testing, computer software, and a student’s solutions 
manual. What is left for the teacher? (First Edition, 
TR, June-July 1976; Second Edition, TR, January 
1981; Third Edition, TR, August-September 1985.) 
JK 


Calculus, T(13-14: 3). Calculus and Analytic Ge- 
ometry, Seventh Edition. George B. Thomas, Jr., 
Ross L. Finney. Addison-Wesley, 1987, xv + 1253 
pp, $48.95. (ISBN: 0-201-16320-9] Now contains 
chapter reviews, historical notes, lessons in three- 
dimensional drawing, enhanced artwork, and quick- 
reference charts that group formulas, emphasize 
procedures, and describe problem-solving strategies. 
Among many revisions is a considerably changed pre- 
sentation of multivariable calculus. Many new appli- 
cations. Same level of rigor. (Fourth Edition, TR, 
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December 1968; Extended Review, June-July 1970; 
Alternate Edition, TR, June-July 1973; Fifth Edi- 
tion, TR, June-July 1979.) DFA 


Calculus, T(13: 2). Calculus with Analytic Ge- 
ometry, Second Alternate Edition. Earl W. Swokow- 
ski. Prindle, Weber & Schmidt, 1988, xiii + 914 
pp. [{ISBN: 0-87150-008-6] Contains essentially the 
same material as Calculus with Analytic Geometry, 
Fourth Edition, but this Alternate Edition introduces 
trigonometric functions earlier. (First Edition, TR, 
December 1975; Second Edition, TR, June-July 1979; 
Third Edition, TR, November 1984.) JNC 


Calculus, T(13: 2). Calculus with Analytic Geom- 
etry, Alternate Edition. Robert Ellis, Denny Gulick. 
Harcourt Brace Jovanovich, 1988, xi + 1069 pp, $39. 
[ISBN: 0-15-505700-6] The major difference between 
this and the Third Edition is that here the chapter on 
applications precedes rather than follows techniques 
of integration. Other changes include introducing 
limits via tangent lines and velocity, and a new sec- 
tion on integration using tables. (First Edition, TR, 
August-September 1978; Third Edition, TR, October 
1986.) JNC 


Calculus, T(13: 2). Calculus and Analytic Geom- 
etry, Alternate Second Editon. C.H. Edwards, Jr., 
David E. Penney. Prentice-Hall, 1988, xiii + 1089 pp. 
[ISBN: 0-13-111469-7] This Alternate Edition has an 
earlier introduction of the Chain Rule, later coverage 
of trigonometric functions, and a more intuitive in- 
troduction to exponential and logarithmic functions. 
(First Edition, TR, October 1983; Second Edition, 
TR, May 1986.) JNC 


Calculus, T(13-14: 3). Calculus and Analytic Ge- 
ometry, Fourth Edition. Al Shenk. Scott Foresman, 
1988, 1135 pp, $35.96. [ISBN: 0-673-16721-6] Ex- 
tensively rewritten edition, now at a higher level; 
primarily for mathematics, science, and engineering 
students. Some notation and terminology have now 
been standardized. Rearranged presentation of many 
topics, following reader suggestions. New section on 
Jacobians and changing variables in multiple inte- 
grals. Manuals, computer supplements, transparen- 
cies, test bank available. DFA 


Calculus, T(13: 1, 2). Mathematical Analysis for 
Business and Economics, Second Edition. Charles 
W. Schelin, David W. Bange. Ser. in Math. Prindle, 
Weber & Schmidt, 1988, xiii + 601 pp. [ISBN: 0-534- 
91493-4] Covers mathematical topics of use to stu- 
dents in business and economics, including calculus, 


‘multivariate models, sequences and series, probabil- 


ity, matrix algebra, linear programming, and curve 
fitting. The topics are introduced through examples. 
The mathematical concepts are drawn from the ex- 
amples and then developed and applied to more gen- 
eral situations. Applications to business and eco- 
nomics are strongly emphasized in the examples and 
exercises, A good background in algebra is assumed, 
but a review is provided in an appendix. RH 


Calculus, T(13: 1, 2). Applied Calculus, Second 


Edition. Marvin L. Bittinger, Bernard B. Morrel. 
Addison-Wesley, 1988, xix + 779 pp, $41.95. [ISBN: 
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0-201-12211-1] Many varied examples and problems 
for students in business and the social sciences. Stan- 
dard chapters on differentiation; delays treatment of 
trigonometry. Integration techniques are substitu- 
tion, parts, and tables. Nice chapters on differential 
equations and numerical methods. Several variables 
chapter includes least squares and Lagrange multi- 
pliers, but applies only local methods to global ex- 
tremum problems! Error estimates for Taylor series 
are not discussed! Features include exercises in the 
Margins, answers to odd problems, and formulas on 
the book’s covers. A test generator, graphing soft- 
ware, and student solution manual are available. GG 


Calculus, T(13: 2). Calculus: Theory and Appli- 
cations in Technology and the Physical and Life Sci- 
ences. R.M. Johnson. Math. & Its Applic. Halsted 
Pr, 1987, 333 pp, $39.95 (P). [ISBN: 0-470-20898- 
8] An expensive, compact paperback whose orienta- 
tion reflects its origin in a British College of Tech- 
nology. Covers the usual calculus topics but mini- 
mizes the motivational discussion and number of ex- 
amples. JNC 


Complex Analysis, T(18: 1), P. Applications of 
Harmonic Measure. John B. Garnett. Lect. Notes 
in Math. Sci., V. 8. Wiley, 1987, vii + 69 pp, $22.95 
(P). (ISBN: 0-471-62772-0] After a concise but lu- 
cid introduction to harmonic measure, the author 
presents several applications to such topics as level 
curves, interpolating sequences, and periodic spec- 
tra of Hill’s equation. No exercises, good bibliogra- 
phy. TAV 


Differential Equations, T(15-17: 1), S, P, L. 
Mathematics for Dynamic Modeling. Edward Bel- 
trami. Academic Pr, 1987, xvi + 277 pp, $27.50. 
(ISBN: 0-12-085555-0] A text for an applications- 
oriented second course in differential equations, as 
taught by non-pure mathematics departments. “A 
number of central results are motivated and ex- 
plained by a combination of a little rigor and a lot 
of intuition.” Equilibrium and stability in differen- 
tial equations modeling; orbit behavior of nonlinear 
models. Dozens of exercises. RB 


Partial Differential Equations, T**(17: 1). 
First-Order Partial Differential Equations, Volume 
I: Theory and Application of Single Equations. Hyun 
Ku Rhee, Rutherford Aris, Neal R. Amundson. In- 
tern. Ser. in Physical & Chemical Eng. Sci. Pren- 
tice-Hall, 1986, xiii + 543 pp, $45.95. [ISBN: 0- 
13-319195-8] A very nice book on partial differential 
equations intended to be used by first-year engineer- 
ing students, so it has many applications of partial 
differential equations. As well as applications, it con- 
tains the theory of how to arrive at the equations and 
the different techniques needed to solve them. MZ 


Partial Differential Equations, P. Nonlinear El- 
liptic and Parabolic Equations of the Second Order. 
N.V. Krylov. Math. & Its Applic. Kluwer Academic, 
1987, xiii + 462 pp, $99. [ISBN: 90-277-2289-7] The 
author constructs a solvability theory “covering the 
Bellman equation, the Monge-Ampére equation, and 
the quasilinear equations under natural conditions 
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on their coefficients,” based on differential equations 
theory without the usual recourse to probabilistic or 
geometric methods. RB 


Partial Differential Equations, P. Recent Topics 
in Nonlinear PDE IT. Ed: Kyiya Masuda, Masayasu 
Mimura. Math. Stud., V. 128. Elsevier Science, 
1985, vii + 228 pp, $70 (P). [ISBN: 0-444-87938-2] 
A collection of ten papers based on lectures given at 
a meeting on nonlinear partial differential equations 
given at Tohoku University, February 27-29, 1984. 
Topics presented range over various areas of mathe- 
matical physics. AM 


Numerical Analysis, P. Difference Schemes: An 
Introduction to the Underlying Theory. S.K. Go- 
dunov, V.S. Ryabenkii. Transl: E.M. Gelbard. Stud. 
in Math. & Its Applic., V. 19. Elsevier Science, 1987, 
xvii + 489 pp, $77.75. (ISBN: 0-444-70233-4] Trans- 
lation of the 1977 Russian edition. Convergence and 
stability theory of various difference schemes that 
arise in the numerical solution of the ordinary dif- 
ferential equations and partial differential equations 
of boundary value problems. RWN 


Analysis, T(18: 2), P. Ergodic Theory and Differ- 
entiable Dynamics. Ricardo Mané. Transl: Silvio 
Levy. Ergebnisse der Math. und ihrer Grenzgebiete 
3 Folge, Band 8. Springer-Verlag, 1987, xii + 317 
pp, $82. [ISBN: 0-387-15278-4] A translation from 
the Portuguese edition. Following a quick review of 
measure theory, the author presents an overview of 
the nature of dynamical systems that generate fun- 
damental questions of ergodicity. The following two 
sections on ergodicity and entropy are modern in ap- 
proach and quite deep. Many useful exercises and an 
excellent bibliography. TAV 


Algebraic Geometry, P. Lecture Notes in Mathe- 
matics-1273: Singularities, Representation of Alge- 
bras, and Vector Bundles. Ed: G.-M. Greuel, G. 
Trautmann. Springer-Verlag, 1987, xiv + 383 pp, 
$32.90 (P). [ISBN: 0-387-18263-2] Proceedings of a 
symposium held in Lambrecht, West Germany, De- 
cember 1985, which drew together researchers in di- 
verse topics. Three survey talks and eighteen re- 
search articles (half on singularities) by authors from 
Germany, Norway, India, Holland, United States, 
and France. RB 


Algebraic Geometry, T*(18: 3), P*. Relative 
Invariants of Sheaves. A. Verschoren. Pure & Appl. 
Math., V. 104. Dekker, 1987, xi + 249 pp. [ISBN: 0- 
8247-7734-4] Rich in examples and explanation, this 
introduction to sheaf theory combines methods from 
relative invariants of rings with classical techniques 
from sheaf theory. The author derives extension 
properties of coherent and quasicoherent sheaves, re- 
vealing some connections to local cohomology. In- 
cludes two treatments of global invariants together 
with torsion couples and morphisms between them; 
Krull schemes; Hecke actions; contravariant maps be- 
tween relative invariants. Beautifully written and 
filled with insights. There are, however, too few ex- 
ercises, which may make it difficult to use this book 
as a text. LW 
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Differential Geometry, P. Lecture Notes in 
Mathematics-1251: Differential Geometric Methods 
in Mathematical Physics. Ed: P.L. Garcia, A. Pérez- 
Rendon. Springer-Verlag, 1987, vii + 300 pp, $31.50 
(P). (ISBN: 0-387-17816-3] Proceedings of the four- 
teenth international conference held June 24-29, 1985 
at the University of Salamanca. A collection of four- 
teen papers covering superalgebras and supermani- 
folds, superfield theory, classical and quantized field 
theory, and differential geometric techniques in gen- 
eral. AM 


Differential Geometry, P. Differential Geometry 
and Its Applications. Ed: D. Krupka, A. Svec. 
Math. & Its Applic. D Reidel (US Distr: Kluwer 
Academic), 1987, xiii + 381 pp, $98. [ISBN: 90-277- 
2487-3] Proceedings of a conference on the topic at 
Brno, Czechoslovakia, August 1986. Thirty-four pa- 
pers in differential geometry and geometric methods 
in physics, predominantly by eastern European au- 
thors. MU 


Differential Geometry, T*(17-18), S. Riemann- 
tian Geometry. S. Gallot, D. Hulin, J. Lafontaine. 
Universitext. Springer-Verlag, 1987, xi + 248 pp, $29 
(P). [ISBN: 0-387-17923-2] A graduate text which 
starts with abstract manifolds rather than the local 
theory of embedded surfaces, and which illustrates 
new concepts and results with a repertoire of recur- 
rent examples (including spheres, projective spaces, 
compact Lie groups). Presumes some exposure to 
elementary notions of differential manifolds, repre- 
sentation theory, homotopy. Hundreds of solved ex- 
ercises! Intriguing approach. RB 


Differential Geometry, P. Seminar on New Re- 
sults in Nonlinear Partial Differential Equations. 
Anthony J. Tromba. Aspects of Math., V. E10. 
Friedr Vieweg & Sohn, 1987, vi + 198 pp, (P). [ISBN: 
3-528-08975-X] A collection of eight papers describ- 
ing results reported during the “year” in partial dif- 
ferential equations, January-September 1984 at the 
Max-Planck-Institut fir Mathematik. Participants 
included Jerry Kazdan, Tudor Ratiu, Jerry Mars- 
den, Henry Wente, Anthony Tromba, Wei-Ming Ni, 
Vincent Moncrief, Uwe Abresch, and Sergiu Klainer- 
man. AM 


Algebraic Topology, P. Algebraic Topology and 
Algebraic K-Theory. Ed: William Browder. Annals 
of Math. Stud., No. 113. Princeton U Pr, 1987, ix + 
563 pp, $65; $22.50 (P). [ISBN: 0-691-08415-7; 0-691- 
08426-2] Proceedings of a Princeton conference, Oc- 
tober 1983, in honor of the sixtieth birthday of John 
C. Moore, the “doctor-father” of numerous promi- 
nent topologists, including Swan, Browder, Stasheff, 
and May. Twenty papers on homotopy theory, al- 
gebraic topology more generally, and aspects of K- 
theory related to topology. RB 


Operations Research, T(16-18: 1, 2), P. Math- 
ematical Programming: Theory and Algorithms. M. 
Minoux. Transl: Steven Vajda. Wiley, 1986, xxviii 
+ 489 pp, $66.95. [ISBN: 0-471-90170-9] A survey 
of mathematical programming algorithms. Presents 
a unified introduction to linear programming, con- 
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strained and unconstrained nonlinear programming, 
integer programming, and dynamic programming. 
Each chapter includes an extensive bibliography. AO 
Probability, P. Lecture Notes in Mathematics- 
1193: Geometrical and Statistical Aspects of Prob- 
ability in Banach Spaces. Ed: X. Fernique, et al. 
Springer-Verlag, 1986, 128 pp, $11.60 (P). [ISBN: 
0-387-16487-1] Proceedings of a conference held in 
Strasbourg, 1985. Starts with a brief survey of work 
of Antoine Ehrhard. Covers invariance principles for 
the empirical measure of a mixing sequence, and for 
the local time of Markov processes; almost exchange- 
able sequences in L%,1 < q < 2; uniform conver- 
gence of Gaussian and Rademacher Fourier quadratic 
forms; metric entropy and the central limit theorem 
in Banach spaces. MZ 


Probability, P. Lecture Notes in Mathematics- 
1206: Probability and Analysis. Ed: G. Letta, 
M. Pratelli. Springer-Verlag, 1986, vill + 283 pp, 
$23.40 (P). [ISBN: 0-387-16787-0] Lectures given at 
the First 1985 Session of the Centro Internazionale 
Matematico Estivo held at Varenna, Italy. Lectures 
cover such topics as probability and geometry, Mar- 
tingales and Fourier analysis, cylinder measures, and 
local bases and nuclearity. MZ 


Stochastic Processes, T(18: 1), P. Lectures on 
Stochastic Analysis: Diffusion Theory. Daniel W. 
Stroock. Math. Soc. Stud. Texts, V. 6. Cambridge 
U Pr, 1987, ix + 128 pp, $11.95 (P); $34.50. (ISBN: 
0-521-33645-7; 0-521-33366-0] After an introductory 
section on Wiener measure and its setting, the book 
contains an introduction to diffusion theory, Martin- 
gale theory, and the Ito calculus. The tract concludes 
with the formulation of the Martingale problem of 
diffusion. Quite technical. Surprisingly, no bibliog- 
raphy. TAV 

Stochastic Processes, P. Ergodic Theory of Ran- 
dom Transformations. Yuri Kifer. Progress in Prob. 
& Stat., V. 10. Birkhauser Boston, 1986, 210 pp, 
$34. [ISBN: 0-8176-3319-7] A study of ergodic the- 
oretical properties of evaluation processes generated 
by independent application of transformation chosen 
according to some probability distribution. Applica- 
tions to problems of dynamical systems and stochas- 
tic flows. TAV 


Statistics, T(15-18: 1, 2), S, L. Statistical Meth- 
ods for the Analysis of Biomedical Data. Robert F. 
Woolson. Prob. & Stat. Wiley, 1987, xx + 513 
pp, $49.95. [ISBN: 0-471-80615-3] Reference guide or 
text for biostatistics. Presupposes only algebra. De- 
scribes procedures by presenting worked examples. 
Chapters on epidemiological and clinical data and 
on survival curves. FLW 


Statistics, T(17: 1), S. Multiple Comparison Pro- 
cedures. Yosef Hochberg, Ajit C. Tamhane. Prob. 
& Math. Stat. Wiley, 1987, xxii + 450 pp, $44.95. 
[ISBN: 0-471-82222-1] This textbook addresses many 
controversial subjects connected with multiple com- 
parison techniques including their misuse in practice. 
It includes an excellent bibliography of relevant lit- 
erature. Includes detailed derivations and operat- 
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ing characteristics for procedures, as well as exam- 
ples for implementation. The first part deals with 
fixed effects models, while the second part deals with 
other models and alternative approaches. Good for 
researchers and practitioners. LB-E 


Programming, T(14-18), S, P, L. Introduction 
to Common Lisp. Taiichi Yuasa, Masami Hagiya. 
Trans]: Richard Weyhrauch, Yasuko Kitajima. Aca- 
demic Pr, 1987, ix + 293 pp, $29.95. [ISBN: 0-12- 
774860-1] Common Lisp is a completely specified, 
hardware and operating system-independent dialect 
of the programming language Lisp; the authors pro- 
duced one of the first implementations (Kyoto Com- 
mon Lisp). Here they present Common Lisp for 
Lisp beginners (with computer science experience) 
and a Common Lisp reference for Lisp program- 
mers. Examples throughout, including complete pro- 
grams. RB 

Programming, T*(13), S**, P*, L*. The Lit- 
tle LISPer, Trade Edition. Daniel P. Friedman, 
Matthias Felleisen. MIT Pr, 1987, xiv + 186 pp, 
$12.95 (P). (ISBN: 0-262-56038-0] An extension of 
a noted text for novices which quickly introduces re- 
cursive thinking via Lisp to novices. Asserting that 
Lisp programming is essentially pattern recognition, 
these Indiana University authors instruct by means 
of an inviting, informal question-answer format, al- 
lowing combined advantages of abundant answered 
exercises and conversational tone. Aptly suited to 
non-programmers without quantitative backgrounds. 
(First Edition, TR, January 1975.) RB 


Computer Science, P. Proceedings of the 1987 In- 
ternational Conference on Parallel Processsing. Ed: 
Sartaj K. Sahni. Penn St U Pr (215 Wagner Build- 
ing, University Park, PA 16802), 1987, xx + 923 pp, 
$85 (P). (ISBN: 0-271-00608-0] Over 170 papers (ac- 
ceptances were only 36% of submission): architec- 
ture, algorithms, compiler techniques, interconnec- 
tion networks, hypercube computing, software sup- 
port, image processing, logic programming and sys- 
tems, systolic computing, numerical algorithms, mul- 
tiprocessor concurrency, data structures, graph algo- 
rithms, programming languages, fault tolerance and 
testing, programming systems, embeddings. RB 
Applications (Biological Science), S(18), P, L. 
Lecture Notes in Biomathematics-72: Modeling and 
Management of Resources under Uncertainty. Ed: 
T.L. Vincent, et al. Springer-Verlag, 1987, viii + 
318 pp, $32.60 (P). [ISBN: 0-387-17999-2] Proceed- 
ings of the Second U.S.-Australia Workshop on Re- 
newable Resource Management held in Honolulu in 
December 1988. Divided into three parts: mod- 
eling/biology, control/techniques, management /real 
problems. Twenty-three papers plus referees’ com- 
ments. SM 


Applications (Fluid Mechanics), T(18). Topics 
in Geophysical Fluid Dynamics: Atmospheric Dy- 


namics, Dynamo Theory, and Climate Dynamics. 
M. Ghil, S. Childress. Appl. Math. Sci., V. 60. 
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Springer-Verlag, 1987, xv + 485 pp, $39 (P). [ISBN: 
0-387-96475-4] The book is divided into four parts, 
starting with fundamentals like the effects of rota- 
tion, shallowness, and the quasi-geostrophic approx- 
imation. The rest covers large-scale atmospheric dy- 
namics, the dynamo theory, and theoretical climate 
dynamics. MZ 


Applications (Physics), P. General Relativity and 
Gravitation. Ed: M.A.H. MacCallum. Cambridge U 
Pr, 1987, xvii + 407 pp, $49.50. (ISBN: 0-521-33296- 
6] Proceedings of the eleventh international confer- 
ence held in Stockholm, July 1986. Thirteen ple- 
nary lectures surveyed recent advances: black holes; 
supergravity, Kaluza-Klein, superstrings; quantum 
gravity; twistors; numerical relativity, gravitational 
radiation, cosmology; inflation in the universe. Re- 
ports by chairmen on symposia in which over 500 
papers and posters were presented. MU 


Applications (Physics), P. Introduction to Super- 
analysis. Felix Alexandrovich Berezin. Ed: A.A. 
Kirillov. Transl: J. Niederle, R. Kotecky. Math. 
Physics & Appl. Math., V. 9. D Reidel (US 
Distr: Kluwer Academic), 1987, xii + 424 pp, 
$109. [ISBN: 90-277-1668-4] The first three chapters, 
Grassman Algebra, Superanalysis, and Linear Alge- 
bra in Z-Graded Spaces, were virtually completed 
before Berezin’s untimely death. The last two chap- 
ters, Supermanifolds in General, and Lie Superalge- 
bras, were compiled largely from his preprints and 
notes. MU 


Applications (Physics), T(18: 1), S, P. The In- 
teracting Boson Model. F. Iachello, A. Arima. Mono. 
on Math. Physics. Cambridge U Pr, 1987, x + 250 
pp, $59.50. [ISBN: 0-521-30282-X] Using the the- 
ory of group transformations, this book describes the 
mathematical framework on which the interacting 
boson model of the nucleus is based and discusses 
applications to the calculation of properties of nu- 


clei. MU 
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and opinionated, with a good sense of historical circum- 
stance, a fine impatience with humbug, a sound grasp 
of the entire mathematical scene, and a gift for clear and 
orderly explanation.” , 

James R. Newman, The World of Mathematics 


“Bell’s writing style makes you feel as if you are a for- 
tunate student spending time with a charming and gifted 
teacher. Not a history of mathematics, but a wide rang- 
ing exposition of ideas in the subject, the book is a classic 
and should be required reading for every teacher of 
mathematics.”’ 

Harold Jacobs, Grant High School, Van Nuys, California 


“This book deserves a place in today’s market. It is a 
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This is an absorbing account of the development 
in pure and applied mathematics from the geom- 
etry of Euclid more than two-thousand years ago 
to some of the more recent developments in math- 
ematical physics. Published 30 years ago, MATHE- 
MATICS: QUEEN AND SERVANT OF SCIENCE 
remains a valuable tool for teachers, students, and 
all those enthusiastic about mathematics. The 
book includes 20 chapters on selected areas in 
pure and applied mathematics, including discus- 
sions on topics such as geometry, algebra, logic, 
probability, infinity, transformations, rings, groups, 
topology, and continuity. Also covered are sub- 
jects such as: Mersenne primes, quantum mechan- 
ics, and three-valued logic. The book is not a 
history of mathematics, but rather an enticing 


sampling of selected topics in mathematics and 
science. 
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including Déligne-cohomology, Adams 
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a thorough introduction to a central topic 
in arithmetic algebraic geometry and 
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area of research. 
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Martin Gardner has charmed readers for over fifty 
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answered in the First Answers section. The solu- 
tion in turn raises another problem that is solved 
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Gauss, Landen, Ramanujan, the Arithmetic-Geometric Mean, 
Ellipses, 7, and the Ladies Diary 


GERT ALMKVIST, University of Lund 


BRUCE BERNDT™*, University of Illinois 


GERT ALMKVIST received his Ph.D. at the University of California in 1966 
and has been at Lund since 1967. His main interests are algebraic K-theory, 
invariant theory, and elliptic functions. 


BRUCE BERNDT received his A.B. degree from Albion College, Albion, 
Michigan in 1961 and his Ph.D. from the University of Wisconsin, Madison, 
in 1966. Since 1977, he has devoted all of his research efforts to proving the 
hitherto unproven results in Ramanujan’s notebooks. His book, Ramanujan’s 
Notebooks, Part I (Springer-Verlag, 1985), is the first of either three or four 
volumes to be published on this project. 


Virtue and sense, with female-softness join’d 

(All that subdues and captivates mankind!) 

In Britain’s matchless fair resplendent shine; 
They rule Love’s empire by a right divine: 

Justly their charms the astonished World admires, 
Whom Royal Charlotte’s bright example fires. 


1. Introduction. The arithmetic-geometric mean was first discovered by Lagrange 
and rediscovered by Gauss a few years later while he was a teenager. However, 
Gauss’s major contributions, including an elegant integral representation, were 
made about 7-9 years later. The first purpose of this article is, then, to explain the 


arithmetic-geometric mean and to describe some of its major properties, many of 
which are due to Gauss. 


*Research partially supported by the Vaughn Foundation. 
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Because of its rapid convergence, the arithmetic-geometric mean has been signifi- 
cantly employed in the past decade in fast machine computation. A second purpose 
of this article is thus to delineate its role in the computation of 7. We emphasize 
that the arithmetic-geometric mean has much broader applications, e.g., to the 
calculation of elementary functions such as log x, e*, sin x, and cos x. The inter- 
ested reader should further consult the several references cited here, especially 
Brent’s paper [14] and the Borweins’ book [13]. 

The determination of the arithmetic-geometric mean is intimately related to the 
calculation of the perimeter of an ellipse. Since the days of Kepler and Euler, several 
approximate formulas have been devised to calculate the perimeter. The primary 
motivation in deriving such approximations was evidently the desire to accurately 
calculate the elliptical orbits of planets. A third purpose of this article is thus to 
describe the connections between the arithmetic-geometric mean and the perimeter 
of an ellipse, and to survey many of the approximate formulas that have been given 
in the literature. The most accurate of these is due to Ramanujan, who also found 
some extraordinarily unusual and exotic approximations to elliptical perimeters. 
The latter results are found in his notebooks and have never been published, and so 
we shall pay particular attention to these approximations. 

Also contributing to this circle of ideas is the English mathematician John 
Landen. In the study of both the arithmetic-geometric mean and the determination 
of elliptical perimeters, there arises his most important mathematical contribution, 
which is now called Landen’s transformation. Many very important and seemingly 
unrelated guises of Landen’s transformation exist in the literature. Thus, a fourth 
purpose of this article is to delineate several formulations of Landen’s transforma- 
tion as well as to provide a short biography of this undeservedly, rather obscure, 
mathematician. 

For several years, Landen published almost exclusively in the Ladies Diary. This 
is, historically, the first regularly published periodical to contain a section devoted 
to the posing of mathematical problems and their solutions. Because an important 
feature of the MONTHLY has its roots in the Ladies Diary, it seems then dually 
appropriate in this paper to provide a brief description of the Ladies Diary. 


2. Gauss and the arithmetic-geometric mean. As we previously alluded, the 
arithmetic-geometric mean was first set forth in a memoir of Lagrange [30] pub- 
lished in 1784—85. However, in a letter, dated April 16, 1816, to a friend, H. C. 
Schumacher, Gauss confided that he independently discovered the arithmetic-geo- 
metric mean in 1791 at the age of 14. At about the age of 22 or 23, Gauss wrote a 
long paper [23] describing his many discoveries on the arithmetic-geometric mean. 
However, this work, like many others by Gauss, was not published until after his 
death. Gauss’s fundamental paper thus did not appear until 1866 when E. Schering, 
the editor of Gauss’s complete works, published the paper as part of Gauss’s 
Nachlass. Gauss obviously attached considerable importance to his findings on the 
arithmetic-geometric mean, for several of the entries in his diary, in particular, from 
the years 1799 to 1800, pertain to the arithmetic-geometric mean. Some of these 
entries are quite vague, and we may still not know everything that Gauss discovered 
about the arithmetic-geometric mean. (For an English translation of Gauss’s diary 
together with commentary, see a paper by J. J. Gray [24].) 

By now, the reader is anxious to learn about the arithmetic-geometric mean and 
what the young Gauss discovered. 
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Let a and b denote positive numbers with a> b. Construct a sequence of 
arithmetic means and a sequence of geometric means as follows: 


1 

a, = 5(a + b), b, = vab, 
1 

a= 564 + by), b, = \a,b,, 


1 
An+1 = 5 (Gn + b,,), B41 = ya b ? 


Gauss [23] gives four numerical examples, of which we reproduce one. Let a = 1 
and b = 0.8. Then 


a, = 0.9, b, = 0.894427190999915878564, 
a, = 0.897213595499957939282, b, = 0.897209268732734, 
a, = 0.897211432116346, b, = 0.897211432113738, 
a, = 0.897211432115042, b, = 0.897211432115042. 


(Obviously, Gauss did not shirk from numerical calculations.) It appears from this 
example that {a,} and {b,} converge to the same limit, and that furthermore this 
convergence is very rapid. This we now demonstrate. 
Observe that 
b<b, <a, <a, 
b<b,<b,<a,<a, <a, 


b<b, <b, <b, <a,<a,<a,<a, 


etc. Thus, {b,} is increasing and bounded, and {a,,} is decreasing and bounded. 
Each sequence therefore converges. Elementary algebraic manipulation now shows 
that 
a, — 5b, a—b a—b 
= _ < _, 
a-—b 4(a,+b,) 2(at+b)+4b, 2 
Iterating this procedure, we deduce that 


1 n 
a,~b, <(5| (a — b), n> 1, 
which tends to 0 as n tends to oo. Thus, a, and b, converge to the same limit, 
which we denote by M(a, b). By definition, M(a, b) is the arithmetic-geometric 
mean of a and b. 
To provide a more quantitative measure of the rapidity of convergence, first 


define 
c,=ya,—be, nO, (1) 
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where a, = a and by = b. Observe that 
1 , c? c? 
= — — = L 
Cnt 9) (a, ”) 4an44 4M(a, b) 


Thus, c, tends to 0 quadratically, or the convergence is of the second order. More 
generally, suppose that {a, } converges to L and assume that there exist constants 
C > 0 and m 2 1 such that 


l@,417~L| < Cla, - L|", n> 1. 


Then we say that the convergence is of the mth order. 
Perhaps the most significant theorem in Gauss’s paper [23] is the following 
representation for M for which we provide Gauss’s ingenious proof. 


THEOREM 1. Let |x| < 1, and define 
K(x) = Mae ~ x°sin’p) "dg. (2) 
0 
Then 


WT 
M(l + x,1—x)= sey: 


The integral K(x) is called the complete elliptic integral of the first kind. 


Observe that in the definition of K(x), sin’p may be replaced by cos’. 
Before proving Theorem 1, we give a reformulation of it. Define 


I(a,b) = [" (a°eos%e + bsin’p) dg. (3) 
It is easy to see that 
I(a,b) = ~ K(x), 
where 


Since 
M(a,b) = M(a,,b,) and M(ca,cb) = cM(a,b), (4) 


for any constant c, it follows that, with x as above, 
1 
M(1+x,1-—x) =—M(a,b). 
a 


The following reformulation of Theorem 1 is now immediate. 


THEOREM 1’. Leta > b> 0. Then 


M(a, b) = 2I(a, b) 
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Proof. Clearly, M(1 + x,1 — x) is an even function of x. Gauss then assumes 
that 


1 00 
= OS 4 x. 
M(1 + x,1— x) Py km (5) 


Now make the substitution x = 2t/(1 + t?). From (4), it follows that 


1 
M(1+x,1—x)= M((1+1t)’,(1-1)’) = Tap +? ,1 — 1), 


14+? 
Substituting in (5), we find that 


4?) ¥ Apt = yA, a ; 


k=0 k=0 


Clearly, A, = 1. Expanding (1 + 1?)~2*~1, k > 0, in a binomial series and equating 
coefficients of like powers of t on both sides, we eventually find that 


1 1 LY (13) ag (E35) 6 
i + — etre woe 
M(1 + x,1—x) 5) [—F] a) 


al . 


cao (Kk!) 

Here we have introduced the notation 
(a), =a(at+1)(a+2)---(at+k-—1). (7) 
Complete details for the derivation of (6) may be found in Gauss’s paper [23, pp. 


367-369]. 
We now must identify the series in (6) with K(x). Expanding the integrand of 
K(x) in a binomial series and integrating termwise, we find that 


] 


K(x) 
(8) 


Combining (6) and (8), we complete the proof of Gauss’s theorem. 


Another short, elegant proof of Theorem 1 has been given by Newman [39] and is 
sketched by J. M. and P. B. Borwein [10]. 

For a very readable, excellent account of Gauss’s many contributions to the 
arithmetic-geometric mean, see Cox’s paper [18]. We shall continue the discussion of 
some of Gauss’s discoveries in Section 5. 


3. Landen and the Ladies Diary. We next sketch another proof of Theorem 1 (or 
Theorem 1’) which is essentially due to the eighteenth-century English mathemati- 
cian John Landen. 
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Second proof. Although the basic idea is due to Landen, the iterative procedure 
that we shall describe is apparently due to Legendre [33, pp. 79-83] some years 
later. 

For brevity, set 


X,=C,/a,, nzO, (9) 


where c, is defined by (1). In the complete elliptic integral of the first kind (2), 
make the substitution 

sin(29) 
x, + cos(2p) 


(10) 


This is called Landen’s transformation. After a considerable amount of work, we 
find that 


tan y, = 


K(x) = (1+ x,)K(~,). 
Upon n iterations, we deduce that 
K(x) = (1+ ~x,)(1 + x,)--- (1+ x,)K(x,). (11) 
Since, by (1) and (9), 1 + x, = a,_,/a,, k > 1, we see that (11) reduces to 


a 
K(x) = —K(x,). 
a, 
We now let n tend to oo. Since a, tends to M(a, b) and x, tends to 0, we conclude 
that 


a an 
M(a, b) 2M(a,b) 


Landen’s transformation (10) was introduced by him in a paper [31] published in 
1771 and in more developed form in his most famous paper [32] published in 1775. 
There exist several versions of Landen’s transformation. Often Landen’s transfor- 
mation is expressed as an equality between two differentials in the theory of elliptic 
functions [17], [37]. The importance of Landen’s transformation is conveyed by 
Mittag-Leffler who, in his very perceptive survey [37, p. 291] on the theory of elliptic 
functions, remarks, “Euler’s addition theorem and the transformation theorem of 
Landen and Lagrange were the two fundamental ideas of which the theory of 
elliptic functions was in possession when this theory was brought up for renewed 
consideration by Legendre in 1786.” 

In Section 4, we shall prove the following theorem, which is often called Landen’s 
transformation for complete elliptic integrals of the first kind. 


K(x) = K(0) = 


THEOREM 2. If 0 < x <1, then 


2Vx 


1+x 


| = (1+ x)K(x). 


In fact, Theorem 2 is the special case a= 7, B = 7/2 of the following more 
general formula. If x sina = sin(2B — a), then 


~1/2 
(1 + x) f (1 — x?sin’p ) ” dp = p=2f ‘{1- ay ; > in’ dq, 
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which is known as Landen’s transformation for incomplete elliptic integrals of the 
first kind. 
To describe another form of Landen’s transformation, we introduce Gauss’s 
ordinary hypergeometric series 
2 (a), (b), 
F(a,b;c;x) = ¥) ———x*, [x] <1, (12) 
kao (c¢),k! 
where a, b, and c denote arbitrary complex numbers and (a), is defined by (7). 
Then 


Fia,b; 2b; 


1 1 
= (1+ x)"F[a,a- b+ 536+ 5527] (13) 


4x 
ce , 


is Landen’s transformation for hypergeometric series. Theorems 1 and 2 imply the 
special case 


=(1+ oF 5 = 


1 4x 
Fl -,=;1; —-- 
PS Tey 

Thus, a seemingly innocent “change of variable” (10) has many important 
ramifications. Indeed, Landen himself evidently never realized the importance of his 
idea. 

Since Landen undoubtedly is not known to most readers, it seems appropriate 
here to give a brief biography. He was born in 1719. According to the Encyclopedia 
Britannica [20], “He lived a very retired life, and saw little or nothing of society; 
when he did mingle in it, his dogmatism and pugnacity caused him to be generally 
shunned.” In 1762, he was appointed as the land-agent to the Earl Fitzwilliam, a 
post he held until two years before his death in 1790. 

As a mathematician, Landen was primarily an analyst and geometer. Most of his 
important works were published in the latter part of his career. These include the 
aforementioned papers and Mathematical Memoirs, published in 1780 and 1789. 
For several years, Landen contributed many problems and solutions to the Ladies 
Diary. From 1743-1749, he posed a total of eleven problems and published thirteen 
solutions to problems. However, Leybourn [34] has disclosed that contributors to 
the Ladies Diary frequently employed aliases. In particular, Landen used the 
pseudonyms Sir Stately Stiff, Peter Walton, Waltoniensis, C. Bumpkin, and Peter 
Puzzlem, who, collectively, proposed ten problems and answered seventeen. 
Leybourn [34] has compiled in four volumes the problems and solutions from the 
Ladies Diary from 1704-1816. Especially valuable are his indices of subject classifi- 
cations and contributors. (The problems and solutions from the years 1704—1760 
had been previously collected by others in one volume in 1774 [50].) 

First published in 1704, the annual Ladies Diary evidently was very popular in 
England with a yearly circulation of several thousand. The Ladies Diary is “de- 
signed principally for the amusement and instruction of the fair sex.” It contains 
“new improvements in arts and sciences, and many entertaining particulars... for 
the use and diversion of the fair sex.” The cover is graced by a poem dedicated to 
the reigning queen and which normally changed little from year to year. Our paper 
begins with the poem from 1776 paying eloquent homage to the beloved of King 
George III. Among other things, the Ladies Diary contains a “chronology of 
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remarkable events,’ birth dates of the royal family, enigmas, and answers to 
enigmas from the previous year. The enigmas as well as the answers were normally 
set to verse. 

The largest portion of the Ladies Diary is devoted to the solutions of mathemati- 
cal problems posed in the previous issue. Despite the name of the journal, very few 
contributors were women. Leybourn’s [34] index lists a total of 913 contributors of 
which 32 were women. Because proposers and solvers did occasionally employ pen 
names such as Plus Minus, Mathematicus, Amicus, Archimedes, Diophantoides, and 
the aforementioned aliases for Landen, it is possible that the number of female 
contributors is slightly higher. In 1747, Landen gave a solution to a problem which 
was “designed to improve gunnery of which there are several things wanting.” Does 
not this have a familiar ring today? Geometrical problems were popular, and rigor 
was lax at times. Here is an example from 1783. Let 


1 
= la and b= Dae 


Show that a/b = ¥2 — 1. In 1784, Joseph French provided the following “elegant” 
solution. We see that 


Thus, b(V2 — 1) = a, and the result follows. 

Those readers wishing to learn more about Landen’s work should consult 
Watson’s very delightful article, “The Marquis and the Land-agent” [52]. Readers 
desiring more knowledge of the mathematical content of the Ladies Diary should 
definitely consult Leybourn’s compendium [34]. (Only a few libraries in the U.S. 
possess copies of the Ladies Diary. The University of Illinois Library has a fairly 
complete collection, although there are several gaps prior to 1774. T. Perl [43] has 
written a detailed description of the Ladies Diary with an emphasis on the 
contributions by women and an analysis of both the positive and negative sociologi- 
cal factors on womens’ mathematical education during the years of the Diary. For 
additional historical information about the Ladies Diary and other obscure English 
journals containing mathematics, see Archibald’s paper [2].) 


4. Ivory and Landen’s transformation. In 1796, J. Ivory [25] published a new 
formula for the perimeter of an ellipse. A very similar proof establishes Theorem 2, 
a version of Landen’s transformation discussed in the previous section. 

Before proving Theorem 2, we note that it implies a new version of Theorem 1. 


THEOREM 1”. If x > 0, then 
m(1+ x) 
2vx 


1+x 


M(1+x,1-x)= 
2K 


Theorem 1’ also follows from Theorem 1”; put x = (a — b)/(a + b) and uti- 
lize (4). 
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Proof of Theorem 2. Using the definition (2) of K, employing the binomial series, 
and inverting the order of summation and integration below, we find that 


2vx L pal, 4x 
rex} 34, Ga+xy} CO 
1 ps 2x 
=>f 1 
2 Yo 


1 m - 
= st x)f (1 + x2 + 2x cos(2~))”” dp 


~1/2 
— Gan" _— cs(29)) do 


1 7 _— _— 
= mae + x)f (1 + xe?) axel + xe~ 7?) ” dep 


(s).y" > (2). se dp 


m! 0 


i) 

_ xn 

7 0° | 

= 7(+x)y +*~ 
2 n=0 


(n!)° 


= (1 + yd 


(1 + x) K(x), 
by (8). This concludes the proof. 


Ivory’s paper [25], establishing an analogue of Theorem 2, possesses an unusual 
feature in that it begins with the “cover letter’ that Ivory sent to the editor John 
Playfair when he submitted his paper! In this letter, Ivory informs Playfair about 
what led him to his discovery. Evidently then, the editor deemed it fair play to 
publish Ivory’s letter as a preamble to his paper. The letter reads as follows. 


Dear Sir, 

Having, as you know, bestowed a good deal of time and attention on the study of that part of 
physical astronomy which relates to the mutual disturbances of the planets, I have, naturally, been 
led to consider the various methods of resolving the formula (a? + b? — 2abcos ~)" into infinite 
series of the form A + Bcosgy + Ccos2q + &c. In the course of these investigations, a series for 
the rectification of the ellipsis occurred to me, remarkable for its simplicity, as well as its rapid 
convergency. As I believe it to be new, I send it to you, inclosed, together with some remarks on 
the evolution of the formula just mentioned, which if you think proper, you may submit to the 
consideration of the Royal Society. 


I am, Dear Sir, 
Yours, & c. 
James Ivory 


5. Calculation of 1. First, we define the complete elliptic integral of the second 
kind, 


E(x) = era — x?sin’p) dg, 


where |x| < 1. Two formulas relating the elliptic integrals E(x) and K(x) are the 
basis for one of the currently most efficient methods to calculate 7. The first is due 
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to Legendre [33, p. 61]. We give below a simple proof that appears not to have been, 
heretofore, given. 


THEOREM 3. Let x’ = V1 — x”, where 0 < x < 1. Then 


K(x) E(x’) + K(x) E(x) — K(x) K(x’) = > (14) 


Proof. Let c = x* and c’ = 1 — cc. A Straightforward calculation gives 


d d n/2 c Sin’ @ 
—(E-K)=-— dy 
dc x, (1 — csin’p) 
-=-—(7__* 
2e 2¢%0 (1 — sin’) 
Since 
d SIN Y COS M 1/2 _ 3/2 
—— | ——-—————;; | = 1 —csin 1 —csin 
Ie Y | ( 2) = ( “p) 


l-ce sin’p )”” 


we deduce that 


d E E 1 pas. a sin © COs 
Z(E-K)=s-Gat rd eed y 
Y) (15) 
E 1 E 
7 =. 7 =| Qe" 
For brevity, put K’ = K(c’) and E’ = E(c’). Since c’ = 1 — ¢, it follows that 
d E’ 
ql e = 50° (16) 
Lastly, easy calculations yield 
dE E-K dE’ E’ — K’ 
de 2 and ‘de 2c (17) 


If L denotes the left side of (14), we may write L in the form 
_L=EE’—-(E-K)(E’— K’). 
Employing (15)-(17), we find that 


dL (E-K)E' E(E’~K’) , Fle K’) (E-K)E'_ 


Hence, L is a constant, and we will find its value by letting c approach 0. 
First, 


n/2 sin’ 
E-K=-c{ © ——~,; dp = O(c 
i (1 — csin’p)” 
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as c tends to 0. Next, 
a /2 ~ n/2 _ 
k'= | / (1 — c’sin’p) Me dp < / ; (1-0) dg 
0 0 


= O(c" '/”), 
as c tends to 0. Thus, 
lim b= lim {(E — K)K’' + E’K} 
c> c-0 


lim { O(c!”? 1 ° ° 
= + _— _- — 
lim { (e“") 5 | 2” 


and the proof is complete. 


The second key formula, given in Theorem 4 below, can be proved via an 
iterative process involving Landen’s transformation. We forego a proof here; a 
proof may be found, for example, in King’s book [29, pp. 7, 8]. 


THEOREM 4. Let, fora > b> 0, 
J(a,b) = [(aeos* + b?sin2p )”” dq, (18) 
0 


and recall that c,, is defined by (1). Then 


00 


J(a,b) = [a — ; y ra b), 


n=0 
where I(a, b) is defined by (3). 


Note that 
J(a, b) = aE(x), 


where x = (1/a)Va’ — b’. 
Theorems 3 and 4 now lead to a formula for 7 which is highly suitable for 
computation. 


THEOREM 5. If c,, is defined by (1), then 
4M7(1,1/y2 ) 
T= ———o9SOCOStCSCS<2C;~S~=S;«T“~S~S 
1-— Yo 2"*le? 


n=1 


Proof. Letting x = x’ = 1/2 in Theorem 3, we find that 


2x(Jo(2-)—w[L]- 5. as 


Setting a = 1 and b = 1/2 in Theorem 4, we see that 


o(}= [1-4 Eoe)a( 4) 20 
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since I(1, V2) = K(1/ y2) and J(1,1/ v2) = E(1/ y2). Lastly, by Theorem 1’, 
M(1,1/v2 ) - 21 
’ / _— y) K(1 / /2) . ( ) 
Substituting (20) into (19), employing (21), noting that c? = 1/2, and solving for 7, 
we complete the proof. 


According to King [29, pp. 8, 9, 12], an equivalent form of Theorem 5 was 
established by Gauss. Observe that in the proof of Theorem 5, we used only the 
special case x = x’= 1/2 of Legendre’s identity, Theorem 3. We would like to 
show now that this special case is equivalent to the formula 


f’ dx f x? dx _% 
oVvl—-x44o Vi—-x4 9 4? 


first proved by Euler [22] in 1782. (Watson [52, p. 12] claimed that an equivalent 
formulation of (22) was earlier established by both Landen and Wallis, but we have 
been unable to verify this.) The former integral in (22) is one quarter of the arc 
length of the lemniscate given by r* = cos(2q), 0 < » < 2m. The latter integral in 
(22) is intimately connected with the classical elastic curve. For a further elaboration 
of the connections of these two curves with the arithmetic-geometric mean, see 
Cox’s paper [18]. 

In order to prove (22), make the substitution x = cos qm. Then straightforward 
calculations yield 


(22) 


ae res 


and 


1 1 1 x*dx 
lee) -alie| eC ee 
It is now easy to see that Legendre’s relation Theorem 3 in the case x = x’ = 1/ V2 
implies (22). 

In 1976, Salamin [47] rederived the forgotten Theorem 5, from which he estab- 
lished a rapidly convergent algorithm for the computation of 7. Recall that in 
Section 2 we demonstrated how rapidly the arithmetic-geometric mean converges 
and thus how fast c, tends to 0. Tamura and Kanada have used this algorithm to 
compute 7 to 274 (over 16 million) decimal places. An announcement about their 
calculation of 7 to 27? decimal places was made in Scientific American [48]. Their 
paper [27] describes their calculation to 10,013,395 decimal places. More recently, 
D. H. Bailey [3] has used a quartically convergent algorithm to calculate 7 to 
29,360,000 digits. 

Newman [40] has obtained a quadratic algorithm for the computation of 7 that is 
somewhat simpler than Salamin’s. His proof is quite elementary and avoids 
Legendre’s identity. It should be remarked that Newman’s estimates of some 
integrals are not quite correct. However, the final result (middle of p. 209) is correct. 

In 1977, Brent [14] observed that the arithmetic-geometric mean could be 
implemented to calculate elementary functions as well. Let us briefly indicate how 
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to calculate log2. From Whittaker and Watson’s text [53, p. 522], as x tends to 
1- ’ 


K(x) ~ 


4 
los — 
8h — x? 


From Theorem 1’, 


K(x) = 


7 
2M(1,V¥1 — x?) 
and so 


7 


Ce a EE aan re TD 


4 
logs ———— = | 
ex? 2M(1, V1 — x?) 


Taking ¥1 — x? = 4-27”, we find that 
T 


082 ~ FaM(L,2=")” 


for large n. 

Further improvements in both the calculation of 7 and elementary functions 
have been made by J. M. and P. B. Borwein [8], [9], [10], [11], [12], [13]. In 
particular, in [8], [11], and [12], they have utilized elliptic integrals and modular 
equations to obtain algorithms of higher order convergence to approximate 7. The 
survey article [10] by the Borwein brothers is to be especially recommended. Carlson 
[16] has written an earlier survey on algorithms dependent on the arithmetic-geo- 
metric mean and variants thereof. 


Postscript to 7. The challenge of approximating and calculating 7 has been with 
us for over 4000 years. By 1844, w was known to 200 decimal places. This 
stupendous feat was accomplished by a calculating prodigy named Johann Dase in 
less than two months. On Gauss’s recommendation, the Hamburg Academy of 
Sciences hired Dase to compute the factors of all integers between 7,000,000 and 
10,000,000. Thus, our ideas have come to a full circle. As Beckmann [5, p. 104] 
remarks, “It would thus appear that Carl Friedrich Gauss, who holds so many firsts 
in all branches of mathematics, was also the first to introduce payment for computer 
time.” The computer time now for 29 million digits (28 hours) is considerably less 
than the computer time for 200 digits by Gauss’s computer, Dase. 


6. Approximations for the perimeter L of an ellipse. If an ellipse is given by the 
parametric equations x = acosq@ and y = bsing, 0 < @ < 27, then from elemen- 
tary calculus, 


2a . 1/2 
L=L(a,b) = a’sin’y + b’cos*p) “” dp 
| | (23) 
= 4](b, a), 


where J(b, a) is defined by (18). Thus, we see immediately from Theorems 1’ and 4 
that elliptical perimeters and arithmetic-geometric means are inextricably inter- 
twined. Ivory’s letter and our concomitant comments also unmistakenly pointed to 
this union. 
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Before discussing approximations for L, we offer two exact formulas. The former 
is due to MacLaurin [36] in 1742, and the latter was initially found by Ivory [25] in 
1796, although it is implicit in the earlier work of Landen. 


THEOREM 6. Let x =acosqg and y=bsng, 0< gm << 2a. Let e= 
(1/a)Va? — b’, the eccentricity of the ellipse. Then if F is defined by (12), 


L(a, b) = dnaF| =, — . 1; °) (24) 
2 2 
= na + 6)F{-3 - 55158 (25) 
27> 2° PP 
where 
a-—b 
A= a+b. 


Proof. The proofs are very similar to those in Sections 2 and 4. First, using (23), 
expanding the integrand in a binomial series, and integrating termwise, we deduce 
that 


L(a, b) = 4a (1 — e*cos’p)'”” dp 
0 


1 
(a) a. 
=4a) 4 en f ”* cos?" do (26) 


Thus, (24) is established. 


We indicate two proofs of (25). First, in Landen’s transformation (13) of 
hypergeometric series, set a= —1/2, b=1/2, and x = A. We immediately find 


that 
rit toy. 2) a ote et bee 
| 2 ’ 2 9 +9 e —_ .a | 9) ’ 9) 779 ° 
By this formula and (24), formula (25) is demonstrated. 
The second proof that we mention is that of Ivory [25]. Using (26), proceed 
exactly in the same fashion as in the proof of Theorem 2 in Section 4. 


In fact, there exists a third early formula for L(a, b). In 1773, Euler [21] proved 


that 

1 
—7, 731; 
4°4 


L(a, b) = 7/2(a2 + B*) F 


Although Euler proceeded differently, we mention that his formula may be derived 
from MacLaurin’s via a certain quadratic transformation for hypergeometric series 
that is different from Landen’s. Euler’s formula also trivially leads to an approxima- 
tion for L(a, b) given in our table below. 
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The problem of determining L(a, b) is not as venerable as that for determining 
a. However, some have argued (not very convincingly) that the problem goes back 
to the time of King Solomon, who hired a craftsman Huram to make a tank. 
According to 1 Kings 7: 23, “Huram made a round tank of bronze 5 cubits deep, 10 
cubits in diameter, and 30 cubits in circumference.” The implication is clear that the 
ancient Hebrews regarded 7 as being equal to 3. It has been suggested, perhaps by 
someone who believes that ““God makes no mistakes,” that “round” and “depth” 
are to be interpreted loosely, and that the tank really was elliptical in shape, with 
the major axis being 10 cubits and the minor axis being about 9.53 cubits in length. 

As might be expected. the primary impetus in finding methods for calculating 
elliptical perimeters arises from astronomy. In 1609, Kepler [28] offered perhaps the 
first legitimate approximations 


L=a(a+b) and L = 2zayab, 


although, as we shall see, his arguments were not very rigorous and 27Vab was 
intended to be only a lower bound for L. Kepler [28, p. 307] first remarks that the 
ellipse with semiaxes a and b and the circle with radius Yab have the same areas. 
Since the circle has the smaller circumference, 


L > 2avab. 


He [28, p. 368] furthermore remarks that (1/2)(a + b) > Vab, and so concludes 
that 


1 
Lz= 2a-(a + b). 


Kepler appears to be using the dubious principle that quantities larger than the 
same number must be about equal. 

Approximations of several types, depending upon the relative sizes of a and b, 
exist in the literature. In this section, we concentrate on estimates that are best for a 
close to b. Thus, we shall write all of our approximations in terms of A = 
(a — b)/(a + b) and compare them with the expansion (25). For example, Kepler’s 
second approximation can be written in the form 


L=a(at+b)1—%X)”. 
We now show how the formula 


27 1 2 
L(a, b) = 4J(a, b) = wep” — 3 > 2 (27) 


arising from Theorems 1’ and 4, can be used to find approximations to the 
perimeter of an ellipse. Replacing M(a, b) by a, and neglecting the terms with 
n > 2, we find that 


a+b \ 


va + vb 


This formula was first obtained by Ekwall [19] in 1973 as a consequence of a 
formula by Sipos from 1792 [54]. 


= £47 


27 
L(a,b) = —|a*-—-¢ 
ay 
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If we replace M(a, b) by a, in (27) and neglect all terms with n > 3, we find, 
after some calculation, that 


2(a +b)’ — (va — vb) 
(Va + Vb) + 22Va +b Vad. 


This formula is complicated enough to dissuade us from calculating further ap- 
proximations by this method. 

We now provide a table of approximations for L(a, b) that have been given in 
the iiterature. At the left, we list the discoverer (or source) and year of discovery (if 
known). The approximation A(A) for L(a, b)/7(a + b) is given in the second 
column in two forms. In the last column, the first nonzero term in the power series 
for 


L(a,b) = 27 


L(a, b) 
A(X) — m(a+b) 


is offered so that the accuracy of the approximating formula can be discerned. For 
convenience, we note that 


1 
=A — — << —_— — * e 2 
(A) F| 2’ zis A | 


elt bye) ara tag ben £3 25 yn 
--,-731; =1+4+ —N + GM + GN + aN + aN + 
| 2° 2°” | 4 4° 44 4’ 4° 


2Vab 
Kepler [28], 1609 =(1 —)/” 
a+b 


/2(a* + b7) 
Euler [21], 1773 —————— = (14+ )!” 
a+b 


Sipos [54], 1792 2(a + b) 2 


Ekwall [19], 1973 (Va + yoy —14yi-x 


1 
Peano [42], 1889 = (1 — Wy! 


2/3 
Muir [38], 1883 


a? + p3/2 | 


1 
= ja {a + A741 -Apeye 


Lindner [35, p. 439], 
1904-1920 
Nyvoll [41], 1978 


Imer [49], 1975 1+ ———__ 
Selmer [49] (5a + 3b)(3a + 5b) 


1 
=1+ 7% 1 
1-—Y¥ 
16 
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Ramanujan [44], [45],1914  2_V04 + 34)Ga + 5) 


Fergestad [49], 1951 a+? 


=3-y4-X 
; 2(a + by - (Va — vb)" 
(a+ b){ (Va + Vb) + 2V2 Va + 6 vab | 
- (1+vi-¥) +¥V1_-x% 
(4+ Vi-¥)(14+ f=) 
Bronshtein and 1 64(a +b) — 3(a— by! 


Semendyayev — 7 
[15], 1964 16 (a + b) (3a + b)(a + 3b) 


64 — 3x 


Almkvist [1], 1978 


Selmer [49], 1975 


Selmer [49], 1975 


a+b 


) 7 2/2(a? + 6ab + b*) 


Jacobsen and 256 — 480° — 21X" 33 lo 
Waadeland [26], 1985 256 — 11222 + 3X4 2i8 
2 
Ramanujan 14 3X _ _ xlo 
[44], [45], 1914 10 + ¥4 - 3% au 


The two approximations by India’s great mathematician, S. Ramanujan, were 
first stated by him in his notebooks [46, p. 217], and then later at the end of his 
paper [44], [45, p. 39], where he says that they were discovered empirically. 
Ramanujan [44], [45] also provides error approximations, but they are in a form 
different from that given here. Since 


a-b 1-—-yl-—e*~ e? 
ath y4+yi-e 4’ 


we find that, for the first approximation, 


ns o2/4 6 ol2 ol2 
a(a + b) > = ma(1 +yl1-—- ayy < 27a—= = 7470» 


which is the approximate error given by Ramanujan. Similarly, for the second 
approximation, Ramanujan states that the error is approximately equal to 


20 


31 36 » 


which is in agreement with our claim. The exactness of Ramanujan’s second formula 
for eccentricities that are not too large is very good. For example, for the orbit of 
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Mercury (e = 0.206), the absolute error is about 1.5 X 10~'? meters. Note that if 
we set b = 0 in Ramanujan’s second formula, we find that 7 ~ 22/7. 
Fergestad [49] rediscovered Ramanujan’s first formula several years later. 
Despite Ramanujan’s remark on the discovery of these two formulas, Jacobsen 
and Waadeland [26] have offered a very plausible explanation of Ramanujan’s 
approximations. We confine our attention to the latter approximation, since the 
arguments are similar. Write 


(5 -pbx)-1+ a (28) 
oa a 4(1+w)- 
Then it can be shown that w has the continued fraction expansion 
_ 7} — 2 32 _}), 
16 16 16 48 


re 


If each numerator above is replaced by —3A?/16, then we obtain the approximation 
1 


w= 1 (-? + V4— 30), 
Substituting this approximation in (28) and then using (25), we are immediately led 
to the estimate 


L(a, b) 1 3x 
———— = 1 + ——_————.. 
m(a +b) 10 + ¥4 — 3 
Since Ramanujan’s facility in representing analytic functions by continued fractions 
is unmatched in mathematical history, it seems likely that Ramanujan discovered his 
approximations in this manner. 


In the next section, we examine some approximations for L(a, b) of a different 
type given by Ramanujan in his notebooks [46]. 


7. Further approximations given by Ramanujan. [n his notebooks [46], Ramanu- 
jan offers some very unusual formulas, expressed in sexagesimal notation, for 


L(a, b). The first is related to his approximation 3 — V4 — ’ given in Section 6. 


THEOREM 7. Put 


1 
L(a, 6) = a(a + b)(1 + 4sin?50), 0<b6<2/4, (29) 
and 
ind = Asi a7? (30 
sin@ = A sina, = TS. ) 


Then, when the eccentricity e =1, a = 30°18’6”, and as e tends to 0, a tends 
monotonically to 30°. 


It is not clear how Ramanujan was led to this very unusual theorem. The variance 
of a over such a small interval is curious. 


Proof. We shall prove Theorem 7 except for the conclusion about monotonicity. 
However, we shall show that a > 7/6 always. 
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For brevity, we write (25) in the form 


L(a, b) 00 
——— = 2" A| <1. 31 
w(a+b) oN IAI (31) 
It then follows from (29) and (30) that 
1 00 
3 — 2V¥1 — Msin’a = 1+ 4sin’> 6 = Yar", |A| <1. (32) 
n=0 
Next set 


3-V4-N = YBr", JA] <2. (33) 
n=0 


As implied in Section 6, a, = B,, n = 0,1,2. We shall further show that, for n > 3, 
B, < «,/2"~*. (34) 


From the definitions (31) and (33), respectively, short calculations show that, for 
n> 1, 


One,  (2n- 1)’ Biv 2n— 1 
x Qnb2ye 4 B ~ Meet): 


Thus, 


Bsr [O41 n+1 1 
—_——— SS < _, 
B, a, 2(2n-1) 2 

if n > 2. Proceeding by induction, we deduce that 
B+t 1 B, 1 
Qn+1 2a 


for n > 2, and the proof of (34) is complete. 
From (32) and (34), it follows that 


3-—V4-N <3-2yl1 — Msin’a. 


Solving this inequality, we find that sin’a > 1/4, or a > 7/6. 
Second, we calculate a when e = 1. Thus, A = 1 and @ = a. Therefore, from (25) 
and (32), 


14 Asin? F : = ‘ 35 
sin’ 5a = sg bly= a. (35) 


This evaluation follows from a general theorem of Gauss on the evaluation of 
hypergeometric series at the argument 1 [4, p. 2]. Moreover, this particular series is 
found in Gauss’s diary under the date June, 1798 [24]. Thus, 
1 1 1 
sin’—a = — — — = 0.0683098861. 
2 TT 4 


It follows that a = 30°18’6”. 
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Third, we calculate a when e = 0. From (30) and (32), 


1 
* 2 
san ein? , sin2@ , 4 sin 59 
im sinta = lim = lim —— 
a A OC 1200—Cl 
00 1 
= limA~? w= a= —. 
0 Lo ‘4 


Thus, a tends to 7/6 as e tends to 0. 

Ramanujan [46, p. 224] offers another theorem, which we do not state, like 
Theorem 7 but which appears to be motivated by his second approximation for 
L(a, b). 

Ramanujan [46, p. 224] states two additional formulas each of which combines 
two approximations, one for e near 0 and the other for e close to 1. Again, we give 
just one of the pair. A complete proof of Theorem 8 below would be too lengthy for 
this paper, and so we shall just sketch the main ideas of the proof. Complete details 
may be found in [7]. 


THEOREM 8. Set 
tan 0 
L(a,b)=a(a+b)——, 0<0<7/2, (36) 


and 
a-—b 7 
a+b (37) 


Then as e increases from 0 to 1, a decreases from 7/6 to 0. Furthermore, a is 
approximately given by 


2Vab Va _ Vb ; a—b\’ 
30° + 60197497 et VO) — 1°10’55” . 
a+b a+b 


tan @ = Acosa, \ = 


a+b (38) 


Proof. If e = 0, then A = 0 and @ = 0. The argument is very similar to that in 
the proof of Theorem 7, and we find that 


lim cos*a = 3a, = 3/4. 
40 
Thus, a = 7/6 when A = 0 =e. 
We next determine a when e = 1. Thus, A = 1 and tan @ = cosa by (37). From 


(25), (36), and (35), 


tan 6 
0 


A=1 


Thus, 6 = 7/4 and a = 0. 

It appears to be extremely difficult to show that as A goes from 0 to 1, a 
monotonically decreases from 7/6 to 0. It can be shown [7], however, thatO <a < 
a/6, always. A proof depends upon a continued fraction for tan~ ‘x. 
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The proof of (38) is very difficult, and we provide only a brief sketch. We observe 
(again) that 


2Vab 


a+b 


1-% 


9 


and so 


2 2 _ _2vab _ 2 
Vi ~ (1-8) = 7 (la — vo) 


Thus, Ramanujan is attempting to find an approximation to a of the form 
Vl1-W(4+B{1-v1-¥}+ cw), (39) 


which will be a good approximation both when X is close to 0 and when AJ is near 1. 
Our task is then to determine A, B, and C. 
With a considerable amount of effort, it can be shown that [7] 


a= — —- —W+0(d') (40) 


in a neighborhood of \ = 0. The proper expansion near A = 1 is even more difficult 
to obtain because F(— 4, — 4;1; A*) is not analytic at X = 1. However, there does 
exist an asymptotic expansion for F(— 4, — 4;1; A”) as A tends to 1—, and 


employing this, we can show that [7] 


4- a 
a= \ 1- +o0(vy1- 2), (41) 
27 —4 
as A tends toi —. 


Having omitted the hard analysis, we now determine A, B, and C from (40) and 
(41) with little difficulty. When A tends to 0, (39) tends to A. Thus, A = 7/6, by 
(40). Next, examine (a — 7/6)/X’ as A tends to 0. From (39) and (40), we find that 


2173 
— 10 + 58 +C=- 160” 

Now check a/ y1 — ’ as A tends to 1 — . From (39) and (41), we see that 

T 4- qa 

6 +B+C= a 
Simultaneously solving these last two equalities, we conclude that 

4- qa 21V3 7 

B=2 apd 7 BO. OtIOT935 


and 


c=- /—— 21v3 0.0206291 
3 Vw-4 8 1° 


Converting A, B, and C to the sexagesimal system and substituting in (39), we 
complete the proof. 
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Although Ramanujan is well known for his approximations and asymptotic 
formulas in number theory, he has not been adequately recognized for his deep 
contributions to approximations and asymptotic series in analysis, because the vast 
majority of his results in the latter field have been hidden in his notebooks. These 
notebooks were begun in about 1903, when he was 15 or 16, and are a compilation 
of his mathematical discoveries without proofs. The last entries were made in 1914, 
when he sailed to England at the urging of G. H. Hardy. Although the editing of 
Ramanujan’s notebooks was strongly advocated by Hardy and others immediately 
after Ramanujan’s death in 1920, it is only recently that this has come to fruition 
[6]. 

We have not attempted to give complete proofs of some of the theorems that we 
have described, but we hope that the principal ideas have been made clear. We have 
seen that a chain of related ideas stretches back over a period exceeding two 
centuries and provides impetus to contemporary mathematics. Ideas and topics that 
appear disparate are found to have common roots and merge together. For further 
elaboration of these ideas, readers should consult the works cited, especially Cox’s 
paper [18], the papers and book of J. M. and P. B. Borwein [8]-[13], a paper by 
Almkvist [1] written in Swedish, and Berndt’s forthcoming book [7]. 

We are most grateful to Birger Ekwall for providing some very useful references. 
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Introduction. Although geometers have studied the properties of triangles for 
over two thousand years, there still remain problems of interest. For example, a 
given triangle T generates a sequence of triangles {7,,} where 7,,,, is the pedal 
triangle of 7, (n > 0, Ty = T), this being the triangle whose vertices are the feet of 
the altitudes of T,. This sequence was discussed by Hobson [4], [5] but, while his 
formulae for the transition from T,, to T,,,, are correct, those for T, in terms of T 
are not. Lacking correct formulae, we experimented numerically, taking the angles 
of 7 to be integers in degrees. To our surprise the angles in the pedal sequence 
became periodic with periods of twelve steps. This curious fact led to an investiga- 
tion of pedal sequences for general triangles, revealing that (a) the sequence may 
stop by degeneration of the triangle to a straight segment, (b) the angles may 
develop any periodicity, or (c) the sequence may proceed to infinity without 
periodicity. 

In this article (Theorem I) we give necessary and sufficient conditions on the 
angles of T corresponding to these options. We show that any combination of 
angles which are rational multiples of 7 leads either to a periodic sequence or to 
degeneracy and obtain (Theorem II) an explicit expression for a period correspond- 
ing to these angles in terms of the Euler function 9. 

A second surprising result which came to light, while we were experimenting with 
these cyclic sequences, was that eventually triangles became not only similar but 
also parallel to earlier triangles in the sequence. Indeed for an n-cycle in which T,, is 
similar to T, 7,,,, is similar to T;, etc, the triangle 7%, is both similar and also 
parallel to T (Theorem IV). 


609 


610 JOHN G. KINGSTON AND JOHN L. SYNGE _ [August-September 


1. The pedal triangle. The pedal (or first pedal) triangle T’ of a given triangle T 
of non-zero area is formed by joining the feet of the perpendiculars dropped from 
the vertices of T on the opposite sides, produced if necessary. Coxeter [1] prefers the 
word orthic to pedal, but we shall use the latter word, following a well-established 
tradition. 

The pedal of T’ is the second pedal of 7, and so on. The sequence stops if we 
encounter a right-angled triangle; its pedal is a straight line segment. The condition 
for this is given in §3. 

Hobson [4], [5] discussed pedal sequences, stating correctly that the sides a’, b’, 
and c’ and angles A’, B’ and C’ of T’ are given in terms of the elements of T by the 


following formulae: if A, B and C are all acute, 
a’ =acosA, b’ = bcos B, c’ = ccosC; (1.1) 
A'=a-2A, B’=a-2B, C’=2-2C; 


if A is obtuse, 


/ 


a’=-acosA, b’=bcosB, c’ = ccosC, (1.2) 
A’ =2A—-f2, B’ = 2B, C’ =2C, 


with similar formulae if B or C is obtuse. He gave formulae for the elements of the 
nth pedal, different according as n is odd or even. For n odd, his formula for the 
first angle of the nth pedal is 


1(2" + 1)a — 2"A; (1.3) 


if n = 1 (first pedal) this gives 7 — 2A, correct for A acute by (1.1) but false for A 
obtuse by (1.2). Consequently Hobson’s formulae for the angles of the mth pedal 
must be rejected. In fact the occurrence of obtuse angles makes it impossible to 
write down any reasonably simple explicit formulae for the angles of the nth pedal. 
But Hobson’s formulae for the sides of the nth pedal are correct to within a sign, 
and we may write for the first side of the nth pedal 


a, = tacos Acos2Acos4A ---cos2”~ "A, (1.4) 


the sign being chosen to give a positive value. 

If we write A = av, B = Ba, C = yz, thena + B + y = 1 and (a, B, y) can be 
interpreted as the barycentric coordinates of a point interior to a triangle of 
reference PQR (see Figure 1). The formulae given in (1.1) and (1.2) for the angles of 
the pedal triangle then have a particularly pleasing representation. If A is obtuse, a 
is greater than 4 and (a, B, y), which represents the angles of T, maps to the point 
(2a —1, 28, 2y), which represents the angles of the pedal triangle 7’. In the 
triangle PQR this is a dilatation from P (1,0, 0) with scale factor 2. Similarly for B 
or C obtuse we have corresponding dilatations from Q or R. If the triangle is 
acute-angled then a, B, y < 1/2 and (1.1) shows that (a, 8, y) maps to (1 — 2a, 
1 — 28, 1 — 2y). This is a dilatation from S, the point (1/3, 1/3, 1/3) with scale 
factor —2. We thus have a discontinuous (when one of a, B, y is 1/2) 4-to-l 
mapping of the triangle POR to itself. 

As an example we consider a triangle T with angles represented by barycentric 
coordinates (<3, i34, 747). Figure 1 shows the locations of the points which represent 
T and its subsequent pedal triangles 7,, T,,..., T>. Note that T, = T. This is the 
sequence of triangles (5.3) which arises in §5. 
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2. The angles of the nth pedal triangle. The notation A’, a‘, i = 1,2,3 is now 
introduced to represent the angles A, B, C and sides a, b, c of the triangle T. The 
angles and sides of the nth pedal triangle 7, will thus be represented as A’, and a’. 
Equations (1.1) and (1.2) show that the angles progress according to the rule 


Ai=+2A(modz), i =1,2,3, (2.1) 


where the sign depends on the nature of the original triangle 7, but is the same for 
each 7. For the angles of T,,, 


A‘ = 2"E,A'(mod 7), (2.2) 
where E,, = +1 and is independent of i. 


3. Pedal degeneracy. The pedal of a right-angled triangle is a straight segment 
and this has no pedal; the pedal sequence stops. We call T pedally degenerate (PD) 
iff a right angle occurs in T or in the pedal sequence which starts from T. It is 
evident from (2.2) that T is PD iff one of its angles is of the form 


a(2m — 1)/2", (3.1) 


where m and 7 are positive integers. Thus the angles responsible for pedal 
degeneracy are 
50,10, G7... 37, 20,..., 27, &T,.... 
A pedal sequence is infinite if it starts from a triangle which is not PD. 


4. Pedal cycles. Let 7, and 7,,, (n > 0) be triangles in the pedal sequence 
which starts with T assumed not PD. If A‘ = A‘,,, i = 1,2,3 (that is correspond- 


S+n? 


ing angles of 7, and 7,,,, are equal) we say that we have an n-cycle starting with T,,. 
It is clear that 7. is similar to T,,,,, Tyaon.--- - 


S 
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We now describe the nature of the angles of JT which give rise to an n-cycle by 
means of the following theorem. 


THEOREM I. If s >0, n >0 and T is not PD, then T, starts an n-cycle iff the 
angles of T are of the form 


Ai = mpi/(28(2" + E)) i =1,2,3, (4.1) 
where E = +1 and p' are positive integers satisfying 
pit+p?t+ p=25(2" +E). (4.2) 


Proof of sufficiency. We have to show that, if the angles of T are of the form 
(4.1) then A‘, = A‘. Using (2.2) with (4.1) gives 


S+n 
Ai =4+ap'/(2" +E) (modz) 


and 
Ai, = tap2"/(2"+ E) (mod 7). 


The signs in these two equations may differ but for each equation the same sign 
applies for all i. In any of the 4 cases either A’ + A‘,, or A‘ — A‘,, is an integral 
multiple of 7. Since A‘ and A‘,,, are both in the interval (0, 7) they are therefore 
either equal or supplementary, and in either case have the same sine. By the sine 
rule the ratios of the lengths of a triangle are the same as the ratios of the sines of 
the opposite angles. The triangles T, and T,,,, therefore, have their sides in the 


same ratio, and are similar. We must also have A‘, ,, = A‘. 


Proof of necessity. We have to show that if A‘,, = A‘, then the angles of T are 
as in (4.1). Equation (2.2) gives 


Ai. =2'E,A' (moda), Ai,, =25t"E,,,A' (mod7z), 
where EF, = +1, E,,,, = +1. Combining these two equations gives 
2°(2”+ E)A'=0 (modz), 


where E = +1 and is independent of i, so that A’ are indeed of the form (4.1). 
Necessity is now proved. 

Note that in Theorem I the combination n = 1 and E = —1 may be excluded 
since JT is then PD. This means that 1-cycles only occur when FE = +1. Forn> 1 
n-cycles exist whether FE = +1 or E = —1. 

It might appear from the preceding theorem that, if the angles of T are as in 
(4.1), then the n-cycle does not start until we reach 7,. But that is not necessarily the 
case. If the values of p’ in the partition happen to have the common factor 2/ 
(l<j<s) then p’'=2/g' and (4.1) yields A’ = 2q'/(2°/(2” + E)). Applying 
Theorem I we now see that Aj_, = A,- j+n> SO that the n-cycle begins with 7,_, (if 
not earlier) for such choices of the p’. 


5. Examples. The only triangle which maintains its form under the pedal process 
is the equilateral triangle: here we have a monocycle (n = 1), and it seems that there 
is no more to be said. But Theorem I shows that we may start with a scalene triangle 
which, after a possible delay represented by s, settles down into a monocycle. 
Choose, for example, n = 1, s = 4, E = 1, so that 2°(2” + E) = 48. We are now to 
take a partition of 48, but taking care to avoid pedal degeneracy. Thus 48 = 3 + 5 
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+ 40 and, since 3/48 = 1/16, (3.1) shows that we have a PD triangle. But 
48 = 4+ 7+ 37 and the corresponding triangle is not PD. To see how the sequence 
develops, we go back to (1.1) and (1.2), starting with the angles #7, 47, i7. The 
factor 737 being understood, so that a right angle is represented by 24, and with 
obtuse angles marked by an asterisk, the calculation proceeds as follows: 


4 7  37* 

8 14 26% 
16 28* 4 (5.1) 
32* 8 8 
146 16 = 16 
16 16 = 16 


The monocycle has been established in the fourth step. 

In §6 we shall discuss the dodekacycle (n = 12) from which this work originated. 
As further examples here we shall consider two heptacycles (n = 7) without delay 
(s = 0), one corresponding to each value of E. 

For E = 1, we have 2°(2” + E) = 129, and we see from (3.1) that no partition 
can represent a triangle which is PD. Let us see what happens to a triangle which is 
nearly equilateral, displaying the sequence as before (this time with a factor of 57 
being understood and a right angle being represented by 645). 


42 43 44 
45 43 41 
39 430—Cts«O47 
51 8643 0— 35 
27 43 59 (5.2) 
75* 43 11 
21 86* 22 
42 43 44 
The cycle is completed in seven steps. 
For E = —1, we have 2°(2” + E) = 127, which is prime, and no partition can 


give a triangle which is PD. With a right angle at 634, and a triangle nearly 
equilateral, we have 


400 43 44 
47 41 39 
33, 45—Ss« 49 
61 37 29 (5.3) 
5 53 69% 
10 106* 11 
20 85* 22 
40 43 44 


The cycle is completed in seven steps. 


6. Dodekacycles. The preceding results are the outcome of observing the ex- 
istence of 12-cycles in numerical experiments with triangles whose angles were each 
an integral number of degrees. This can be proved by noting that 27(2'? — 1) is a 
multiple of 180, so that each such angle can be represented in the form (4.1) with 
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n= 12, s=2 and E = —1. By Theorem I, a 12-cycle begins after a delay of at 
most two steps. Pedal degeneracy is avoided if the triangle does not have an angle of 
45°, 90° or 135°. 
To illustrate the delays, here is an example, the angles being given in degrees: 
61 63 56 
58 54 68 


64 72 44 
52 36 92* 
104* 72 4 
28 144* 8 
56 =6©108* 316 
112* 36 32 
44 72 64 
92* 36 52 

4 72  104* 

8 144* 28 
16 108* 56 
32 36—s«112* 


64 72 44 


We see that, depending on whether we start with T having angles (61, 63, 56), 
(58, 54, 68) or (64, 72, 44) the 12-cycle starts with T,, 7, or T, respectively. 


7. Cycle lengths. We have to thank the Babylonians for dividing the semicircle 
into 180 degrees, since otherwise pedal cycles might have remained undiscovered. 
But a closer examination shows that cycles would have been found had we taken 
angles integral in seconds of arc or integral in grades (100 grades = 90°) or, more 
generally (in radians), as any rational multiples of 7, 


A'=q'/M, (7.1) 


where g' and M are positive integers. 

The question now arises of how long the cycles will be for a given initial set of 
angles. A periodic length will be determined (Theorem II) in this section but the 
result may not give the shortest period. For example, we may prove that a 12-cycle 
exists when it may be the case that 6-cycles exist. 

By Theorem I we know that, if T has angles given by (7.1), and is not PD, it 
generates an n-cycle if there exist integers p' > 0, s > 0, n > 0 to satisfy 


gi2§(2" + E) = piM, (7.2) 


with £ = 1 or —1. 

A solution (not unique) can always be found by making use of the Euler function 
@ whose relevant properties (Dickson [2]) are as follows: ¢(m) is the number of 
positive integers, not exceeding the positive integer m, which are relatively prime to 
m: if p,, P>,---, Pp, are the distinct prime factors of m, then 


o(m) = m(1 — py')(1 — pz") «+ (1 — px"); (7.3) 


1988] THE SEQUENCE OF PEDAL TRIANGLES 615 


if a is prime to m, 


a? =1 (mod m). (7.4) 


THEOREM II. If a triangle T is not PD and has angles 
A' = 1q'/M, 
where q', M are positive integers, then T generates an n-cycle with n = 4(M,) where 
M, is the odd integer given by 2'M, = M, where t is zero or a positive integer. 


Proof. Since M, is odd and T is not PD it follows that M, > 3. Hence the form 
of ¢(M,) given in (7.3) is even and it may be shown that 


2°%)/2 = F, (mod M,), 
where FE, = 1 or —1 (see Appendix). A solution of (7.2) may be obtained by taking 
s=t, E=-E, n=9(M,)72 
p= yq', 
where y is the positive integer determined by 
9o(M))/2 _ E, = yM,, 


and so the theorem is proved. 

Let us apply the theorem to the case of a triangle with angles integral in degrees. 
If M = 180 = 2? x 45, so that s = 2, M, = 45, 6(M,) = 24, Theorem II shows 
that we must get 12-cycles as already proved in §6. For M = 180 Theorem II gives 
the “best” result in that triangles exist whose shortest pedal cycle is a 12-cycle. 

We invite the reader to experiment with triangles whose angles are integral in 
grades, for which Theorem II predicts 10-cycles. 

Theorem II does not always give the best possible result and the value ¢(M,)/2 
for cycle length can be wildly out, compared with the shortest cycle, as the following 
example (Ribenboim [6]) shows: when M = M, = 10937 (1093 is prime), #(M,)/2 
= 1093 X 546 = 596,778. But 2! = —1 (mod M,), so that a solution of (7.2) may 
be found with n = 182, and we may conclude that there exists a cycle of length 182, 
a considerable improvement on $(M,)/2. 


8. A projective map. A well-known property of elementary geometry (see, for 
example, Theorem 20 in Durell [3]) is that if the pedal triangle T’ of a triangle T is 
magnified by a factor 2 with respect to the centre H, the orthocentre of T, then the 
vertices of the magnified 7’ will lie on the circumcircle of T. Hence, if this 
magnification is performed at each stage of the pedal sequence, we may represent all 
members of a pedal sequence by triads of points on a single circle, the circumcircle 
of the triangle from which the sequence starts. This is illustrated in Figure 2. For 
future use we introduce some further notation. After n of these pedal maps we 
suppose that the vertices of A, B,C of triangle T are mapped to A,, B,, C, of 
triangle 7,. The properties of the pedal map show that 7, will be similar and 
parallel to the actual nth pedal triangle T, but magnified by factor 2”. 


9. Pedal ancestry. Given a triangle T which is not PD, it generates a unique 
infinite pedal sequence, which may be cyclic or, in general, not so. A triangle T_, of 
which T is the pedal may be called an antipedal of T and so we are led to consider 
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FIG. 2(a). FIG. 2(b). 


all triangles T_, (with n positive) which include T in the pedal sequence generated 
by them. To use a biological term, these constitute the ancestry of T. 

The relations between the angles A‘, A‘,, of the triangles T, and T,,, may be 
obtained from (1.1) and (1.2). If T, is acute-angled then (1.1) gives A‘,,, = 7 — 2A’, 
so that 


Ai=Va—-Ai,,), i=1,2,3. (9.1) 


If A’ is an obtuse angle then (1.2) gives 41,, =2Al—2, A?,, =2A?, 42, = 
2 A?, so that 


A, = (Anat + 7), Ay, = tAnat Ay = tAnst- (9.2) 


There are two other possibilities, namely A*, obtuse and A? obtuse. For these cases 
formulae analogous to (9.2) hold. 

There is thus a fourfold ambiguity in the antipedal: any triangle has four parents, 
each step backward in the pedal sequence being determined only by our decision to 
make 7', acute-angled, or to have a specified angle obtuse. 

In terms of the representation in barycentric coordinates illustrated in Figure 1, 
for each triangle represented by a point within PQR there is one antipedal in each 
of the 4 triangular regions shown within PQR. 

Every person now living has had a unique ancestor a thousand generations ago if 
one goes back along a male line or a female line. Similarly, every triangle has a 
unique line of ancestors if it proceeds through acute-angled triangles or through 
triangles in which some specified angle is obtuse. 

To follow the acute ancestry we note that (9.1) can be arranged 


_ 9 ly; . 7 
4,- 3 = = 5 (4, - >) (9.3) 


Thus for any triangle, acute or obtuse, we see that as we trace back this line of 
ancestry the angles of successive triangles converge to 7/3. The triangles however 
rapidly become very large since the circumradius is doubled in each step backward. 
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As for the obtuse ancestry, with Aj, obtuse say, we use (9.2). Thus at each step we 
halve the angles A? and A? and see that the angles in the sequence of ancestors 
converge to 7, 0, and 0. 

The fourfold ambiguity in the antipedal renders the backward extension of a 
pedal cycle complicated, since, to reproduce it, we would need to choose the proper 
parent at each step. Our knowledge of the distribution of obtuse angles in a cycle is 
so far purely experimental. But note that no cycle, except the 1-cycle, can consist of 
purely acute angles, for (9.3) shows that, as a sequence of acute triangles progresses, 
each acute angle differs from 7/3 by twice as much as its predecessor. Obviously 
one angle will sooner or later become obtuse. 


10. Spin and rotation in a pedal sequence. All of the properties of pedal se- 
quences which have been discussed so far have referred only to the shape and size of 
the triangles. We now investigate some of the positional properties of such se- 
quences of triangles. 

By the spin of a triangle A,B,C, in a pedal sequence we mean the counterclock- 
wise or clockwise arrangements of the vertices, taken in alphabetical order. Note 
that, if a triangle is acute-angled (Figure (2a)), the spin of its pedal triangle is the 
same as that of the original triangle whereas if a triangle has an obtuse angle (Figure 
2(b)), the spin of the pedal triangle is reversed from that of the original triangle. 

In order to obtain the main results of this section, Theorems III and IV, we now 
prove a preliminary result concerning the rotation of the sides of a triangle in one 
stage in the pedal sequence. 


LEMMA. Let triangle ABC have counterclockwise spin and let A’, B’, C’ be the 
corresponding vertices of its pedal triangle. Let R( BC), R(CA), and R(AB) be the 
angles through which the directed sides BC, CA, and AB must be turned counterclock- 
wise to make them directed respectively the same as the directed sides B’C’, C’A’, 
A’B’. Then 


R(BC) = B-—Ct+qa (if A i acute ) | 

B-C (if A is obtuse ) 
where the congruence is modulo 27. Corresponding expressions hold for R(CA) and 
R(AB). 


To verify the lemma for the case where all angles of triangle ABC are acute, refer 
to FIGURE 2(a). If we turn BC to BB’, then to B’B, and finally to B’C’, and 
accumulate the counterclockwise angle of turning between each successive pair of 
directed segments, we obtain, modulo 27, 


R(BC) =(5-c)+a-(5-B)=B-C+a. 


(To see that angle BB’C’ is equal to 7/2 — B, note that both B’ and C’ lie on the 
circle having diameter BC. Angles BB’C’ and BCC’ are inscribed angles of this 
circle which include the same arc BC’, and are thus equal. But angle BCC’ is the 
complement of angle B). The case where A is obtuse is shown in Figure 2(b), and 
the case where either angle B or angle C is obtuse is not illustrated, but in either of 
these cases it is straightforward to verify the lemma, using the same approach as 
above. 
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THEOREM III. Jf triangle ABC has pedal triangle A’B’C’, then, using the notation 
of the preceding lemma, 


a (if all angles of ABC are acute ) 


R( BC) + R(CA) + R( AB) = 
( ) (CA) (AB) fi (if ABC has an obtuse angle ) 


where the congruence is modulo 27. 

This is proved, if ABC has counterclockwise spin, by combining the expressions 
which the lemma gives for R(BC), R(CA) and R( AB). If ABC has clockwise spin, 
the values of R(BC), R(CA) and R(AB), in the lemma, have opposite sign and the 
conclusion of the theorem, being modulo 27, still holds. 


THEOREM IV. If triangle T starts an n-cycle then the triangle T,,,, obtained after six 
complete cycles, is oriented similarly to T and, if Tg, is magnified by the factor 2°” 
with respect to any point, the resulting triangle may be obtained from T by giving T a 
translation. 

This theorem is proved by considering two cases. 


Case 1. T,, has spin opposite to that of T. Clearly the triangles encountered in 
the continuation of the pedal sequence from T,, to T,,, will be similar to those in the 
cycle from T to T,, but with reversed spins. All of the angles of turning for the 
sides, involved in the steps between 7,, and T,,, will be the same as those in the steps 
between T and T,, except with signs reversed, so that the net angle of turning for 
each side during this double-length cycle is zero. Then T,,, is oriented the same as T, 
and of course 7;,, 1s also. 


Case 2. T, has the same spin as T. If T has vertices A, B, C let R,( BC) denote 
the total counterclockwise rotation of the directed side BC (and its successors) in 
the steps of the cycle from T to T,, and similarly for R,(CA) and R,( AB). Since T,, 
is similar to 7, with the same spin, we have 


R,(BC) = R,(CA) = R,(AB) (mod2z). 


Also, R,( BC) + R,(CA) + R,,(AB) can be found by adding the expressions given 
by Theorem III for each step of the n-cycle, each of which is 0 or 7, and so this total 
is an integral multiple of 7. Therefore R,( BC), R,(CA), R,(AB) are integral 
multiples of 7/3. Since T, has the same spin as 7, the turning of the sides in 
successive n-cycles is added on in the same sense (instead of cancelling as in case 1). 
Thus in six complete cycles each side will have been rotated through an integral 
number of complete rotations. 

In either case, 7%, is oriented the same as 7, with correspondingly lettered 
vertices in corresponding positions. The discussion in §8 shows that if 7%, 1s 
magnified by the factor 2°” it can be translated to coincide with T. 

To illustrate the preceding results let us take s = 0, n = 4, EF = —1 in (4.2), so 
that 2°(2” + E) = 15, and start with the partition (1,3,11). Then, with a factor 
q/15 understood, the 4-cycle is as follows: 


1 3 11* 

2 6 7 
11* 3 #1 

7 6 

1 3 «11* 
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FIGuRE 3 illustrates this pedal sequence as far as the sixth step, repeatedly using the 
pedal map described in §8 to magnify the triangles so that they are all inscribed in 
the same circle. (This avoids microscopically small triangles.) Note that 7 (which is 
T, magnified) in fact coincides with T, but with A, = C and C, = A. Evidently 
another 6 steps will produce 7, in exactly the same position as T with A, = A, 
B,, = B, C,, = C. Theorem IV predicts that six 4-cycles produce a triangle T,, 
which when magnified by 24 will be a pure translation of 7. The fact that in this 
example we only need three 4-cycles to make analogous claims for 7,, is because 
the basic 4-cycle has an even number (two) of acute-angled triangles making 
R, (BC), R,(CA), R,(AB) even multiples of 7/3 in case 2 (T, has the same spin 
as T) of the proof of Theorem IV. 


11. Conclusion. We have discovered two facts: the negative fact that Hobson 
was in error and the positive fact that pedal cycles exist. In general we have been 
unable to locate the pedal point, by which we mean the point which is the limit of an 
infinite pedal sequence, cyclic or not. However, one consequence of Theorem IV is 
that if a triangle starts a pedal n-cycle then its pedal point is collinear with A and 
A,,, With B and B,, and with C and C,,. 


Professor Synge thanks Professor J. T. Lewis, a colleague, for useful discussions. We are also grateful 
to the referees for their helpful comments and acknowledge the use of their suggestions. 


Appendix 
THEOREM. If n > 3 is an odd integer, then 
2°()/2 = +1 (modn) (A.1) 
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Proof. 


Case 1. n is a power of a single prime factor. 
Suppose n = p* where p > 3 is prime and where s > 1. Then, since 2 is prime to 
n, (7.4) gives 


2°) —1=0 (mod p‘). (A.2) 
(7.3) shows that $(7) is even and (A.2) may, therefore, be factorized to give 
(29(")/2 4 1)(29/2 — 1) =0 (mod p‘). (A.3) 


The two factors on the left-hand side of (A.3) differ by 2 and, since p > 3, they 
cannot both have p as a factor. Hence one of the factors must be divisible by p* and 
(A.1) immediately follows. 


Case 2. n has two or more different primes as factors. 
Suppose n = p*Q where p is prime, s > 1, and where Q > 3 is not divisible by 2 
or p. If q,,...,q, are the distinct prime factors of Q, we have from (7.3) that 
o(n) =p’ (p — 1)Q(1 - g7')-- (1-977). (A.4) 
Also from (7.3), 


$(p*) = p**(p — 1). (A.5) 
Comparing (A.4) and (A.5) it is clear that, since j > 1, (7) is an even multiple of 
( p*). But case 1 gives 


29(P)/2 = +1 (mod p’), 
so it follows that 
2°()/2 = 1 (mod p’). 


Since this last congruence holds for each prime factor of in place of p (with s 
depending on p), the congruence must hold modulo n, giving (A.1) and completing 
the proof. 


REFERENCES 


1 H.S. M. Coxeter, Introduction to Geometry, John Wiley and Sons, New York, 1969, pp. 5, 18. 

2 L.E. Dickson, Modern Elementary Theory of Numbers, Chicago University Press, 1939, pp. 9-12. 

3 C. V. Durell, Modern Geometry, McMillan and Co., London, 1952, p. 31. 

4 E. W. Hobson, A Treatise on Plane Trigonometry, Cambridge University Press, 1897, pp. 194-200. 
5 , A Treatise on Plane Trigonometry, Cambridge University Press, 1925, pp. 197-203. 

6 P. Ribenboim, Algebraic Numbers, Wiley- Interscience, New York, 1972, p. 64. 


Magic Squares Over Fields* 


CHARLES SMALL, QUEEN’S UNIVERSITY 


1. Introduction 


A magic square of size n over a field F is an n X n matrix with entries in F for 
which every row, every column, the principal diagonal, and the principal backdiago- 
nal all have the same sum. The set of all such matrices is an F-vector space. We 
compute its dimension. The answer is n? — 2n (independent of F) for n > 5; for 
n <5 the results depend on the characteristic of F. 

Magic squares with integer entries have a rich history and an extensive literature: 
see, for example, section A20 of [1] for an indication of recent work. Magic squares 
over fields have been comparatively neglected, however, and throughout the litera- 
ture the emphasis is on generating new magic squares rather than examining the 
structure of all magic squares of fixed size. In [2] a theorem equivalent to ours is 
stated for magic squares of size n > 3 over the real numbers. 

While the present results are, therefore, probably not new, except possibly in the 
case of positive characteristic, the proofs may be worth consideration as an 
appealing application of elementary techniques in linear algebra. Two features are 
worthy of special note (pedagogically as well as mathematically): the phenomenon, 
widespread in mathematics and here particularly transparent in the proof, of 
peculiarities in low dimensions which vanish as soon as there is “enough room”; 
and the special role, in the low-dimensional cases, of positive characteristic. 

To begin, we make the definition precise: a magic square of size n and weight d 
over a field F is a matrix A = (a;,;) (1 <i, j <n) with entries in F satisfying: 


a,,=d for all 


a..=ad for all i 


Ms TMs 
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Qi n—it1 = 4. 


Here n > 1 is an integer and d &€ F. 
Let B be the transpose of the row vector (with n? entries) obtained by listing the 
rows of A in order: 


t 
B= (yy5 yay ee 5 Uys Tayy eee y Aggy eee y Anise sey Ann) » 
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Then clearly the 2n + 2 conditions (*) can be written as a single matrix equation 
C,B = d(1,1,...,1)', where C, is the (2n + 2) X n* matrix constructed as follows: 

Let e, = (1,0,...,0), e, = (0,1,0,...,0),..., e, = (0,...,0,1) be the standard 
basis for F” and let 


j=(1,1,...,1)= Le; 
i=1 


The first row of C,, 1s (e;, €;,-.., €;), the second is (e5, @,..., @), and so on to get 
the first n rows. The (n + 1)st row of C, is (/,0,...,0), the (m + 2)nd row is 
(0, 7,0,...,0), and so on to get the next n rows; here 0 is the n-tuple of 0’s. Finally, 
the (2n + 1)st row of C, is (e,, é3,...,@,) and the (2n + 2)nd is (e,, €,_1,--+5 €1): 

The set M,( F) of magic squares of size n (and arbitrary weight) over F is clearly 
an F-vector space with respect to the usual matrix operations; and the mapping 
w: M,(F) — F which assigns to any magic square its weight is clearly linear. The 
kernel Z,(F) of w is the set of magic squares of size n and weight 0. We let 
m,(F) = dim,;M,(F) and z,(F) = dim,Z,,(F). There are two possibilities: either 
w is onto and m,(F) = z,(F) + 1, or w is the 0 map and m,(F) = z,(F). We will 
see (§3, Corollary) that the latter situation arises only for n = 2 or 3 and char. F = n. 
Notice that w is certainly onto in characteristic 0, and in characteristic p when 
p tn, since in these cases the matrix with all entries 1 is a magic square of nonzero 
weight. 

It is clear from this discussion that to compute m,(F) we must compute the rank 
of C, and determine exactly when w is 0. In §3 we do these simultaneously by 
computing the rank of C, augmented by a column of 1’s. Since the argument there 
requires n > 5, we pause first to handle the anomalous cases n < 4. 


2. The cases n < 4 

THEOREM. (a) For n = 2 we have z,(F) = 0 and m,(F) = 1, unless char. F = 2 
in which case z,(F) = m,(F) = 1. 

(b) For n = 3 we have z,(F) = 2 and m,(F) = 3, unless char. F = 3 in which 
case 2,(F) = m,(F) = 3. 

(c) For n = 4 we have z,(F) = 7 and m,(F) = 8, unless char. F = 2 in which 
case z,(F) = 8 and m,(F) = 9. 


Proof. (a) It is immediate that for any F the only magic squares of size 2 are the 
scalar multiples of 


The results follow. 


(b) For n = 3 we augment C, by a column of 1’s and row-reduce to get 


10010 0 1 0 0 = #:1 
010 01 0 0 1 0 =:1 
001 0 0 1 0 0 1 #31 
00011 1 0 0 0 = :1 
000 01-1 1 0-1 30° 
00 0 0 0 3-3 0 3 = =:1 
00 0 0 0 0 1 1 21 =:1 
000 0 0 0 0 0 0° :0 
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Thus C, has rank 7 unless char. F = 3, in which case the rank is 6. Hence 
z;(F) = 9 —7=2 unless char. F = 3, in which case z,(F) = 9 — 6 =3. The 
augmenting column gives no trouble (and w is onto) unless char. F = 3, in which 
case there is trouble in the sixth row (and w is 0). The results follow. 

(c) For n = 4 we augment C, by a column of 1’s and row-reduce to get 


10 001 00 0 1 0 0 0 1 0 0 0 =:1 
0 10 0010 0 0 1 0 0 0 1 0 0 =:1 
0 01 0 0 0 1 0 0 0 1 0 0 0 1 0 =:1 
0 0 0 10 0 0 1 0 0 0 1 0 0 0 1 = :1 
0 0 0 01 11 1 00 0 0 0 0 0 0 =:1 
0 0 0 00 2 1 +21-1 0 1 0-1 0 0 1 = :1 
0 0 0 0 0 0 1-1 0 1 0-1 1 0 0-1 = :0 
0 0 0 0 0 0 0 0 1 1 1 1 0 0 0 0 3:1 
0 0 0 00 0 0 0 0 00 0 1 21 7 ~=21 = :1 
00 0 00 0 0 0 0 0 0 0 0 0 0 0 = :0 


If char. F # 2 the rank is 9 and there is no trouble from the augmenting column (so 
w is onto), so that z,(F) = 16 — 9 = 7 and m,(F) = 8 in this case. In characteris- 
tic 2 the obvious additional row reductions show that the rank is 8 (with, again, no 
trouble from the augmenting column): the eighth row is now the sum of the sixth 
and seventh. The results follow. 


Remark. It is an easy exercise, which we leave to the reader, to carry the row 
reductions of C, and C, a little further and then read off bases for Z,(F) and 
Z,(F), and Gf the augmenting column of 1’s is retained) for M,(F) and M,(F). 
The same remark applies for n = 5,6, .... 


Examples: 


(a) 


OFS 
SS © 
Ee One 
SS Oe 


is a magic square of size 4 and weight 1 in characteristic 2; so is 


1+x 0 1+x 1 
1 1+x 1 


Xx 
1 d+tx 0 text frany~. 
x 1 1+x 1 
111 1 -1 0 1 oO -1 
(b) 1111, 0 1 -1],/ 0 -1 1 
111) \-1 +o +1f/ \-1 #1. «0 


form a basis for the magic squares of size 3 in characteristic 3. 
(c) In characteristic zero, 


1 0 -1 0 1 -l 
X=/]-2 0O 2} and Y=|-1 0 1 
1 0 -1 1 -l 0 


form a basis for the magic squares of size 3 and weight 0, while X and Y together 
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with 


form a basis for the space of all magic squares of size 3. Thus a typical 3 X 3 magic 
Square in characteristic 0 has the form 


2r+s 2r+t -r-s-t 
—2r-—2s-t r 4r+2s+t}; 
3r+st+t —t — Ss 


its weight is 37. 


3. The general case, n > 5 


As in §2, we argue by augmenting C, with a column of 1’s and row reducing. We 
will see that assuming n >5 gives enough room so that the case distinctions 
depending on char. F no longer arise. 


THEOREM. For n > 5, m,(F) = n(n — 2) = 2z,(F) + 1 for all fields F. 


COROLLARY. The weight mapping w is onto unless char. F = n = 2 or char. F =n 
= 3. (In other words, all magic squares of size p over fields of characteristic p have 
weight 0 provided p = 2 or 3, but in all other cases magic squares of all weights 
abound.) 


Example. 


ORR ee 
Se HS Or 
RP Ore 
a eee 
— et ed EY © 


is a magic square of size 5 with nonzero weight in characteristic 5, 


Proof. Everything follows if we show that, for n > 5, w is onto and C, has rank 
2n + 1; for once w is onto we have m,(F) = z,(F) + 1, and rank (C,) = 2n + 1 
implies z,(F) = n* — (2n + 1) = n(n — 2) — 1. Let D, denote C, augmented by a 
column of 1’s. The sum of the first n rows of D, is (1,1,...,1, 2) and this is the 
same as the sum of the next 1 rows. Thus D, is row-equivalent to the matrix E,, 
obtained by replacing its (m + 1)st row by a row of zeroes. Now E, has 2n + 1 
nonzero rows; we claim E, has rank 2n + 1. (If so, we are done.) 

The first n rows of E, have their initial nonzero entries in columns 1,2,..., 7. 
The (” + 1)st row is Q, and the following n — 1 rows of E, have their initial 
non-zero entries in columns n + 1,2n + 1,...,(m — 1)n + 1. The last two rows of 
E, are (@,, @9,-.-,@,,1) and (e,, e,_1,---, €,, 1). Subtract the first row from the 
next-to-last to make it (0, e, — e),...,e@, — €,,0) and subtract the nth row from the 
last to make it (0,e,_, — e,,---,@, — @,,0). Finally, add the (7 + 2)nd row (the 


=n? ° e 


one just under the row of zeroes) to the next-to-last, making it 


(0, e) — & + J, @3 ~ Eqo- seo @y ~ €1,1). 
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We now count to verify that the 2n + 1 nonzero rows of our reduced version of 
E., have their initial nonzero entries in distinct columns. We already have initial 
nonzero entries in columns 1 through n and n + 1,2” + 1,...,(m — 1)n + 1. Since 


e,—e,+ f= (0,2,1,1,...,1), 


the next-to-last row has its initial nonzero entry in column n + 2 if char. F # 2 and 
n+ 3 if char. F = 2. Since e,_, — e, = (0,...,0,1, —1), the last row has its initial 
nonzero entry in column 2n — 1. Finally, since n > 5 we have n+ 3 < 2n — 1; 
thus the 2” + 1 initial nonzero entries of the rows do indeed occur in distinct 
columns. Done! 


COROLLARY. When F is infinite, the probability that an n X n matrix chosen at 
random will be a magic square is 0. When F is finite, with cardinality q, the probability 
is q~? when n = 2, q~' when n = 4 and q is even, and q™ 2” in all other cases. 
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The Jerusalem Ticket Problem 


JANET CRANDELL TREMAIN 


JANET CRANDELL TREMAIN: Currently I am a first year graduate student in 
mathematics at the University of Missouri in Columbia, Missouri. I’ve 
completed a master’s degree with a thesis in modal logic. During the last two 
years I’ve attended mathematical lectures and conferences in Denmark, 
Poland, Germany, Switzerland, and Israel. I hope to take some graduate 
mathematics courses at Princeton and to eventually be a mathematical 
researcher. 


Introduction. As one’s mathematical knowledge increases, so does an ability to 
intuit the mathematically “obvious.” This sort of intuition can lead to immeasurable 
frustration for mathematicians attempting unencumbered lives in a world main- 
tained by nonmathematical clerks. One such case in point is that of Professor Joram 
Lindenstrauss, mathematical researcher at the Hebrew University of Jerusalem, 
Israel. 

In the spring of 1986, Professor Lindenstrauss approached the ticket counter of 
Jerusalem Theater to purchase festival tickets for his large family and some visitors. 

He arrived on the first day to insure receiving the advertised free ticket per every 
three purchased from distinct performances. After presenting his large order involv- 
ing varying numbers of tickets for selections of music, dance, and theater, he 
requested one-fourth of his order for free as the advertisement had implied. 

The ticket agent had no idea he was dealing with a famous mathematician, and 
since he could not decompose the order into the appropriate groups of three for 
each free ticket he claimed it would not result in one-fourth the order free. 

After recovering from disbelief at what the agent could not grasp, Professor 
Lindenstrauss sketched out a simple sufficient condition for the “Ticket Problem” 
(this subcase is subsumed under case one of the theorem below). 

The ticket agent was not mathematically inclined and stubbornly stood by his 
statement that the order could not be legitimately decomposed as he reviewed the 
unfamiliar figures this customer had presented. 

Realizing the difficulty, Professor Lindenstrauss physically grouped his tickets, 
thereby convincing the chagrined clerk. 

In what follows, we will give a complete solution to the “Jerusalem Ticket 
Problem.” 


The problem. What is the decomposition of a ticket order maximizing the 
number of free tickets received if one is received per three purchased for distinct 
events? 


The solution. 
Notation 


(1) For 1<is<n, a person buys a, tickets for performances A,;, We may 
assume that a, >a,2> -°:- 24,,. 


626 


1988] THE JERUSALEM TICKET PROBLEM 627 


(2) Let / = [X%_,a,/4], ie., an upper bound for the number of free tickets. 

(3) [a] represents the greatest integer less than or equal to a. 

(4) Let /* = the actual number of free tickets a person will receive given a 
purchase. 

(5) Let [= max{m| the total tickets ordered can be decomposed into m 
disjoint sets of three tickets each with no two tickets for the same perfor- 
mance in one set by our counting procedure}. 


Note: / is given by our counting procedure. 


THEOREM. If one buys a, tickets for performance A,, 1 <i<n, anda, >a,2= 
> a,, then: 

(1) If L_,a, = 2a,, then Il* = 1. 

(2) If L_,a; < 2a, and L_3a; < a>, then I* = min{L}_,a,, 1}. 

(3) If L7_,a; < 2a, and U_,a; > a, then I* = min{{X?_,a,/2], 1}. 


Note: The proof of the theorem is deferred. 


Remark. Our goal is to determine the actual number of free tickets a person will 
receive, /*, given a purchase. It is clear one can get no more free tickets than the 
total number of tickets divided by four, so we will call this maximum /. By applying 
a technique for breaking down the order into the greatest possible number of groups 
of three tickets for distinct events, we will find a way of computing the value of i. 
Later, our lemma will give us the actual number of free tickets by the relationship: 


I* = min{/, 7}. 


LEMMA. 
= min{/,/}. 


Proof. Clearly, /* </ and /* < I, and hence /* < min{/, 1}. Also, if i</ then 
the person is eligible for at most I free tickets and we use up 3/ tickets which must 
be paid for to get the I free ones. 


So altogether, we use 4/ < 4/ < ¥"_,a, = the total number of tickets. That is, the 
person is eligible for at most / free tickets and there are enough groups of three for 
distinct events to produce this. Also, there are enough tickets left over for the person 
to get the / free ones. Therefore, in this case, /* = f= min{/, /}. 

Finally, if / < i, then the person is eligible for / free tickets but / is the maximum 
number possible. So, the person gets / free tickets. That is, in this case we also have 
that /* =] = min{/, 1}. 


Counting procedure. Recall: The a, represent the cardinality of the A, and we 
are assuming that a, >a,2> -:-: 24,. 

Now we place the tickets in rows with the tickets for performance A, in the ith 
row. We will consider the three cases of the theorem separately. 


Case 1. We assume that ©7_,a,; > 2a,. Let j = [X7_,a,/3] and let r be the 
remainder in ):”_,a,/3, where r = 0, 1, or 2. Now form three rows using all of the 
tickets where row one has length w,, row two has length w,, and row three has 
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length 7 wnere, 


_ {J ifr =0 
mT \ +1 ifr =lor2 


J ifr =Oorl 
YW, => 
2 j+1 ifr = 2, 


using the following procedure: 

Line up the A, tickets one after another in row one. Now line up the A, tickets 
after the A, tickets and go to A,. Eventually a portion of the tickets from some A, 
will finish row one, i.e., we will have w, tickets in row one. Start row two with the 
rest of the tickets from this A, and continue until row two has w, tickets and row 
three has just j tickets. 

Now we divide the tickets into groups of three by grouping the three tickets in 
column one together, the three tickets in column two together, until we reach 
column /j. It is clear that since a, >a, > --- >= 4,, our groups will contain tickets 
from distinct performances and all the tickets will be grouped except the last r = 0, 
1, or 2. (See FIGURES 1 and 2.) 


A, A, 
A, A, A, A, 
As Ag A, Ag Ag Axo Ay, Ay 


Fic. 1. Arrangement of tickets in Case 1. 


oiial;eo}];e 7]; xX }]* » 4 
x11 x*K 1} OT; O1;O 110 - 
—||—|]4s| [4] || Hee 


Fic. 2. Grouping tickets in Case 1. 


Case 2. We assume that L?_,a; < 2a, and L7_4a; < a>. 

Form row one with A, tickets. Form rows two and three by placing the A, tickets 
in row two and then placing the remaining tickets in row three. (See FIGURE 3.) 

Now we group the tickets into groups of three for distinct performances as in 


case one. Clearly we have exactly as many groups as 1”_,q,. 


Case 3. We assume that L?_,a; < 2a, and L7_3a; > ap. 


1988] THE JERUSALEM TICKET PROBLEM 629 


Fic. 3. Arrangement of tickets in Case 2. 


Form row one with the tickets A, Form rows two and three by placing 
k = [L}_,a,/2] terms of A,, A3,..., 4, consecutively in row two if n is even 
(k + 1if n is odd), and then placing the remaining k tickets in row three. 


After grouping as in case one we get precisely k groups of three distinct tickets 
each. (See FIGURE 4.) 


A, A; 
+ $$$ $ — —— — —  — —  _____f OY 


A, A; Ag A, Ag Ay Ajo Ay, Aj 


Fic. 4. Arrangement of tickets in Case 3. 


Note: The counting procedure gives us i. 
Now we are ready to prove our theorem. 


Proof. 


Case 1. Let j = [X7_,a,/3] and let r be the remainder in /7_,a,/3 where 
r = 0,1,2. Now we apply the counting procedure. As we saw earlier, this procedure 
divides all the tickets except r= 1, or 2 into groups of three distinct tickets. 
Therefore, / = [X"_,a,/3] since we have grouped all tickets except for one or two. 


Hence, 
[= b a3} ap a/4| = |, 
i=1 i=1 


and so /* = min{/,/} = 1. This completes the proof of case 1. 


Case 2. It suffices to prove that £”_,a, = /, by our lemma. If we divide the 
tickets into groups of three different kinds per group, it follows that each group 
contains at least one ticket from one of the performances A; for 3 < i < n. Hence, 
I< L734; 

But, lining up performances A, through A,, end to end as row three (and since 
13a; S a, < a,), by the counting procedure: L?_,a; < /. Therefore, / = L?_,a,. 
This completes the proof of case 2. 
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Case 3. Let k =([X7_,a,/2] and note that L?_,a; > a, implies k > a. After 
applying the counting procedure we get / > k. But, any division of the tickets into 
groups of three different tickets has the property that each group contains at least 
two tickets from the performances A,,..., A,,. Hence, the number of groups is less 
than or equal to [L7_,a,/2] = k. That is, /<k. Therefore, /=k. But now, 

= min{/,/} = min{/, k} = min{/,[2"_,a,/2]. This completes the proof of case 
three and hence of the theorem. An examination of the proof of the theorem gives 
the following corollary. 


COROLLARY. The counting procedure always gives i. Furthermore, T will be the 
correct number of free tickets a person is eligible for and we can add j free tickets of any 
kind where j = [ — 1 when T> 1. 

If | < I then fis the number of free tickets the person will actually receive. In other 
words, these two processes give us a decomposition method yielding the optimal 


grouping. 


Variations. In addition to the obvious variations of this problem (e.g., receiving 
two free tickets for each four purchased), an interesting variation occurs by 
excluding certain of the expensive tickets from qualifying as free. 

Another even more interesting variation occurs by assuming the tickets have 
different prices and that we want to pay the least possible. Note that in this case, we 
may need to take fewer than the optimal number of free ones as the following 
example illustrates. 

Let a, = 3 tickets at $1/each, a, = 3 tickets at $1/each, a, = 1 ticket at $10 
each, and a, = 1 ticket at $10/each. This falls into case two of our theorem. Our 
previous rearrangement without regards to cost can be seen in FIGURE 5. 


re 0) O O O o 
ty D x a 


Fic. 5. Old grouping. Fic. 6. Cost optimizing grouping. 


However, to optimize cost this would not be the correct choice since we would 
pay $24 for tickets in order to receive $2 worth free whereas we could pay $16 for 
tickets in order to get $10 worth free if we make the choice where now a $10 ticket is 
free as in FIGURE 6. 


LETTERS TO THE EDITOR 
Editor: 


Regarding Barry Cipra’s “Introduction to the Ising Model” [3], we find fault with 
neither its prose nor its derivations, which could set a new standard for tutorial 
articles in applied mathematics. Yet, as the author’s stated intent was to attract 
attention to the model, his concluding paragraphs might have offered it as a 
springboard to understanding the more complicated spin glass models which have 
been used in the analysis of “neural networks.” The explosive growth of interna- 
tional and interdisciplinary fascination with these connectionist models is surely one 
of the cultural phenomena of science in this decade [4]. A spark was provided by 
Hopfield [5], whose expression 


N N-1 N 
H(S) ~~ » » SST; ~ » SU; (1) 
j=itl i=l i=1 


for the “computational energy” of the network is analogous to Cipra’s equation 
(1.1). Here S;, is the state of the ith neuron-like element—either firing at its peak 
rate (S; = 1) or resting (S; = 0); and the connectivity matrix ||7; || gives the strength 
of the “synapse” through which the ith unit excites Gf 7;, > 0) or inhibits (if 
T,,; < 0) the jth unit. The state of the ith unit is decided by the threshold test 
applied to its input, 


N 
x; = L ST + U,. (2.1) 
j= 
The binary “McCulloch-Pitts neuron” obeys the rule 
_ fl fx,>0 59 
i \0 if X, <0. (2.2) 


When the 7-matrix is real-valued and symmetric, with all zeros on the diagonal, the 
network evolves toward stable states s which correspond to local minima of the 
computational energy. The “energy landscape” can be configured so that these local 
minima correspond to solutions of constrained optimization and pattern recognition 
problems [9, 10]. In the latter case, the vector U of inputs to the N units might 
represent the pixel pattern on a retina. 

A provocative paper by Ackley, Hinton and Sejnowski [1] proposed simulated 
annealing to dislodge the Hopfield network from local minima and enable it to 
settle into states of still lower energy which would represent better (if still subopti- 
mal) solutions. The network is “heated” by the addition of noise to the input of 
each unit. When these noises are independent, identically distributed random 
variables, the state S takes a random walk on the 2” vertices of a hypercube. The 
stationary distribution, 


Pr(S = s) = exp[-BH(S)] /Lexp[—BH(6')], (3) 


is formally equivalent to Cipra’s equation (1.3). The assertion of Ackley, Hinton and 
Sejnowski, that T = 1/8 is the root mean intensity of noise described by a logistic 
distribution, is not powerfully motivated. Shaw et al. [8] had earlier arrived at an 
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expression like (3) in which B is a “smearing factor” determined from details of a 
stochastic model of the chemical synapse. 

It was over a hundred years ago that Gibbs sought time-invariant solutions to a 
Liouville equation in which the independent variables were the Hamiltonian coordi- 
nates of a multiparticle system and the dependent variable was the probability of 
the system being in a given state. He arrived at a canonical ensemble in which “the 
index of probability [i.e., the log-probability] is a linear function of the energy” of 
the state. This result is expressed by equation (3), usually called a Boltzmann 
distribution. Other functions of the energy, however, will serve this purpose; and the 
fact that the linear dependence (of log-probability on energy) maximizes the entropy 
of the system is not necessarily germane to the question. Belief in the possibility of a 
mathematical treatment of biological intelligence, patterned after statistical thermo- 
dynamics, goes back at least as far as the works of John von Neumann (published 
posthumously). For this belief to find expression in contemporary neural network 
research is not surprising. The mathematician who studies this work must be slightly 
bewildered by derivations which appeal to analogies with statistical physics, some of 
which are complicated by psychological theory [7]. The validity of the Boltzmann 
distribution, in the context of the connectionist paradigm, is solely dependent on the 
existence of models which give rise to it. As far as real neuronal networks are 
concerned, the laboratory experiments which would verify the result have yet to be 
defined. 

The computational technique of simulated annealing traces its roots to the 
Metropolis [6] algorithm, which updates the state of an N-particle system according 
to a stochastic model in which the Boltzmann distribution is expressly assumed 
beforehand. Our own research has led to an alternative derivation of C. R. 
Darnafalski, the amateur mathematician whose unpublished essays we have cited 
elsewhere [2]. The stochastic model, adapted to the present problem, leads to a 
simple algorithm as follows. Pick an integer i © {1,..., N} at random. Compute X, 
according to (2.1). Modify X, by the addition of a real random variable, call it Y;, 
which is symmetrically distributed about a mean of zero. Compute S, according to 
(2.2). These steps are iterated indefinitely with independent, identically distributed 
random numbers {Y,, k = 1,2,...}. It is not hard to see that this gives rise to a 
sequence {S,, k = 1,2,...} of states which constitute a Markov chain. Nonzero 
probabilities can be attributed to transitions which involve at most one component 
of the state vector. With no external input (U = 0), these probabilities depend on 
the 7;, and the distribution of Y. When Hopfield’s conditions are obeyed by the 
former, the stationary distribution can be derived analytically. This distribution is 


N N-1 
Pr(S =s)=Z'exp) YY sys,log{F(7,,)/[1 - F(Z,)]}). 
j=itl i=l 
in which F is the (cumulative) distribution function of Y and the denominator Z is 
the sum over all states which normalizes the discrete density. The assumption of 
logistic noise, as 
F(y) =1/0 + e778”), -w<y<o, 

gives the last equation a particularly simple form (3). 

The central theme of Cipra’s article is the discovery of closed-form expressions 
for the partition function Z. These would allow the analyst to identify the critical 
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temperatures of phase transitions in the physical systems described by spin glass 
models, of which Ising’s is the progenitor. At the present time, however, the creation 
of computational devices to accelerate the simulation of such systems is becoming a 
major R & D initiative, motivated by the quest for computers that can carry out 
associative mappings and solve optimization problems using highly parallel, simple 
computing elements. The same phenomenon which the physicist calls a phase 
transition is, in the connectionist view, the conclusion of a state space search that 
constitutes an intelligent response to given stimuli. Some will argue the futility of 
the quest for such “neutral architectures”; but it seems not unlikely that the effort 
will produce tangible products. The best insights of some talented mathematicians 
will be required to make sense of what these products can and cannot do. 


James P. Coughlin 
Towson State University 
Towson, Maryland 


Robert H. Baran 
Naval Surface Warfare Center 


Silver Spring, MD 
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EDITED BY DAVID J. HALLENBECK, DENNIS DETURCK, AND ANITA E. SOLOW 
Decomposition of Bipartite Graphs Into Paths 


KIRAN B. CHILAKAMARRI 
Department of Aeronautical and Astronautical Engineering, Ohio State University, 
Columbus, Ohio 43210 


Liu and Zaks prove in [1], among other results, that bipartite graph K,, ,,.; can 
be decomposed into paths P,, P,,..., P,, of length (i.e., number of edges in the 
path) 2,4,...,2n, respectively, for all odd n. They conjecture that this cannot be 
done for any even integer n. This conjecture is obviously prompted by observing 
that from K, one cannot obtain both P, and P, at the same time. The purpose of 
this note is to show that we can indeed decompose K,, ,,,, into paths P, Py,..., Po, 
for all even n > 4 and failure to do so occurs only for n = 2. 

We need a few observations to obtain the result. First consider an n X m array 
R,,, m Of points where a point in the ith row and jth column is identified with edge 
(x;, y;) of a bipartite graph K,,,,, with vertex set {X,, X2,.--,Xns Vis Yase++> Vm} 
and edges joining any x, to any y, for 1 < p <n and 1 < q < m. Any path on the 
points of R,, ,, with the properties: (1) travels only along rows or columns (2) uses at 
most two points from any row or column and (3) whose end points do not lie in the 
same row or column, defines a unique path in K,, ,,. If a path with (1), (2), and (3) 
in R,,,, uses N points then the corresponding path in K,, ,, uses exactly N edges 


nym m 
and has no repeated vertices. 

Now the problem of decomposing K,, ,,, into paths P,, Py,..., P,, for even 
n > 4, reduces to covering R,,,,,, with paths using 2,4,...,2n points and each 


satisfying conditions (1), (2), and (3). We will do this for n = 4,6, and 8. For an 
even integer n beyond 8 the construction we give can be continued. A general 
procedure for the construction can be given but we will not do so since it is clumsy 
and only obscures the procedure. 

One can read off paths in K,, from Fic. 1. For example (x,, y,) and (x3, y,) 
form P, in Ky 5 Similarly (x2, yy), (X2, Yo), (X41, Yo), (%1, V3), (% 4, V3), and (X4, Ys) 
form P, in K, 5. For illustration purposes we show the decomposition of K,, in 
Fic. 2. 


Yi 2 Y3 v4 Vs 


Fic. 1. n = 4. Decomposition of K4 , into P,, P,, Ps, Pg. 
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Ji 


» Y2 


Y3 


V4 


Ms 


V7 


X6 


Fic. 3. n = 6. Decomposition of Kg z into P,, Py, Pe, Pg, Pig, Py. 


9 


Fic. 4. n = 8. Decomposition of Kg into P,, Py,.... Pr. 
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The author is grateful to Professor S. Zaks for suggesting the problem. 
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Fermat’s Last Theorem in Combinatorial Form 


W. V. QUINE 
Department of Philosophy, Harvard University, Cambridge, MA 02138 


Fermat’s Last Theorem can be vividly stated in terms of sorting objects into a 
row of bins, some of which are red, some blue, and the rest unpainted. The theorem 
amounts to saying that when there are more than two objects, the following 
statement is never true: 


STATEMENT. The number of ways of sorting them that shun both colors is equal to 
the number of ways that shun neither. 


I shall show that this statement is equivalent to Fermat’s equation x” + y” = z”, 
when nv is the number of objects, z is the number of bins, x is the number of bins 
that are not red, and y is the number of bins that are not blue. There are z” ways of 
sorting the objects into bins; x” of these ways shun red and y” of them shun blue. 
So where A is the number of ways that shun both colors, B is the number of ways 
that shun red but not blue, C is the number of ways that shun blue but not red, and 
D is the number of ways that shun neither, we have 


x"=A+B, y"=A+C, 2"=A+BH+CH+D. 


These equations give x” + y” = z” if and only if A = D, which is to say, if and only 
if the statement above holds. 


A Difference Equation for Strings of Ones 


ANDREW SIMOSON 
Mathematics Department, King College, Bristol, TN 37620 


The classic proof [2, pp. 122-3] that 


almost every number in its binary expansion contains arbitrarily (+) 
long strings of the digit 1 


can be translated into an elegant difference equation argument, as shown below. 
In this note, for each number such as + which has a dual binary expansion, 
interpret its binary expansion as the one terminating in all zeroes. Let J be the half 
open interval [0,1). For each positive integer n, let B, be the set of numbers in J 
containing no string of n 1’s in their binary expansions. To prove (*) it is sufficient 
to show that B, has measure zero. For all integers k > 0,0 <j <n, let Y;(K) be the 
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set of all numbers in J such that the first & digits of their binary expansion 

i. terminate with a string of exactly / 1’s; 

li. contain no string of n 1’s. 
Note that Y,(X) is the union of pairwise disjoint intervals of length 1 /2* for j < k, 
and that Y,(k) is empty for j > k. For example, if n = 2 then Y,(3) = [0,1/8) U 
(1/4, 3/8) U [1/2,5/8) and Y,(3) = [1/8,1/4) U [5/8, 3/4). Also note that 


n—-1 
B,c U Y¥,(k) forall k > 0. (**) 


j=0 


Note that for 0 <j <n -—1, “half” of the numbers that are in Y,(k) are those 
numbers that comprise the set Y,,,(k + 1), whereas “half” of the numbers in 
Y,-,() have a string of n 1’s in the first k + 1 digits of their binary expansions. 
Furthermore for 0 <j < n, the other “halves” of Y,(k) make up Y)(k + 1). Hence, 
if y(k) denotes the measure of Y,(k), then y(k +1) = 32% Zgy,(k) and 
Vink + 1) = 5y,(K) for all k = 0,0 <j <n — 1. Let y(k) be the transpose of the 
n-tuple (yo(k), y,(K),..-, ¥,-1(k)). These relations define the difference equation 


y(k +1) = Ay(k) forall k = 0, where A and y(0) arethen X n matrix (* + *) 


and n X 1 vector: 


1 , 
1 9 0 0 0 
0 1 0 0 0 

A= and y(0) = , 
00140 0 0 VO)= | 9 
0 0 024 0 0 


Then for all k => 0, by (* *) and the definition of y(k) it follows that 


(the measure of B,) < (the sum of the components of y(k)) 
= (the sum of the components of A*y(0)). 


At this point, recall some linear algebra: 


(1) If all nonzero entries of an n X n matrix A are positive then the dominant 
eigenvalue of A is a real number between 0 and the maximum of c,, 
1 <j <n, where c, is the sum of the components of column j of A, [3, pp. 
193-5]. 

(2) If all the eigenvalues of an n X n matrix A have magnitude less than 1 then 
A* converges to the zero matrix as k increases, [3, p. 157]. 


By (1) the dominant eigenvalue of A is a positive number less than or equal to 1. 
But 1 is not an eigenvalue of A. To see this note that the matrix 2A of (* * *) is the 
companion matrix of the polynomial equation 1 + x + x7 + --- +x""'-— x" =0, 
(which is also the characteristic equation of 2A). See [4, p. 25], for example. This 
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polynomial equation (for n > 1) can be written as (1 — 2x” + x”*?) /Q. —x)=0. 
Since 2 is not a root of this equation and since the eigenvalues of A are 5 the roots 
of this equation, then 1 is not an eigenvalue of A. Therefore, by (2), A*y(0) 
converges to the zero vector as k increases, and so B, has measure zero. 

The above argument generalizes for the following extension of (*): almost every 
number in its binary expansion contains any given binary string. To outline this 
procedure, let S = a,a, --- a, bea string of binary digits of length n. Let B, be 
those numbers in / containing no instance of S in their binary expansions. For all 
integers k > 0,0 <j <n, let Y,(K) be the set of all numbers in J such that the first 
k digits of their binary expansion 

i. terminate in the string a,a, --- a,, but in no longer prefix of S; 

ii. contain no instance of S. 

Define y(k) as before, yielding the difference equation y(k + 1) = Ay(k). This 
matrix A resembles that of (* * *) in that 
i. each of its 2n — 1 nonzero entries is 5; 

ii. each of its first n — 1 columns contain exactly 2 nonzero terms; 

iii. exactly n of the nonzero entries are in the upper triangular region of A; 

iv. the remaining n — 1 entries of A are on the (j + 1, /) diagonal of A, where 
l<j<n. 

For example, if S is the string 1101, then the matrix A is 


2 7 O 3 
1 0 0 0 
0 4 1 9g 
0 0 4 0 


For matrices such as these it follows that 1 is not an eigenvalue. To see this let e be 
an eigenvector for A associated with a supposed eigenvalue of 1. Comparing the 
sum of the components of Ae with the sum of the components of e forces the last 
component of e to be 0, which upon successive back substitutions force the 
remaining components of e to be 0, which means that 1 is not an eigenvalue of A. 
The remainder of the argument is as before. 

Furthermore, the above argument is valid for the analogous result in base ten (or 
any other base); for example to show the analog of (*) in base ten, in the matrix A 
of (* * *) simply replace the first row of 5’s with 9/10’s and replace the remaining 
(5)’s with 1/10’s. " 

For a contrasting, more measure theoretic approach to this same material see [1, 
p. 43-8]. 
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Continued Fractions and Differentiability of Functions 


ALEC NORTON 
Department of Mathematics, Boston University, Boston, MA 02215 


The “ruler function” f: R — R is defined by the rule 


p(x) = { 


(By convention, if x is an integer, then x = x/1 in lowest terms, e.g., f(0) = 1.) 
This is a well-known example of a function continuous at the irrationals but 
discontinuous at the rationals. Suppose that we wish to improve continuity at the 
irrationals to differentiability. The ruler function is differentiable nowhere, but a 
simple modification will make it differentiable almost everywhere. 

The idea of modifying f to produce points of differentiability has been around 
for some time, as has the more general question of differentiability in the face of a 
dense set of discontinuities, e.g., [1], [2], [3], [5], [6], [8]. However, the literature is 
fragmentary and not widely known. Furthermore, the point of view of Diophantine 
approximation taken here has not been fully exploited, nor, apparently, have 
derivatives higher than the first been discussed in this context. To do so is to 
provide a beautiful connection between the notions (1) degree of differentiability 
and (2) degree of approximability by points of a dense set (such as the rationals). 

First we need a definition. Let k > 0 be an integer. A function f: R > Risk 
times (Peano) differentiable at a point x if there is a polynomial P,(t) of degree 
< k such that P.(0) = f(x) and 


f(x +t) — P(t) 


t 


0 if x is irrational 
1/q_ if x = p/gq in lowest terms (with g > 0). 


>0 ast-0. 


(Note that when k = 0 this is just the definition of continuity.) Here P, is the kth 
order Taylor polynomial for f at x. If f is k times differentiable at x, define the 
kth (Peano) derivative of f at x by f(x) = (coefficient of ¢* in P,(t)) - k!. (It 
should be remarked that for k > 1 this definition is slightly more general than the 
usual one, which requires that f(*~» exist in a neighborhood of x for f(x) to be 
defined. However, the Peano derivative agrees with the ordinary higher derivative 
whenever the latter is defined, and has the virtue of allowing us to discuss higher 
derivatives in the context of a dense set of discontinuities. In this context, we will 
omit the word “Peano” from now on without danger of ambiguity.) A function f is 
infinitely differentiable at x if it is k times differentiable at x for all k. 
Define g: R — R by 


_ {0 if x is irrational 
a(x) = 1/27 if x = p/gq in lowest terms (with g > 0). 


PROPOSITION. There is a partition of the irrationals R \ Q into uncountable dense 
sets Qo, O,,...,0,, with the property that g is infinitely differentiable at each point of 
@,,, and for each k => 0, if @& ©,, then g is k times differentiable but not (k + 1) 
times differentiable at 0. Furthermore, ©, is residual in R (i.e., contains a countable 
intersection of open dense sets), while R \ ©,, has Lebesgue measure zero. 


640 ALEC NORTON [August-September 


Proof. The proof uses the ideas of Diophantine approximation (good references 
are [4] or [7], among many others); in fact the partition {©,} will merely be a 
partition according to the rate of divergence of the coefficients in the continued 
fraction expansions of the irrationals (see below). 

We henceforth use @ to denote irrationals. First note that, for any k, if the kth 
derivative g‘*)(@) exists, it must be zero. We can see this by induction. For k = 1, 
this follows because 9(0) = 0 and g > 0. Suppose g’(0) = --- = g~%(6) =0, 
and gis k times differentiable at @. Then 


g(0+t) — P(t) 


; —-0 ast-—0, 
t 


SO 
1/27 — g©(0)(6 — p/q)*/k! 
(0 — p/q)* 


But the quantity (1/27)/(@ — p/q)* has a limit only if that limit is zero: this is 
because there will always be rationals with very large denominators that are only 
moderately near 6. Hence g‘*)(@) = 0. 

Since g is periodic with period one, it is enough to consider just the interval [0, 1]. 
Each irrational @ € [0,1] has a unique infinite continued fraction expansion 


—>0 as p/q-@. 


a2 + ay +... , 

denoted [a,, a, a;,...], where the coefficients a, (called partial quotients) are 
positive integers. Truncating at the nth stage yields the nth rational convergent 
Pn/ Un = [@1,4,---, 4,], which has the property that, among rationals p/q with 
Gd <4,» |9 — p/q| is minimized when p/q = p,/q,. It turns out that the sequence 
{q,,} Satisfies the relations 


do _ 1, q) = 4); Qn+1 = Qn+19n + Qn-1° (1) 
Moreover, we have the standard inequalities (which, e.g., follow from eq. (37), p. 237 


of [7]): 
(2) 


<|6 — p,/4n| < 


(2 + Ans) 4 Anside 


The question of the differentiability of g at @ will hinge on the rate at which the 
sequence {a,} is increasing, or more exactly, the rate at which the monotone 
sequence { A, } is increasing, where A, = max{a;;i=1,...,n + 1}. 

For convenience, define a function F: Z*— Z* (depending only on @) by 
F(q,;) = 2+ A, for each j, and F(q) = 2 for all q not in the sequence {q;}. We 
will need the following general fact: 


for all rationals p/q, |@ — p/q| = 1/q?F(q). (3) 


To see this, suppose otherwise for some p/gq. Then in particular |9 — p/q| < 1/2q’. 
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But this implies (e.g. [7, p. 237] that p/q is actually one of the convergents p,/q,,. 
Hence |6 — p,/q,| < 1/q2(2 + A,), but this contradicts (2). 

Now suppose g is k times differentiable at @. Since the first k derivatives of g 
must therefore be zero, this is true if and only if 


(1/2%)/|8 - p/q|' +0 as p/q> 
By (3), this will be true if (q7F(q))*/2% — 0 as g > oo, which holds if and only if 
(q2A,)</2% —>0 asn- oo. 


Conversely, suppose (g7A,)*/2% does not converge to zero. Then for some 
e>0, A={n: (q7A,)*/2™ > e} is an infinite set. Furthermore, A contains 
infinitely many n such that A, = a,,,. (To see this, choose N so large that g2*/2% 
is monotone decreasing for n > N. Choose m> WN so that A, =a,,,, (possible 
since { A,,} cannot be bounded if A is infinite). Now pick p © A such that p > m. 
If A, #a,,,, then A, = A,_, and p—16€A. Repeating with p — 1, we must 
eventually find n © A with n > m and A, = a,,,,. This can be done for arbitrarily 
large m.) We have now shown that (q2a,,,)"/2% > e for infinitely many n. 
Therefore, 


(1/2%)/9 — Da/dyl* > (G2ay 41) /2% > € 


for infinitely many n (using (2)). Hence g cannot be k-times differentiable at 6. 
We have proved that g is k-times differentiable at @ if and only if 


(q24,)‘/2% >0 asn— oo. (4) 


Now we simply define ©, to be the set of irrationals @ for which (4) holds for k but 
not for k + 1. 

For k > 0, the following argument shows that ©, is dense and uncountable. Fix 
k. We inductively define a sequence {a,} of partial quotients as follows. Let 
a, = 1, and if a,,..., a,, have been defined (and hence q,,..., g,, by means of (1)), 
let a, ,, be the least integer greater than or equal to g7 72%”). Then 


(a4y.4192)*/2% < [( gy 22%/* + 1) 42] “/2™, 
= qu *2%[1 + qe2~%/*] ‘loa, 


=q,*{1+ g22~ aml)" >0 asn- oo. 
On the other hand, 


(aug?) 1/2 > (gz t2/#) "> 


= gq, FtDQ&@/E — oe asn— oo. 


Since {a,} is eventually increasing, a,,, = A, for n large enough. This means that 
[a,, d,,...] © ©, by definition of ©,. Indeed, the set {[c,, c5, c3,...]: ¢; = a; or 
1 + a, for each i} is contained in ©, by similar reasoning, and is uncountable by 
virtue of the uniqueness of continued fraction expansions [7]. Furthermore, for any 
finite sequence {b,, b,,...,b,,}, [b,,---; Dns Im+i> Im+20-+-] © @, Since a finite 


9 m? 
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number of changes in the sequence {a,} cannot affect the limit (4). The reader can 
readily check that the set of all such points is dense in [0,1], and hence, so is ©,. 

Define ©,, to be the set of @ such that (4) holds for all k © Z*. In case 
A, < Cq® for some B,C > 0, then the corresponding irrational is called Diophan- 
tine. Any Diophantine irrational is contained in @,, since for each k, g?t8*/2% — 
0 as n — oo. It is well known [9, p. 8] that the Diophantine irrationals form a set of 
measure one in [0,1]. Hence ©,, is also of full measure. 

On the other hand, the following argument shows that ©, is residual. Let 
Wp, q) = {x €© [0,1]: |x — p/q| < 1/27}. This set is open, and V, = 
U,.,U,W(p, ¢) contains all the rationals, and, hence, is open dense. Let V = a Von 
If 0 & V is irrational, then for infinitely many g, |@ — p/g| < 1/24 < 1/2q? “for 
some p. But, as before, such a g must in fact be a q,, so by (2) we have 


1 1 
24 a,.ae 0 P/N S amy g? 
Therefore, 2 + a,,, > 9, 72% for infinitely many n, which implies that 
2+ A, > q772% 
for infinitely many n, and hence 
g2A,/2 > 1/2 


for infinitely many n, so 8 & @p. 


This means that ©, U Q contains V, and, therefore, is residual. Hence , is residual 
(since Q@ is countable). This completes the proof. 


Remarks. (1) The Proposition says that g is either not differentiable at “most” 
points or infinitely differentiable at “most” points, according to whether “most” is 
interpreted in the sense of category or measure. This is related to the well-known 
dichotomy between the Diophantine irrationals and the Liouville irrationals (those 
which are not Diophantine). See [9] for more on this interesting topic. 

(2) Suppose we alter the definition of g so that 27 is replaced by w(q), where 
w: Z*-— Z™* is some increasing function. Then the following are left to the reader. 
(See [8] for (a) and other related results.) 

(a) If w(q) = q?, then g is nowhere differentiable. (Use (2).) 

(b) If w(q) = q°, g is differentiable on a dense, uncountable set of irrationals, 
but nowhere twice differentiable. 

(c) No matter how rapidly w increases, the set ©, of points of nondifferentiabil- 
ity is residual. 

As a consequence of (c), no function vanishing at the irrationals and discontinu- 
ous at the rationals can be differentiable at the irrationals. In fact, a little more 
argument shows that no function can be discontinuous at every rational but 
differentiable at every irrational. (This has been known, by another method of 
proof, for some time, e.g. [3], [2].) The following theorem implies (c) and the above 
statements, and provides a nice application of the Diophantine approximation point 
of view. (A slightly weaker version appears in [6] and is considered from a more 
general viewpoint in [1].) 
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DEFINITION. For a > 0, a function f is said to be a-Hélder at x € R if, for some 
M > 0, |f(x) — f(y)| < M|x — y|* for all y near x. 


THEOREM. If f: R — R is discontinuous on a dense set S, then there is a residual 
set A C R with the property that for all x © A and all a > 0, f fails to be a-Holder 
at x. 


Proof. S has a countable dense subset {7, }. (Exercise.) The proof is inspired by 
the above proof that ©, is residual, with {7,.} playing the role of the rationals. Let 
s: Z*—> R U {co} be defined by s(n) = lim sup] f(7,) — f(y)|, where the lim sup is 
taken as y — r,. Then s(n) is positive for all n since f is discontinuous on {7, }. Let 
h(n) = min{s(n),1}. Fix a > 0. For each n, k € Zt, let 6(n, k) = k~"h(n)'*. 
Let A(k) = U,, Ban, xy (%n)- (B,(b) denotes the open ball of radius a, center b). This 
is open and dense in R. Let A, =1,A(k). Pick x © A,, and assume x is not in 
{7,, } (otherwise f is not even continuous at x). 

Then for every k there exists r,;,) such that x © Bg acgy ny (ncn) Lees |X — Mcyl < 
k“ h(n(k))'“ (by definition of 6). Choose y, appropriately near (possibly equal) 
tO 7,(4) So that 


Ye Tack) | < |x - Tack) | and | f(x) — f(y») | > 5h(n(k)). 
Then |x — y,|<2|x — nuy|, so 


lx) — fwd] Hall) AC (49) 
Ix — y, |" (2|x — rw) 2°*"h(n(k)) 


and this tends to infinity as k > oo (hence as y, — x). Hence f is not a-Hdélder at 
x. Letting A =f),A,,,; completes the proof. 


_ ke/2et), 


Final Remark. A version of the Proposition is also true in this more general 
context: for any countable dense set S there is a function g: R — R discontinuous 
on S and satisfying the conclusions of the Proposition (with respect to R\ S 
instead of R \ @). Details will appear elsewhere. 


(Acknowledgement. I thank the referees for helpful suggestions and for pointing out many of the cited 
articles.) 
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In a note [1] in the MONTHLY, Botsko and Gosser point out that the standard 
version of the Fundamental Theorem of Calculus holds when the usual derivative is 
replaced by the right-hand derivative. We would like to point out that by making a 
slight alteration in the usual definition of the Riemann integral, we can obtain an 
integral for which the Fundamental Theorem of Calculus holds in full generality. 

We begin by recalling one of the common definitions of the Riemann integral. 
If P= {a =x 9<x,< ++: <x, = 5} is a partition of [a,b], the mesh of P is 
max{x,; — xX;-,;:i=1,...,n}. 


DEFINITION 1. A function f:[a,b] — R is Riemann integrable over [a, b] if 
there exists A © R such that for every e > 0 there exists 6 > 0 such that if P isa 
partition of mesh less than 6 and if ¢; © [x,_,, x,], then 


Yada — X;1) ~A| <e. 


The number A is called the Riemann integral of f and is denoted by [?f. 

In order for a function f:[a,b] — R to be Riemann integrable it is necessary 
that whenever the interval [a, b] is partitioned into subintervals of length less than 
6, the Riemann sums L_, f(t;)(x; — x;~,) approximate the integral of f within e. It 
is this requirement of being able to uniformly partition the interval that limits the 
scope of the Riemann integral. It would be much more desirable to somehow allow 
“variable length” partitions; for example, if one were attempting to approximate 
the area under the graph of f(x) = 1/yx, 0 < x < 1, it would be natural to take 
the subintervals in an approximating partition to be very fine near the singularity 
x = 0. By making a slight modification in the definition above, we can easily 
achieve the ability to employ such variable length partitions. 

First, note that the requirement in Definition 1 that the partition P have mesh 
less than 6 can be replaced by the condition: 


) ) 
[x,,,x,] € [y, — 5 t, + 5] where 1¢, © [x;,_,, x;]. (1) 


Now we can achieve the desired variable length partition by merely replacing the 
constant 6 in (1) by a positive-valued function 6 :[a, b] > R, 1.e., we replace (1) by: 


[x;1, x; ] Cc (t; — 8(t,),¢;, + 5(t;)) where 1¢; © [x;_,, x;]. (1’) 
644 
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By varying 4(t) as ¢ varies along the interval [a, b], the lengths of the subintervals 
in the partition satisfying (1’) will vary with 6(z,). 

We give the formal definition of the resulting integral. A tagged partition of 
[a, b] is a finite set T = {x 9, x,,...,X,3 4, ty,..., ¢,} such that {xo, x,,..., x, } is 
a partition of [a,b] and ¢; © [x;_,, x,]; the point ¢, is said to be a tag for the 
subinterval [x,;_,, x,;]. Any positive-valued function 6:[a, b] — R is called a gauge 
on [a, b]. We say that a tagged division T is 6-fine if (1’) is satisfied. 


DEFINITION 2. A function f:[a, b] — R is gauge-integrable over [a, b] if there 
exists A € R such that for every e > 0 there exists a gauge 6 on [a, b] such that if T 
is a 6-fine tagged partition of [a, b], then 


y f(t )(x; _ X;-1) — Al <e. 


i=] 


The number 4 is called the (gauge) integral of f and is denoted by {2f. The 
gauge integral is also referred to as the generalized Riemann integral [9] or the 
Riemann complete integral [5]. From Definition 1, we see that a function is 
Riemann integrable iff it is gauge integrable with respect to constant-valued gauges. 
Note also that if a function is gauge integrable then its integral is the limit of a 
sequence of Riemann sums. One technicality must be taken care of in order for 
Definition 2 to make sense: it must be shown that every gauge 6 has at least one 
6-fine tagged partition. Currently this observation is ascribed to Pierre Cousin [2] 
but it may go somewhat earlier. The lemma has a curious habit of rediscovery: for 
example, each of the articles [3], [7], [12], [14] contains a fresh account with similar 
applications. The proof requires only a compactness argument (based on the 
Bolzano-Weierstrass or Heine-Borel theorems) and indeed the lemma is equivalent 
to these theorems. The reader can find an elementary proof in [9]. 

Before proceeding to the Fundamental Theorem of Calculus, consider the inte- 
grability of the Dirichlet function: f(x) =1 for 0 <x <1 and rational and 
f(x) = 0 for 0 < x < 1 and irrational. This is the most common example given of a 
bounded function that is not Riemann integrable, and will, therefore, furnish a 
comparison of the gauge and Riemann integrals. Let « > 0 be given and for x 
irrational, set 6(x) = 1. Let {z,}?2, be an enumeration of the rationals in [0,1] and 
set 6(z,) = ¢/2'*!. Now suppose that T = { xo, x,,..., X,3 ty ty,.--,¢,} is a 6-fine 
tagged division of [0,1]. If ¢; is not rational, the term f(t;)(x; — x;.,) in the 
Riemann sum of f with respect to T is 0; if ¢; is rational and ¢; = z,, then the term 
f(t,)(%; — %;—1) in the Riemann sum is less than 26(z,) = ¢/2/**. Thus, we have 


n-1 oe) 

L f(t )(%1 - x1) | < 2 Le e/2/** =e, 

i=0 j=l 
where the factor 2 is necessary since each z, may be the tag for two of the 
subintervals in the partition. This implies that f is gauge integrable with [of = 0. 
Note that the gauge 6 is definitely not a constant-valued gauge. 

We now show that the Fundamental Theorem of Calculus is valid for the gauge 
integral in full generality. For this we require the following lemma which plays the 
role of the Mean Value Theorem in the usual proofs of the Fundamental Theorem 
of Calculus for the Riemann integral. 
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LEMMA 3 (STRADDLE LEMMA). Let F':[a, b] — R be differentiable at z & [a, b]. 
Then for each ¢ > 0, there is a 5 > 0 such that 


|F(v) — F(u) — F(z)(v — u)| s e(v — 4), 
whenever u < z < vu and [u,v] C [a, b] N(z — 6, z+ 8). 
Proof. Since F is differentiable at z, there is a 6 > O such that 
(F(x) — F(z))/(x — 2) -— F(z) <e 


for 0 < |x — z| < 6, x € [a, b]. If z = u or z = v, the conclusion is immediate so 
suppose u < z < v. Then, 


|F(v) - F(u) - F(z)(v - u)| 
<|F(v) — F(z) — F(z)(v — z)| +| F(z) — Flu) — F(z)(z - 4)| 
<e(vu-—z)+e(z-—u) =e(v—-Uu). 


The geometric interpretation of the Straddle Lemma is clear. If the points u and 
v “straddle” z, then the slope of the chord between the points (u, f(u)) and 
(v, f(v)) is close to the slope of the tangent line at (z, f(z)). 


THEOREM 4 (FUNDAMENTAL THEOREM OF CALCULUS). Jf F:[a,b] > R is dif- 
ferentiable on [a, b], then F’ is gauge integrable over [a, b] and [°F’ = F(b) — F(a). 
Proof. Let ¢ > 0. For z & [a, b], let 6(z) > 0 be the 6 given by the Straddle 


Lemma. Suppose T = { Xo, X1,.-..,X,3 4, ¢2,---,t, } 18 a 6-fine tagged partition of 
[a, b]. Then by the Straddle Lemma, 


Y Fad ~— x;_,) — (F(b) - r(a)| 


» { F’(t;)(x; — X;1) - (F(x;) ~ F(x;,_1))} 


< )i e(x; — x;_,) = e(b — a), 
i=1 
and the conclusion follows. 


Note that in general the gauge 6 constructed above is not a constant-valued 
gauge and depends on the differentiability properties of F. 

The versions of the Fundamental Theorem of Calculus for both the Riemann and 
Lebesgue integrals require the hypothesis that the derivative F’ is integrable; it is 
part of the conclusion of Theorem 4 that the derivative F’ is gauge integrable. For 
example, the derivative of the function F(t) = t* cos(7/t’), 0 < t < 1, F(0) = 0 is 
gauge integrable, but is not integrable for either the Riemann or Lebesgue integrals. 

To further illustrate the utility of the gauge integral, we now proceed to 
generalize Theorem 4 by allowing the function F to be nondifferentiable at a 
countable number of points. 


THEOREM 5. Let F:[a, b] > R be differentiable except perhaps at countably many 
points of [a, b]. Let G:[a, b] ~ R be such that G(x) = F’(x) when F is differentia- 
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ble at x. If F is continuous, then G is gauge integrable over {a, b] with [?G = F(b) — 
F(a). 


Proof. The proof is not substantially different and requires only some arithmetic 
to take care of the exceptional set N = {z,;:i = 1,2,...} where F may fail to be 
differentiable. Let « > 0. For x € N define (x) > 0 by the Straddle Lemma as 
before. For z; € N, choose 6(z;) > 0 such that |G(z,)|26(z,) < ¢/2'*? and 

|F(z,) — F(z, + h)| < ¢/2'*? 


for |h| < 8(z,) (by continuity). Suppose that T = {Xo, X1,..-,%X,3 ty to,---,t,} 18 a 
6-fine tagged partition of [a, b]. Then as before, 


Y G(4)(x,~ x11) ~ (FB) ~ Fla) 


¥ (G(4)(x,~ 1-1) ~ (FC) ~ Fa) (2) 


We break the sum on the right-hand side of (2) into two parts. Let ©’ denote the 
sum of the terms with tags t; € N, and let ©” denote the sum of the terms with tags 
t; € N. As before the sum »’ is less than e(b — a). For ©”, if t; = z,, then 


|G(t;)(x; — x;-1) | <|G(z,) |28(z,) < e/2/*? 


and 
|F(x;) — F(x-1)| <|F(x;) — F(z,)| +|F(z;) — F(x,-1)| < 26/2/*°. 
Hence, 
y” <2| ¥ c/2it?2 + ¥ c/2/*?| =e, 
j=l j=l 


where the factor of 2 accounts for the fact that each z; may be the tag for at most 
two of the subintervals. Thus, the sum in (2) is less than e(b — a) + e and the result 
follows. 


Both versions of the Fundamental Theorem of Calculus given in Theorems 4 and 
5 are well known for the gauge integral and can be found in [9]. There is also a 
divergence version of the Fundamental Theorem of Calculus for a gauge-type 
integral in n dimensions given in [13]. 

The functions F(x) = 2x!/7,0 <x <1, and G(x) = x7!” for 0 < x < 1 and 
G(0) = 0 provide a simple example where Theorem 5 is applicable but the more 
familiar version of the Fundamental Theorem of Calculus is not. Note that in this 


case, the integral 
} “172 = : = te = 2 
i} x dx i} G i 


is computed without resorting to the limiting technique required by the Riemann 
approach. 

Note that the continuity assumption in Theorem 5 is important. For example, if 
F(x) =x +1 for O<x <1 and F(x) = —-x for -1 <x <0, then F/(x) = 
G(x) except for x = 0, but {1,G = 0 while F,(1) — F,(-1) = 1. 
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As can be seen from the definition, the gauge integral has very much the same 
flavor as the Riemann integral, being obtained from a slight modification of the 
Riemann integral, and does not require a lot of technical apparatus for its introduc- 
tion as is the case for the Lebesgue integral. However, despite the elementary 
appearance of the gauge integral, it leads to a very powerful theory of integration 
which encompasses the Riemann integral, the Cauchy-Riemann (improper 
Riemann) integral, and the Lebesgue integral. For this reason, the gauge integral 
would seem to be a very reasonable candidate for inclusion in an introductory real 
analysis course; it is as conceptually easy to describe as the Riemann integral and 
yet possesses all of the powerful properties of the Lebesgue integral including the 
Monotone and Dominated Convergence Theorems. 

Remarkably, this simple modification of the Riemann integral was not intro- 
duced until approximately a century after Riemann’s introduction of his integral in 
1854. The gauge integral was independently introduced by Kurzweil [6] and Henstock 
[4]; Kurzweil used the integral to treat some questions in ordinary differential 
equations but did not develop any of the deep properties of the integral; Henstock 
established the convergence theorems for the integral. 

The interested reader can find very readable expositions of the gauge integral in 
[5], [8], [9]. E. J. McShane also treats a “gauge-like” integral in [10], [11]; he alters 
the definition above by dropping the requirement that the tag ¢, belongs to its 
corresponding subinterval. The resulting integral is, surprisingly enough, exactly 
equivalent to the classical Lebesgue integral. 
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A Strong Inverse Function Theorem 
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A linear transformation from R” into the same space R” is one-to-one if and 
only if its matrix relative to, say, the standard basis has nonzero determinant. In this 
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case the transformation maps R” onto R”. This much everyone knows. A trivial 
extension says that an affine function (a constant plus a linear transformation) from 
R” into R” is one-to-one if and only if the matrix of its linear part has nonzero 
determinant, in which case the function maps R” onto R”. 

Now consider a non-linear function f= = ( fi ...5f,) from a subset of R” into R”. 
We say f is differentiable at a point a a if f is “affine at a,” by which we mean there 
exist a linear transformation T and a “vanishing” function E such that 


f(X) = f(@) + T(X — a) + [|x - a E(x) 
for all x in some neighborhood of a, and E n(x) > 0 as x > 4. If we write 
C= f(@) — T(@), then f(%) = @+ T(x) (an affine function) when X = a. We call 
T the total derivative of f at a. The determinant of its matrix is the Jacobian of fat 
a. The property of affine maps mentioned in the first paragraph suggests that if this 
Jacobian is nonzero, then f will be one-to-one on some neighborhoods of a and map 


that neighborhood onto a neighborhood of f(a). This pleasant conjecture is 
defeated by the following function from R? to R?: 


fi(x, y) =x +2x*sin(1/x) if x#0, f,(0, y) =0 


f 2(x oJ ) = YJ. 

This function is differentiable at (0,0) and has Jacobian 1, but there is no 
neighborhood of (0,0) on which it is one-to-one (for the proof of this assertion it 
helps to note that (d/dx) (x + 2x? sin(1/x)) takes positive and negative values on 
every neighborhood of 0). This explains why texts that prove the Inverse Function 
Theorem must use stronger hypotheses. Typically they assume that f is differentia- 
ble throughout some neighborhood of a, or even that f has continuous partial 
derivatives on such a neighborhood. The purpose of this paper is to prove that the 
classical sufficient condition for differentiability of f at a single point a is also 
sufficient to make f invertible on a neighborhood of a when the Jacobian is 
nonzero. The classical condition to which we refer is the following: the first-order 
partial derivatives of the component functions f, exist on a neighborhood of a and are 
continuous at G. In this case we shall say that “f is minimally C! at a.” It is our 
pious hope that textbook writers of the future, seeing the simplicity and strength of 
this condition, will say simply that “f is C! at a.” 


ONE-POINT INVERSE FUNCTION THEOREM. Let f be a function from a subset of R” 
into the same space R”. Suppose fi is minimally C’ at a point a and the Jacobian of fe at 
a is nonzero. Then there exists a neighborhood N of a such that tf is one-to-one and 
continuous on N and f LN |] isa neighborhood of f(a). Let g denote the inverse 
function of the restriction of f to WN’. Then _g is continuous on fl [NW] and differentiable 
at at fF ( (a ), and the Jacobian matrix of g at i (a) is the inverse of the Jacobian matrix of 
fata. 


To prove this theorem it suffices to prove that the hypotheses imply that f iS 
strongly differentiable at a, which means (see [2], [3]) there is a linear transformation 
T:R” — R” and a vanishing function F:R” X R” — R” such that 


F(%) — f(¥) = T(E -¥) + F - HUF, 7) (1) 


for all x and y in some neighborhood of a, and F(X, J) > 0 as (x, F) > (4, @). 
As shown (tacitly) in [4], strong differentiability of f at a is enough to deliver most 


650 WILLIAM J. KNIGHT [August-September 


of the conclusions of the One-Point Inverse Function Theorem stated above (in fact 
the inverse function g can even be shown to be strongly differentiable at f(@)). 
Thus the proof of our theorem comes to rest on the following lemma. 


LEMMA. Let f be a function from a subset of R” into R" and let a be a point at 
which f is minimally C’. Then f is strongly differentiable at @. (Note: the proof does not 
require the domain and range spaces to have the same dimension.) 


Proof. While it is possible to prove that these conditions imply the hypotheses of 
a simple extension of Theorem 1 of [3, p. 972], it seems preferable to give a proof 
from first principles. Let T denote the total derivative of f at a. This exists because 
f is minimally C’ at a. Let B be a box about @ on which all first-order partial 
derivatives of the component functions f, exist. Fix x and y in B. By moving from 
x to y along lines parallel to coordinate axes and applying the Mean Value 
Theorem for real-valued functions, one variable at a time, we obtain 


i) 19) = Y Dahlia) ~ y,) (2) 


for each component function f, and for appropriately chosen points c,, along the 
path from x to y. These points c;, are forced toward a as (x, y) > (a, @), so the 
difference defined by 


Ei,(x, y) = D,f:( ix) _ D,f,(@) 


approaches 0 as (x, y) — (a, a). Thus we can write (2) in the form 


(3) ~ (3) =X Dye - yy) + x Eg %, BV(x%4 — Ye) 


Vii(a) -(¥—¥) + EX, ¥) > (¥-Y), (3) 
where E, = (E,,..., E;,). The first term on the right side of (3) is the ith 
component of T(z — y). To handle the second term we write 


(E,(%, ¥) -(¥-¥),..., E,(% 9) - (¥- 9) 


if x # y and set F(x, X) = 0. This yields equation (1). Moreover, 
_ 1 n ~ . . n ~ |, 
F(x, y)Il < lz — 5 DIE(*, ¥)-(¥-Y is VIE Pl 
i=l i=l 
by the inequality |All < L|h,| and the Cauchy-Schwarz Inequality. It follows that 
F(x, JV) > Oas (x, y) > (4a, a). 0 


Some texts (e.g., [1]) prove that the following is a sufficient condition for 
(ordinary) differentiability of a real-valued function f of several variables: all but 
one of the first-order partials of f exist on a neighborhood of a and are continuous at G; 
the exceptional partial derivative exists at a. This result can easily be extended to 
vector functions. Does this condition also suffice for strong differentiability at a? 
No. The function given as an example earlier satisfies this condition at (0,0), but it 
cannot be strongly differentiable there because it has a nonzero Jacobian at (0, 0) 
and yet is not locally invertible there. 
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A Constructive Proof of the Partial Fraction Decomposition 


DAN SCOTT 
Department of Mathematics, Rose-Hulman Institute of Technology, Terre Haute, IN 47803 


DONALD R. PEEPLES 
Department of Mathematics, Mary Washington College, Fredericksburg, VA 22401 


The method of partial fraction decomposition is central to the utilization of 
rational functions in analysis (integration, differential equations, etc.). An efficient 
method of partial fraction decomposition appears in classical algebra texts such as 
Chrystal [1]; however, the efficiency of this “classical method” was demonstrated by 
Straight and Dowds only in 1984 [2]. The classical method was shown to work on all 
rational functions whose denominator contains at most one irreducible quadratic 
function. We show that the classical method can be modified to work on all rational 
functions while retaining the same efficiency; furthermore, our proof is constructive, 
is performed in a general setting, and uses techniques of modern algebra. 

Straight and Dowds [2] point out that the classical method is recursive and that, 
whereas the usual method described in modern calculus texts requires O(n°) 
operations for a rational function whose denominator has degree n, the classical 
method takes only O(n’) operations and is “algebraically simpler”; furthermore, 
Chrystal points out that it is somewhat self-checking. All four of these properties are 
retained when the classical method is extended to include all rational functions. 


Notation. The rational function that we wish to decompose will be denoted by 
p/q. Let R[x] be the ring of polynomials over the reals; R[x]/(f(x)) is the 
quotient field, where f stands for some irreducible polynomial of R[x]. (Thus f is a 
linear or a quadratic polynomial.) It is well known that the elements of R[x]/( f(x)) 
can be represented by real polynomials of degree less than the degree of f. Also well 
known is that there exists a homomorphism of R[x] into R[x]/( f(x)) defined by 
o(u) = umod f, where u is any element of R[x]. For the sake of clarity all 
arithmetic operations in the quotient field will be enclosed in a circle. 


THEOREM (Partial fraction decomposition). Suppose that the denominator of the 
rational function p/q has been factored so that p/q = p/(wf*) where f is an 
irreducible element of R[x] and w and f are coprime. Then there exist unique 
polynomials s and t of R[x] with degree(s) < degree( f) such that 


P P S t 
qf Fe wp ) 
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And in fact s and t are given by 


s=p(p)Op(w); t=(p-sw)/f. (2a, b) 

Comments: Note that s is actually thought of as both an element of R[x]/( f(x)) 

and as an element of R[x] as appropriate. This is allowable in light of the earlier 

remark made about the usual representation of elements in R[x]/(f(x)). It should 

also be clear that if this theorem can be established, then it may be applied 

repeatedly to yield a complete partial fraction decomposition of p/qg. (Further 
factorization of the denominator will most likely be necessary.) 


Proof. Equation (1) is equivalent to 


p=swe+ ft. (3) 
Applying the homomorphism, q¢, to (3) we obtain 
9(p) = (s) @ p(w). (4) 


Since degree(s) < degree(/), p(s) = s so solving (4) for s we obtain 


s=(p) @ o(w) 


as desired. Thus if (1) holds then (2a) must follow. Then (2b) follows from (3). Thus 
we have now established the uniqueness of the decomposition. The only additional 
thing that must be verified is that ¢, as defined by (2b), is actually a polynomial, that 
is, that p — sw is a multiple of f. To do this simply observe that 


¢(p — sw) = (Pp) © [e(p) @ o(w)] @ o(w) = 0. (5) 


This completes the proof. It should be noted that one may find s and ¢ without ever 
dealing explicitly with operations in R[x]/(f(x)), simply by solving the equation 
p mod f = sw mod f obtained from (3). In practice this seems to be the easiest way 
to proceed. 


Analysis of Complexity. In order to facilitate easy comparison of the complexity 
of our algorithm with other methods, we shall assume that the degree of the 
numerator of the rational function is less than the degree of the denominator. The 
algorithm can, however, be applied even if this is not the case. 

Suppose the denominator of p/q has been completely factored into the product 
of powers of linear and irreducible quadratic terms. Let m be the sum of the powers 
of these terms, and n the degree of the denominator. Then it must be that 
n/2 <m <n. The number of times that the theorem must be applied, in order to 
completely decompose p/q, is m—1. Thus the number of applications of the 
theorem is O(n). Each time the theorem is applied it is necessary to perform the 
following steps: 1) compute g(p) and m(w)—each of which may be done by 
division and requires no more than O(n) steps; 2) compute o( p) @ p(w)—which 
takes a small fixed number of operations; and 3) compute t = (p — sw)/f by 
division—which requires no more than O(n) steps. Thus the theorem is applied 
O(n) times and each time requires O(n) operations; therefore the decomposition 
requires O(n”) operations overall. More precise estimates may be obtained by 
analyzing the algorithm in more detail. 


Example. A short example should help clarify the discussion. We shall find the 
partial fraction decomposition of (x? — x + 4)/(x? + 1)(x — 1)”. The first applica- 
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tion of the theorem gives us 


x?-x+4 s t 
(24+ D(x-1p +1 (x- 1) (6) 
where 
s=9(p) @ o(w) = (-2x + 4) @ (-2x) =2x4+1 (7) 
and 


t= (p — sw)/f = [(x3 — x + 4) — (2x + 1)(x — 1))|/(x? +1) = —x +3. 
(8) 


Next, we use the theorem to decompose (— x + 3)/(x — 1)”. We obtain 


—x +3 S t (9) 

eee + —_, 

(x-1)/ (x-1 x71 
where 

s=9(p)o9(w) =2/1=2 (10) 
and 

t= (p — sw)/f = [(-x + 3) - Q)Q)I/(* - 1) = - 1. (11) 
So, putting everything together, we have 
xi—-x+4 2x +1 2 1 


(@4+(-1P +l Gp x1 (12) 


The approach used above is certainly not the most efficient implementation for 
hand calculations, though it does have the advantage of staying close to the proof of 
the theorem. A more elementary version of this method can be taught to freshmen 
calculus students (see the note after the proof). In fact, in our experience, they seem 
to prefer it to the usual method of equating coefficients. This is partly because of 
the smaller number of calculations, and partly because after each fraction in the 
expansion is found, there is a division that helps determine if that term was found 
correctly. The divisibility check is made when it is found that f divides evenly into 
p — sw. 
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Let a mod m denote the arithmetic progression a + km: k © Z. Suppose b mod n 
is disjoint from a mod m. What is the smallest positive integer , such that there is 
an arithmetic progression cmodr which is disjoint from both amodm and 
b mod n? 


E 3279. Proposed by Stanley Rabinowitz, Alliant Computer Systems Corp., Little- 
ton, MA. 

For some nv find a simplex in E” with integer edges and volume 1. (Cf. Problem 
1261 in Mathematics Magazine, [1987, p. 40; 1988, p. 118].) 

E 3280. Proposed by Peter J. Ferraro, Roselle Park, NJ. 


Let (X, T ) be a topological space such that { p }’ is closed for every point p © X. 
Prove that A’ is closed for every subset A of X. Here A’ is the set of points x € X 
such that every open set containing x contains a point of A \ {x}. 


E 3281. Proposed by Sandi Klavzar and Marko Petkovsek, University of Ljubljana, 
Yugoslavia. 


Given a graph G, let C(G) be the inclusion ordering on the vertex subsets of G 
inducing connected subgraphs. Prove that C(G) is a lattice if and only if G is 
connected and every cycle of G induces a complete subgraph of G. 


SOLUTION OF ELEMENTARY PROBLEMS 
More on Arithmetic-Geometric-Harmonic Means 


E 3141 [1986, 298]. Proposed by Mo Song-Qing, The Beijing 19th Middle School, 
Beijing, China. 


Suppose a, > 0, i=1,...,. Consider the inequality 
QA, +a, t+ ++ FA, Ag+ Az, 4 ++ FAag4y Aa,t a, +++ Fay, 
k k ves k 
Qa, t+a,+°:: +4, 4 
> (yay a, (1) 


for2<k<n-1. 
(a) Prove or disprove (1) for n = 3. 
(b)* Prove or disprove (1) for n > 3. 


Solution I to (a) by Michael Vowe, Therwil, Switzerland. Let 


1 1 3 
f(x, yz) = glx t yy t z)(z + x) — Ge + y + 2)(92) o, 


We show f(x, y, z) > 0 for positive x, y, z, with equality if and only if x = y = z. 
Since f(tx, ty, tz) = t?f(x, y, z), we may assume x + y + z = 1. In this case, we 
want to prove 


8 2/3 
(1 = x)(1 = y)(1 = 2) > 5 (92), 
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which holds if and only if 


3. 8 1 
(xyz) > 5° (2) 
3 I/x+1/yt+1/z-1 
Because x + y+z=1, the arithmetic-harmonic mean inequality becomes 
1/3 > 3/1 /x +1/y+1/z). This rearranges to 91/x+1/y+1/z-1)2 
8(1/x + 1/y + 1/2), or 
3 8 1 

—_—__- > — . —_____ (3) 

l/x+1/yt1/z 3 1/x+1/yt+1/z-1 


The inequality between the geometric and harmonic means implies that the left side 
of (2) is as large as the left side of (3), which proves (2) and thus (1). 


Solution II to (a) by Laszlo Cseh, Odorheiu-Secuiesc, Romania. For n = 3, we 
prove a sharpened form of (1), namely, 


a, +a, a,+a, a,t+a, a, +a,+a,\3/* 1/2 4 


For n = 3, (1) follows from (4) by the arithmetic-geometric mean inequality. 

(4) follows from an inequality of T. Popoviciu (in “Sur certaines inégalités qui 
caractérisent les fonctions convexes,” Analele Stiintifice Univ. “Al. I. Cuza’”’ Iasi 
Sect. I-a Mat. (N.S.) 11B(1965) 155-164). If f is a convex real-valued function on 
[a, b] and x, y, z © [a, b], Popoviciu proved that 


j[Et2 42), L410) #100 


3 3 
2 x+y yz z+x 
>= + + . 5 
idee kelered helen) ©) 
The function f(x) = —logx is convex on (0, +00). Putting x =a,, y= a4), 


Z = a, and exponentiating Popoviciu’s inequality yields (4). 


Editorial comment. Murray Klamkin remarked that, by an argument similar to 
that in Cseh’s solution, Popoviciu’s more general inequality for convex functions 


wnoy(e)e PE nay EA 


No jz, \f 


where s = L”_,x,, implies the truth of (1) for the case k = n — 1, for any n > 3. 
The truth of (1) in the case k = n — 1 was also established by M. J. Pelling. In 
addition Pelling showed that (1) holds in the case k = 2, n = 4, so that (1) is valid 
for n = 4 as well as for n = 3. 
On the other hand Pelling showed that, for each fixed k > 2, the assertion (1) is 
false when 


7 k?—1 
> —(k?-1). 
n> —(k?-1) 
In particular, if k = 2, (1) is false when n > 10. 


Pelling’s elegant argument for the falsity of (1) when n > a*(k? — 1)/3 is as 
follows. 
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Suppose k is fixed, k > 2, and n > k. Under the convention that a,,, = a,, (1) 
may be written as 


L<R, (6) 
where 
L= ~ ¥ tog{ 2“ +A j44-1 
j=l K 
1 2 n—-1 
R= -Ioe = > “| — y log a, 
j=l j=l 


If (6) were universally valid, it would hold in particular when a; = 1 + ecos(27j/n) 
for 1 <j < n. For this choice of the a; we have 


2 


ne [kK-1 0 Qa(jtr) 
L= _ a? 3 
yy ral 2, cos - + O(e°) 
J r 
en en 2a(s —r) 
am * ie y cos - + O(e°), 


O<r<s<k 


where we have used the identity 


2a(j + r) 2a(j + s) n 2a(s — r) 
_ COS — = = COS 


cos > 3). 
> n n 2 n (7 ) 
Similarly 
n—-1 2 ¢ 27] 
R= » cos? + O(e?) 
n 2,2 n 
j 
n — 1)e? 
= wee + O(e?). 
Thus a necessary condition for (6) is that 
n n 2n(s—r) (n-1) 
—+— >- )} cos———— < 
4k 2k* ye feyeg n 4 
or 
k? 1 2a(s—r 
—<<k(k-1)- Yb cos ET) 
2n 2 O<r<s<k n 
or 
k? 2a(s—r a(s—r 
—< y¥ f1- cos TS") = »y sine") 
2n n n 


O<r<s<k O<r<s<k 
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Using the inequality |sin x| < |x|, we obtain 


k? 27(s — ry 
—_ < ee 
2n  Qeres<k n° 
2a? (k*(k —1)(2k - 1) on — 1) aq k?(k? —1) 
— p? 6 4 yp? 6 


Thus a necessary condition for the validity of (1) or (6) is that 
oT 
ns 5 bk 2 1). 


Part (a) was solved also by O. Krafft (West Germany), J. E. Pecari¢, and the proposer. 


Weitzenbéck Generalized 


E 3150 [1986, 400]. Proposed by George A. Tsintsifas, Thessaloniki, Greece. 


Let ABC be a triangle with sides a,b,c and area F. It is well known that 
a? + b? + c? > 4FY3. If p,q, r are arbitrary positive real numbers, prove that 


c? > 2Fy3. 
qtr r+ p p+q 


Solution I by Victor Pambuccian, Bucharest, Romania. Put a= a,, b= 4b), 
C= a; P=P1, =P, =p; and k=2p,. The left side of the proposed 
inequality becomes 


> 


Pid k= a Da? = 13(k >> 
k — p, = k — p, a; 2 P:) 


where the inequality in the last step follows from Cauchy’ s Inequality. 
Furthermore, 


k — p, — 2a; > 3(2a,)° — 2a;, 


1 1 
5 (2a,)" — Ya? = 5 (22l/a; — xa?), 


where / = a,a,a3. Let A, denote the angle opposite a;. Since Da? = 3(21/a;) cos A,, 
we can rewrite the desired expression as 


1 ; , 122! 1 A, 
5 (2a,) — Xa; = 5744 — cos A,;) = sin 7 
2F Ai A, 
~ nan > = 2F2tan—. 


We now need only show = tan(4,/2) > V3. This follows from the convexity of the 
tangent function on the interval (0, 7/2). 


Solution II by O. P. Lossers, Eindhoven University of Technology, The Nether- 
lands. With the notation p,,a,; as above, we may assume that 2p, = 1. Using 
Lagrange multipliers for the function f(p,, Pp, p3) = =p,;a7/(1 — p;) on the set 
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{( Pi, Po» P3): Pp; > 0, 2p; = 1}, we find a minimum with p; = (s — a,)/s, where 
s = (1/2)3a,. Evaluating f at this point, we must prove that =(s — a,)a, > 2FV3. 

Because both sides of this inequality are positive, we may check the inequality by 
taking the difference of their squares. To express both sides in convenient forms, we 
express the side-lengths as a, =y+2z, a, =z+x, a,=x+y. Then X(s — a,;) 
a, = xy + yz + zx) and F* =s(s — a,)\(s — a,)\(s — a3) = xyz(x + y + 2). 
Thus 


[=(s — a,)a,]° — 12F? = Al (xy + yz + 2x) — 3xyz(x+y 4+ z)| 
=2x7(y—z)/ +2y2(z— x)? +227(x —y)’ > 0. 
Furthermore, equality occurs if and only if x = y = z,1€., a, = a, = Q3. 


Solution III and generalization by M. S. Klamkin, University of Alberta, Canada. 
Again with the same notation, we derive more generally an inequality for S = 
= p,ai"/(k — p;), where 1 > n > O and a, > 0. By Cauchy’s Inequality, we have 
208 + Sai") = S(k — p,)Sa}"/(k — p;) > (Za?")’. Letting R = Za}, this is 
equivalent to 


2S > RII(R — 2a”) (1) 


with equality if and only if a7”"/(p, + p3) = a3"/(p3 + p,) = a3"/( p, + Po). If 
the {a/’} are not the sides of a triangle, then the right side of (1) is negative. 
Suppose that {a,;} form a triangle and that 1 > n> 0. Then {a7} also form a 
triangle, whose area we denote by F,. The right side of (1) is 16 times the square of 
the side-length formula for area, yielding 2S > 16F?. 

We now use Oppenheim’s generalization [1] of the Finsler-Hadwiger inequality, 
1e., 4F / V3 > (4F/y3)" for 1 >n => 0, with equality for n <1 if and only if 
a, = a, = a,. With (1), this yields the generalization 


1S >3(4F/V¥3)" (0<n<1). (2) 


The proposed inequality corresponds to the special case n = 1/2. The special 
case n = 1 corresponds to the proposer’s problem #1051 in Crux Mathematicorum, 
11(1985) 187. 


1. A. Oppenheim, Inequalities involving elements of triangles, quadrilaterals, or 
tetrahedra, Publications de la Faculté d’ Electrotechnique de 1’ Université a Belgrade, 
No. 461—497(1974) 257-263. 


Also solved by P. Dwinger, J. Fukuta (Japan), J. V. Harrell, W. Janous (Austria), D. C. Kay, K. 
Schilling, South Alabama Problem Group, M. Vowe (Switzerland), R. C. Vrem, and the proposer. 
Partially solved by H. Geiges (West Germany), L. Kuipers (Switzerland), and I. A. Sakmar (Turkey). The 
inequality a? + b? + c* > 4F/3 goes back to Weitzenbéck, Math. Z., 5(1919) 137-146. 


An Inscribed Triangle Inequality 


E 3154 (1986, 482]. Proposed by George A. Tsintsifas, Thessaloniki, Greece. 


Let A,, B,,C, be points on the sides a, b,c of the triangle ABC, respectively, 
and let a,, b,, c, be the sides of the triangle A,B,C,. Prove that a7b,c, + b’c,a, + 
c’a,b, > 4F?, where F is the area of the triangle ABC. 
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Solution by Jiro Fukuta, Gifu-ken, Japan. 


LEMMA. Let P be an arbitrary point in ABC, with a, b,c the lengths of the sides 
opposite the respective vertices. Then 


PB PC PCPA _ PA PB 


b ¢ c oa ab” 


V 


with equality if and only if P is the orthocenter or a vertex of ABC. 


To prove the lemma, let P be the origin in the complex plane and let a, B, y be 
complex numbers locating A, B,C, respectively. Then it is easy to verify the 
following algebraic identity 

B Y Y a a B 

oo tH root To mT ns = 

(a-y)(a-B) (B-a)(B-y) (y-B) (y¥-@) 
Taking magnitudes and applying the triangle inequality yields the desired in- 
equality. Equality holds when one of a, 8, y is 0, i.e. when P coincides with a 
vertex. If aBy # 0, equality holds for magnitudes if and only if each term of the 
equality involving a, B, y is real, i.e., if and only if P is the orthocenter of ABC (the 
point of intersection of the three altitudes). 

Now consider the original problem; let S be the expression on the left side of the 
desired inequality. The circumcircles of the triangles AB,C,, BC,A,, CA,B, are 
concurrent in the “Miquel point,” denoted by P. (Cf. Arnold Dresden, An Invita- 
tion to Mathematics, New York, Holt, 1935, pp. 233-239.) Let the radii of these 
circles and the circumcircle of ABC, respectively, be r,, 7,, 7,, r. Using the formulas 
a= 2rsin A, a, = 27,sin A, b, = 27,s1in B, c, = 27,8inC, we have 


S = 8rsin Asin B sinC(ar,r, + brzr, + cr) = (4F/r)(arnr, + bryr, + crn), 


because F = 2r’sin Asin B sin C. By the inequalities 27, > PA, 2r, > PB,2r, > PC 
and the lemma, 


S>(F/r)(a-PB-PC+b-PC-PA+c:PA- PB) >(F/r)abe = 4F’, 


with the last equality from the formula F = abc/(4r). Equality holds if and only if 
equality holds in all the inequalities used in the last step, which is true if and only if 
P is the orthocenter of ABC (1.e., A,, B,, C, are the feet of the three altitudes) or P 
coincides with one of the vertices of ABC and A,B,C, degenerates to one of its 
altitudes. Note that equality can hold for a non-degenerate triangle with obtuse 
angle A only when B, = C, = A and A, is the foot of the altitude from A. 


Editorial comment. The lemma in this proof appeared as long ago as T. Hayashi, 
“Two theorems on complex numbers,” Tohoku Math. J. 4(1913/14) 68-70. The 
history of the lemma can be found in M. S. Klamkin, “Triangle inequalities from 
the triangle inequality,” Elem. der Mathematik, 34(1979) 49-55. 


Also solved by the proposer. Three incorrect solutions were received. 


Elliptical Tangents 


E 3164 [1986, 566]. Proposed by Huseyn Demir, Middle East Technical University, 
Ankara, Turkey. 
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Let s, t be the lengths of the tangent line segments to an ellipse from an exterior 
point. Find the extreme values of the ratio s/t. 


Solution by Gene Arnold and Vaclav Konecny, Ferris State College, Big Rapids, 
MI. Consider an ellipse as the normal projection of a circle, from one plane to 
another in R°. Clearly tangents project to tangents and the minor axis of the ellipse 
is perpendicular to the intersection of the two planes. Since the ratio of any two 
intersecting tangents to the circle is 1, the extreme ratios of two such tangents to the 
ellipse will be attained when the ratio of one tangent to the circle to its projection is 
minimum while the other ratio is maximum. This happens when the projected 
tangents are respectively parallel to and perpendicular to the intersection of the 
planes. Thus the extreme ratios are those of the major axis to the minor axis, and its 
reciprocal. 


Editorial comment. M. S. Klamkin suggested study of the more difficult problem 
of the extreme values of |s — ¢l. 


Also solved by M. Barr (Canada), J. C. Binz (Switzerland), J. M. Cohen, J. Dou (Spain), J. Fukuta 
(Japan), P. L. Hon (Hong Kong), L. R. King, M. S. Klamkin (Canada), K.-W. Lau (Hong Kong), O. P. 
Lossers (The Netherlands), M. Pachter (South Africa), K. Schilling, J. H. Steelman, P. Tracy, D. B. Tyler, 
C. Vandermee (The Netherlands), and the proposer. One incorrect solution was received. 


Nonunits in Commutative Rings 


E 3165 [1986, 649]. Proposed by Liviu I. Nicolaescu (student), Iassy, Romania. 


Let R be a finite commutative ring with 1, |R| = k. Let N = N(R) denote the 
set of nonzero nonunits of R. Prove that N # @ forces |N| > |vk — 1 |. 


Solution by David R. Richman, University of South Carolina, Columbia, SC. Let 
x be a nonzero nonunit of R, and let A(x) denote the set of elements y in R such 
that yx = 0. The first isomorphism theorem shows that the additive groups Rx and 
R/A(x) are isomorphic, and thus |Rx| = |R|/|A(x)|. Note also that the elements 
of Rx and A(x) are nonunits, so |Rx|<|N|+1 and |A(x)| <|N|+ 1. Hence 
(|N| + 1)? > |Rx||A(x)| = |R =k. If f= |vk-—1], then (|N| +1)? > 7? or 
|N| > /. 


Editorial comment. As mentioned by several solvers, this result was originally due 
to N. Ganesan, in Properties of rings with a finite number of zero divisors, Math. 
Annalen, 157(1964) 215-218). Generalizations of the original result can be found in 
the following: 

N. Ganesan, Properties of rings with a finite number of zero divisors II, Math. 
Annalen, 161(1965) 241-246. 

K. Koh, On “Properties of rings with a finite number of zero divisors,” Math. 
Annalen, 171(1967) 79-80. 

R. Gilmer, Zero divisors in commutative rings, this MONTHLY, 93(1986) 382-387. 

D. MacHale, Wedderburn’s Theorem revisited, Bull. Irish Math. Soc. (Sept. 
1986) 44-48. 

The assertion of the problem is valid without the hypothesis of commutativity. 


Also solved by J. Bergen, J. M. Cohen, A. Facchini (Italy), 8. Fabbri and L. Jones, R. Gilmer, R. P. 
Infante, I. M. Isaacs, A. A. Jagers (The Netherlands), K.-W. Lau (Hong Kong), O. P. Lossers (The 
Netherlands), D. MacHale (Ireland), L. E. Mattics, B. Osofsky, P. Tracy, and the proposer. One incorrect 
solution was received. 
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A Polynomial Identity 


E 3166 [1986, 650]. Proposed by Armel Mercier, University of Quebec at Chicoutimi. 


Let n > 1 be an integer and let f be a polynomial of degree less than or equal to 
n. Show the following combinatorial identity: 


(47 |r +1) nln +:1)122"*14(0) 
2 (27 +1) 7 (2n + 2)! 


x + £0) 40) +f0) E> 


Solution I by Paul J. Zwier, Calvin College, Grand Rapids, MI. Since n is fixed, 
let L(/) denote the left side of the desired identity. Since L is a linear functional, it 
suffices to show that the required identity holds for a spanning set of the real 
polynomials of degree < n. 

Let f(x) =x —1 and f,(x) = x(x — 1)*7! for k > 1. Furthermore, let g(x) 
= —(1—.x’*)". The binomial formula yields 1 + g(x) = _1(—1)*Y "|, 
Applying this to the computation of L, we find L( fo) = fg(1 + g(x)) dx, L(f,) = 
fc ———_ t+ a(x) dx, and L(f,) = [(xD)*~*(1 + 9(x))],-; for k > 2, where xD is the 
differentiation operator followed by multiplication by x (actually, the formula for f, 
holds also for f,.) 

Now we evaluate these expressions using g(x) = —(1 — x*)". For fo, we sub- 
stitute x =sinz to get L(fp) =1-— {{/*cos?"*1zdz =1 — [n(n + 1)!27"*1/ 
(2n + 2)!]. For f,, we let u= 1 — x? to obtain L(f,) = (1/2) fi — u") du/( - 
u) = L"_41/(2j). Since g(1) = 0, L(f,) = 1. Finally, since (1 — x*)"~™ divides 
(xD)™(1 + g(x)) forl <m<n-—1, we have L(f,) =O for3 <k <n. 

We have chosen the spanning set f, so that f,(0) = —§,,, f,(1) = 6,, and f/(1) 
equals 1 for 0 < k < 2 and 0 for 3 < k < n. Here 6 denotes the Kronecker delta. 
By examination, then, the values computed for L(f,) above are the values of the 
right side for f,. Hence each real polynomial of degree at most n satisfies the 
identity. 


Solution II by Donald E. Knuth, Stanford University, Stanford, CA. The identity 
holds as well when f has degree n + 1. Let f be an arbitrary polynomial of degree 
at most n + 1. We can write f in terms of a polynomial g of degree less than n as 
fQj+1)=a+t+ (Qj +1)(b + 2 jg(/)), for some constants a and b. The given sum 
is then 


aE (-1)/"{"] 


j>0 5 - 2] Wt 


+ oY yn(7 \ = 


+E v5] 


+ g(0), 


- yeni 4 (- “2+ ¥ (-1)'(7) 


j>o0 


aa 


since Li 5 o(— iy ‘\g(i) = = 0 is the nth difference of g at 0. The second sum 
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evaluates to 2*"*'n!?/(2n + 1)! by setting x = 1/2 in the partial fraction expansion 


ee (2) 


joo jtx  x(x4+1)---(x4+n)- 


The first sum evaluates to L”_,1/j by subtracting 1/x from both sides of (*), 
letting x — 0, and applying lHospital’s rule. Finally, we use the facts f(0) = a, 
fQ) =a+b, f’A) =6b + g(0) to obtain the desired right side. 


Editorial comment. Most solvers showed that the identity holds for the powers x*, 


which involves a somewhat lengthier calculation than that of Solution I above. 
Michiel H. M. Smid used induction on n. The fact that the identity also holds for 
degree n + 1 can be obtained from Solution I by using the remark that (1 — x7) 
divides (xD)"~*(1 + g(x)) to conclude that L(f,,,) = 0. 

Also solved by J. C. Binz (Switzerland), L. Kuipers (Switzerland), K.-W. Lau (Hong Kong), O. P. 


Lossers (The Netherlands), M. H. M. Smid (The Netherlands), D. B. Tyler, University of South Alabama 
Problem Group, Western Maryland College Problems Group, and the proposer. 


Continuous and Differentiable 


E 3173 [1986, 733]. Proposed by H. W. Oliver, Williamstown, MA. 


Let the real-valued function f be differentiable on the open interval (a, b). For 
(x, vy) © D= {x, y): a<x<b,a<y <b}, define 


F(x, y) = mie for x # y, 


F(x, x) = f’(x). 
Give a necessary and sufficient condition (in terms of the function /) that 
(i) F be differentiable at each point of D; 
(i1) F be continuously differentiable on D. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 

(i) The condition is differentiability of f’. We use the interpretation of the 
derivative as a linear approximation to a function. I.e., a function g is differentiable 
with derivative g’(x) at x if and only if g(x + h) = g(x) + g’(x)h + o(h), where 
o(h) denotes a function whose ratio with [h| goes to 0 as h goes to 0. Similarly, for 
two variables, 


F(x +h, y +k) = F(x, y) + (DF(x, y), DoF(x, y)) - (h, k) + o((h, )I). 
If F is differentiable at each point of D, then clearly f’ is differentiable on (a, b). 
Conversely, if f is twice differentiable, then F is differentiable. This is immediate 


for off-diagonal points; to prove that F is differentiable at the point (x, x), we 
consider the function g(t) defined by 


g(0) = f(x +0) ~ o'(x) ~ 5e7"(x). 


Since g’(0) = 0 and g’(0) = 0, we have g’(t) = o(t). Applying the mean value 
theorem to the interval between h and k, we obtain a value c between h and k such 
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that 


ge) = g'(c) = o(|h| + |). 


The expression in terms of g on the left equals F(x +h,x +k) — F(x, x) - 
(1/2)(h + k) f(x). Since o({h| + |k[) = o(\(h, k)), this proves that F is differen- 
tiable at (x, x) with derivatives D, F(x, x) = D, F(x, x) = f’(x)/2. 

(ii) The condition is continuous differentiability of f’. If F is continuously 
differentiable, then f” is continuous. Conversely, if f” is continuous, we claim that 
D,F and D,F are continuous. For all (x, y) © D, integration by parts yields 
df'(y + t(x — y)) dt = F(x, y). Hence 


D.F(x, y) = fifa + t(x -y)) de, 


which is clearly a continuous function. 


Also solved by D. Doster, K. Schilling, and the proposer. 


Convergent and Divergent Series 


E 3174 [1986, 733]. Proposed by Albert Wilansky, Lehigh University, Bethlehem, 
PA. 


Let a © cs mean that 2a, converges. Show that 


(i) there exists b € cs such that inf, .., sup - , = 0, 
and ; 

oo ° . a, 

(11) there exists a © cs such that inf, ., sup,|1 — rae 0, 


Editorial comment. The problem tacitly assumes that b, # 0 for all n. 


Solution by W. Hensgen, Universitat Regensburg. Both assertions are implied by 
the existence of a © cs and b € cs such that a,, b, # 0 and a,/b, — 1. In case (i) 
we can take 


b, = (-1)"{1 + (-1)"/log(n +. 1)}/n (n= 1,2,...) 
and, for given k, 
a =b for l<n<k, a®=(-1)"/n for n>k. 
An analogous example can be given for (11). 


Also solved by D. Callan, J. N. Fitch, E. Hertz, O. P. Lossers (Netherlands), R. Neidinger, M. J. 
Reed, A. Riese, A. V. Stanoyevich, A. Stenger, L. Thurston, D. M. Wells, and the Univ. of South 
Alabama Problem Group, and partially by the proposer. 


A Sined Matrix 


E 3178 [1986, 812]. Proposed by G. A. Hively, Lockheed Palo Alto Research 
Laboratory, CA. 
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Let x, and y, be arbitrary real numbers. Show that the n X n matrix 


S= (sin(x, + y,)) 
is singular if n > 3. 


Solution I by Jack Tomsky, Lockheed Palo Alto Research Laboratory, CA. Let A 
be the n X 2 matrix whose jth row is (sin x,,cos x,), and let B be the n X 2 matrix 
whose kth row is (cos y,, sin y,). Because "sin(x, + yy.) = sin x,;Cos y, + 
cos x sin y,, we have S = AB’. Thus S has rank at most 2 and is singular for 
n> 3. 


Solution II by Bruce Dearden, University of North Dakota, Grand Forks, ND. The 
matrix S is the imaginary part of the matrix E = (exp i(x, + y,)). Let v and w be 
the column vectors whose jth entries are e’” and e'%, respectively. Since lor n > 3 
the entries of v are linearly dependent over the reals, the equation E = vw! implies 
that the rows of S are linearly dependent over the reals. 


Editorial comment. Jesus Ferrer (Spain) and Pei-Yuan Wu (Taiwan) both noted 
that the special case with x, = y, for all j appears as Problem 294 in Faddeev and 
Sominskii, Problems in Higher Algebra. A number of readers pointed out that 
analogous propositions are true when sin is replaced by other functions with 
suitable addition formulas. 


Also solved by the proposer and 40 other readers. 


ADVANCED PROBLEMS 


6576. Proposed by Hans U. Gerber, University of Lausanne, Switzerland. 


Suppose X,, X,,... are independent identically distributed real random variables 
with E( X,) = pw. Put 


S,=X, +X, ++: +X, for k=1,2,.... 


(a) If p < w <1, where p = —0.278465--- is the real root of xe!~* = —1, 
show that the series 


Sie Sk/kA 
1 


iMs 


converges with probability one. 
(b) If X,, X,,... are positive and if u» < 1, show that the expectation of 
oO 
yy Sk e7Sk/k! 
k=1 
is p/(1 — p). 
(c)* In (b) is it possible to relax the condition that the random variables are 
positive? For example, would it suffice to assume E(|X,|) < oo and p < p< 1? 


6577. Proposed by B. Bagchi, G. Misra, and N. S. N. Sastry, Indian Statistical 
Institute, Calcutta, India. 
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(a) Let H be an infinite-dimensional inner-product space. Suppose that finitely 
many closed balls cover the surface S of the unit ball B in H. Prove that these balls 
also cover the center of B. 

(b) Does the above assertion remain valid if H is any infinite-dimensional, 
normed linear space? 


6578. Proposed by S. T. Stefanov, Sofia, Bulgaria. 


A fluid flows on a 2-sphere S with continuous velocity vector v(x, t). Let T(x, t) 
be the temperature of the fluid at the point x © S at the time ¢. Assume that 
T(x,0) < Ty and T(x,1) > T) for all x € S and some 7). Prove that there exist 
Xo © S and ty € (0,1) such that T(Xo, to) = Tp and 0( Xo, to) = 0. 


SOLUTIONS OF ADVANCED PROBLEMS 
Critical Points of Gateaux Differentiable Functions 


6467 [1984, 441]. Proposed by Bu Qi-yue (student), Shanghai Jiao Tong Univer- 
sity, People’s Republic of China. 


Suppose that g: D C R” — R' has a second Gateaux derivative at X, € int D. 
Suppose also that Xp is a critical point of g and g’”(X,)) > 0. Must X, be a proper 
local minimizer of g? Answer the same question with “Fréchet” instead of 
“ Gateaux.” 


Editorial Remark. A function ¢: D — R, where D is an open set in R”, is said to 
be Gateaux differentiable at a point x, € D if there exists a linear function 
$6,x,, R" 7 R 
such that 


o( x, + th) — d(x 
walt) = in 2E0* = 90) ye ge 
t- 


The function ¢ is said to be Fréchet differentiable at x, if there exists a linear 
function 


Or. Xo. R"’ > R 
such that 
(x9 +h) — $(xo) — $%,x,(2) 


m =0, allheR’”. 
|All > 0 AI 


The functions ¢¢ ,, and ¢%, ,, if they exist, are unique and are called, respectively, 
the Gateaux and the Fréchet derivatives of } at xp. 


Solution by the proposer. It is clear that if g is twice Fréchet differentiable at Xo, 
then the stated conditions guarantee a proper local minimizer at X>). But if g is 
assumed only to be twice Gateaux differentiable at X,, then the situation is 
different. 
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For example, fix n = 2 and let D be the unit disc in the plane. For 0 < 6 < 27 
let C, and C, be the curves 


r=cos(8/4), r= (1/2)cos(8/4), 


respectively. Denote by S the open set consisting of the points between the two 
curves (thus S is a band that narrows while spiraling into the origin). Let 


2 
so) (EY ES 
For h, k © D, a computation shows that 
g’(0,0) =0 
and 
h-g"(0,0)k =2h-k. 
Therefore, g’’(0, 0) is positive definite, but obviously 0 is not a local minimizer of g. 


No other correct solutions were received. 


Lipschitz vs. Holder 


6529 [1986, 743]. Proposed by O. Hajek, Case Western Reserve University. 


Let f,, f5,... be a sequence of functions from R! to R! with common Lipschitz 
constant A. Show that each continuous function f with 


(graph f) ¢ LU (graph f,) 
k=1 


also has Lipschitz constant \. (The condition on the graph of f may be restated as 
follows: for each real x there is a k with f(x) = f,(x).) Is this true if the Lipschitz 
condition is replaced by a Hélder condition of order a for some fixed a with 
0<a<1? 


Combined solution by the proposer and M. J. Pelling, London, England. By 
considering Ax + f,(x) one need only prove this: if all f, are increasing, then so is 
each continuous f with 


(graph f) c U (graph /;,). 
Assume the contrary, that somewhere f(a) > f(b) with a < b. We shall show 
that f strictly decreases on an uncountable set, and a contradiction will ensue. By 


decreasing the interval [a, b] if necessary, we may even assume that f(a) > f(x) > 
f(b) for all x © [a, b]. By continuity, for each ¢ & [ f(b), f(a)], we may define 


g(t): =min{x € [a, b|: f(x) =+¢}. 
Then f(g(t)) = ¢t, and t < s implies g(t) > g(s) (since, by the intermediate value 


theorem, both = and < lead to a contradiction). 
For each t € [ f(b), f(a)] there is an index k such that 


t = f(g(t)) =f,(g(¢)). 


To different ¢’s there must correspond distinct k’s. Indeed, if t<s and s= 
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fr Z(s)), then g(t) > g(s); since t < s and f, is increasing, we cannot have m = k. 
Thus there are continuum-many t’s but only countably many k’s: a contradiction. 
The Lipschitz condition cannot be replaced by a Holder condition. For example, 
if a = 1/2 and A = 1 define 
fi(x) = vx, O<x <0, 
O0<x<l, 


1 
h={F 1<x < o, 


and 


fi(x) O<x<l, 
f(x) 1l<sx<oo. 


f(x) = | 


Then f, and f, satisfy the order 1/2 Hélder condition with constant A = 1, but / 
does not since 


Lf(2) — f(0)| = 2 > 1(2 — 0)” = y2. 
Here the counterexample is due to M. J. Pelling, while the affirmative part is due 
to the proposer. 
Also solved by Pavol Ralbovsky (Czechoslovakia). 


Graphs with Many Horizontal Chords 


6530 [1986, 743]. Proposed by Wolfgang Walter, Universitat Karlsruhe, Federal 
Republic of Germany. 


Let f be a real-valued continuous function on (0,1) and {a,},{b,} two 
sequences of real numbers such that a, # b,, n = 1,2,3,..., and 
lim a,, = lim b, = 0. 
Assume that for each x in (0,1) there exists an N = N(x) for which 
f(x+a,)=f(x+b,), n>M(x). 
Must f be a constant function if 0 < b, <a,? 
*What if b, = —a,? 


Partial solution by O. P. Lossers, Eindhoven University of Technology, The 
Netherlands. The answer to the first question is “No.” This is seen as follows. Write 
x € (0,1) in decimal notation, 


x= Vel" (Ose, <9). 
n=1 
For x & (0,1) let M(x) be the smallest integer n such that ¢,,,, # 0. Then define 
M(x) 
f(x): = & e,.107" + 107%™, 
n=1 
(if ¢,,,, = 0 for all n then define f(x): = L9_,¢€,,.10~”). It is easily verified that f 
is uniquely determined. Let a, =2-10~7""1 and b, = 1072"~*. Then f, a,, b, 
satisfy the given conditions but f is not constant. The function f is continuous, 
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because for all n 


Ix — y| $107" = | f(x) — f(y)| < 107. 
Next, if M(x) = oo then clearly f(x + a,) = f(x + b,). Finally, if M(x) = M, 
where x is represented so that it does not end in a string of 9’s and N is the first 
position after M with e, < 9, then f(x + a,) = f(x + b,) for n > N/2. 


Editorial comment. Adam Riese independently provided an essentially identical 
solution, and also showed that if b,,, = a,, then f must be constant. His argument 
is to assume that 

f(x) # f(x), 0<x, <x, <1, 


and define 
C= {x © [x, x]: f(x) =f(~)}. 
Then C is closed and hence has a largest number c; clearly f(x) # f(c) for x in 
(c, x,]. Now 
fle +a,) =flet+ b,) = fle + a,-1) 


for n sufficiently large. Since 
lim f(c+a,) =f(c), 


it follows that f(c + a,) = f(c) for n sufficiently large, a contradiction for 0 < a,,. 
M. J. Pelling and John Henry Steelman both pointed out the similarity of their 
counterexample functions to the classical Cantor singular function. Steelman ob- 
served that if f(x + a,) = f(x — a,) for large n and 
S = {x € (0,1): f is constant in a neighborhood of x}, 


then S is an open dense subset of (0,1). This follows by a Baire category argument. 
If f is not constant, the complement of S must be uncountable. This suggests that 
any such nonconstant f must “resemble” a Cantor function. 


No other solutions were received. 


Holomorphic Functions Satisfying a Triangle Equality 


6533 [1987, 81]. Proposed by Boo Rim Choe (graduate student), University of 
Wisconsin- Madison. 


Let ¢ be a fixed real number. Find all functions f(z) holomorphic in a disc D 
centered at z = 0 that satisfy 


F(Z) = IFO) + FO) 


for all z= x + iy in D. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
Aside from f(z) = 0, the only possible solutions are f(z) = Ke®’ for real p and 
complex K, and f(z) = Az? for complex A. These can only occur when ¢ = 0 
and t = 1, respectively. To prove this it is natural to consider the cases f(0) # 0 and 
f(0) = 0 separately. In the first case ¢ must be zero, so |f(z)| depends only on x, 
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1.€., 
f(z) = g(x) em, 
where g and h are real, twice-continuously differentiable functions of their vari- 


ables. The Cauchy-Riemann equations, applied either to f(z) or to log f(z), quickly 
yield 


dh dh 
g(x) = a(x) 5 and 0 = g(x). 
Hence h depends only on y and 
g(x) 
g(x) 


for some real constant p. These differential equations are easily solved to yield 
f(z) = c,e'e°?. The case f(0) = 0 but f(z) # 0 is a bit harder. Say 


=h'(y) =p 


f(z) =Az*¥+ YO e.z”, A #0 


m=k+1 
is the power series of f for z = re’® in a small neighborhood of zero. Then 


f(z)|_|f(x) x*1 | £0) ¥* 


Zz xk zk yk gk 


yields 
|A| = |A|(|cos ¢|* + zsing|*), all ¢, 


so t= 1 and k = 2. Now any solution other than f(z) = Az? would lead to a 
solution of the form 


f(z) = 2? + Bez! + O(2'*}), 
where B > 0 and / > 3. Then r — 0 together with (*) for k = 2 implies 
[l + Ber’ 7e— | + O(r'~1) = cos’*p[l + Ber’ cos’ 7o| 
+sin’¢ [1 + Ber! *sin'~ 9]. 
Since 
l + pe“| — 1 = pcosa + O(p’) 


for small p, and />3 implies 2/-4>/-— 1, the difference of the left- and 
right-hand sides of the equation in the previous sentence is 


Br'~?|cos(@ + (1 — 2)@) — cos @ cos‘ — cos 6 sin'p| + O(r'~'). 
From this we first deduce 
cos(@ + (1 — 2)¢) = cos A cos‘ + cos sin’, 
and then (differentiate and set ¢ = 0 to see that 6 = nz, n an integer) the absurdity 


cos(/ — 2)@ = cos‘ + sing, all ¢. 


Also solved by Rainer Briick (Federal Republic of Germany), B. Crofoot and K. Kearnes (jointly), 
Kee-wai Lau (Hong Kong), Douglas B. Tyler, and the proposer. Two incomplete solutions were received. 


REVIEWS 


EDITED BY JOSEPH KONHAUSER 
Department of Mathematics, Macalester College, St. Paul, MN 55105 


Metric Spaces: Iteration and Application. By Victor Bryant. Cambridge University 
Press, Cambridge, 1985. v + 105 pp. 


MICHAEL EDELSTEIN 
Department of Mathematics, Dalhousie University, Halifax, Nova Scotia B3H 3J5 Canada 


Classes in mathematics, made up of students of widely varying backgrounds, 
pose something of a challenge to an instructor. To reach a diverse audience, 
including the less capable individuals in it, a teacher must show that the material is 
both interesting and useful. Thus extra time for illustrations, examples and applica- 
tions is needed; furthermore, a slower lecture pace must be adopted. In these 
circumstances it may be difficult to cover adequately the anticipated course content. 
In general, the experienced teacher knows how to strike an appropriate balance. In 
the book under review such a balance is, unfortunately, missing. 

Designed as a course for combined honors students, the book is meant to be 
elementary or—as the author puts it— “down to earth.” To make the material easier 
to absorb much of the standard metric space theory is either omitted or diluted. 
(For example, separability, total boundedness, the process of completion are omitted; 
connectedness is dealt with all too briefly.) Motivation, on the other hand, is 
emphasized throughout. 

The starting point and recurrent theme, taking up the greater part of the book, is 
that of iteration; i.e., sequences arising from iterative procedures. Here a wealth of 
interesting examples is given, culminating in the proof of an implicit function 
theorem. Unquestionably, this topic is presented in an appealing manner; and in a 
much larger book it would have been entirely appropriate. Some paring down of this 
material seems to be called for, and it may be of some interest to consider the 
following scheme for a more condensed, yet forceful, treatment of this particular 
topic. 

Before plunging into course work proper, students might be challenged with a 
simple concrete exercise. For example, they could be asked to produce a program 
(in Basic or any other computer language) solving, to a reasonable approximation, 
the equation x — cosx = 0 (0 < x < 1). (With the current popularity of computer 
science courses this should be quite easy.) Satisfied that this was accomplished, the 
instructor might ask for the seemingly easier problem of solving the same equation 
with sine replacing cosine, but with initial approximation x, = 1. Students will, of 
course, unavoidably find, that for the former equation, twenty iterations suffice to 
produce the approximation x =~ 0.739, correct to the last digit; while, in contrast, 
even after 10,000 iterations, x ~ 0.017 is incorrect in the last two digits. 

Why should this be so? Do one or both of the above two sequences converge? If 
both do, why is there such a disparity in the rate of convergence? In particular, why 
is it so much harder to attain an improved approximation for the second equation 
(when starting with x, # 0)? 
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These are thought-provoking questions that lead quite naturally to a discussion of 
contractions and other mappings with related properties. A more comprehensive 
study of such problems might have to be postponed or omitted altogether, but with 
a little preparation, it can be pointed out in this context that iterates under a 
contraction always form a Cauchy sequence. 

The above scheme, if implemented, would still leave a text, already rather 
shorthanded in theory, even more lacking. The need for its enrichment would thus 
be more compelling. What additional material should be included? There is an 
embarassment of riches here and any practical answer must be somewhat arbitrary. 

At a minimum one would want to see precompactness included so as to 
intertwine, via that notion, compactness and completeness. In the three C’s men- 
tioned by the author, connectedness should be restored to its rightful place along- 
side completeness and compactness, displacing closedness(!). And, of course, con- 
nectedness has to be given more prominence. Separability and second countability 
properties could be included, with the intention of exhibiting the delightful embed- 
ding of such spaces into the Hilbert cube. 

Finally, a remark or two about general comments. A class filled with budding 
chemists, engineers, etc., should be issued what may be called an invitation to 
mathematics. By this I mean the general, historical, and cultural asides which show 
that mathematics is an evolving human endeavor. More specifically, the genesis of 
the notion of a metric space and its position in various branches of mathematics 
should be treated. Further, to cultivate and pay homage to the human aspect of 
mathematical creativity, names of outstanding contributors to the discipline should 
be mentioned. A course in metric spaces looks arid and deprived without the names 
of Fréchet, Hausdorff, Urysohn, and Banach in it. 


Mathematics for Liberal Arts: A Problem Solving Approach, by Rick Billstein and 
Johnny W. Lott. The Benjamin /Cummings Publishing Co., 1986, xvii + 646 pp. 
Mathematical Ideas, fifth edition, by Charles D. Miller and Vern E. Heeren, Scott, 

Foresman and Company, 1986, xiv + 751 pp. 


ALVIN M. WHITE 
Department of Mathematics, Harvey Mudd College, Claremont, CA 91711 


Billstein and Lott start each chapter with a preliminary problem from everyday 
experience which motivates the topics of the chapter. After definitions, techniques, 
examples and exercises, the chapter ends with an analysis of the solution of the 
preliminary problem following Polya’s advice: Understanding the problem; Devis- 
ing a plan; Carrying out the plan; Looking back. Scattered through the book are 
brain teasers whose statements are easy to understand but whose solutions are not 
obvious or easy. The brain teasers would be appropriate challenges for groups of 
students to ponder. 

Miller and Heeren is the fifth edition of a book that is more straightforward in its 
presentation of definitions, techniques, examples and exercises. 

The first book uses cartoons “to add a lighter touch.” The second book features 
postage stamps of famous mathematicians in the margin with a postage stamp’s 
worth of explanation of why that mathematician is so honored. 
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Both books are designed for liberal arts students, and in that genre they are 
typical. Both offer essentially the same content at the same level in similar styles. 
The topics are standard, elementary finite mathematics. Almost all of the material is 
algorithmic: Here is a method—There is a set of exercises that can be done using 
that method. Except for Babylonian and Hindu-Arabic numeration, the material is 
presented as a timeless collection of facts and techniques with no history or 
evolution. The historical notes in one and the postage stamps of mathematicians in 
the other are hardly sufficient to arouse the curiosity of the reader except to think, 
“Who are these people and why are their pictures in the margin of my book?” The 
many separate topics have no evident connection, very little motivation, and no hint 
or guide to possible understanding at a deeper level. 

Most mathematics for liberal arts texts miss an opportunity to serve the liberal 
arts students. What is usually offered is elementary mathematics, superficially 
treated, presented with a finality that forecloses further inquiry, connections or 
development. 

An alternative approach could put mathematics in the center of the liberal arts. 
Instead of formulas and routines explained in a few pages, if at all, ideas would 
receive the main attention. Certain ideas have made mathematics possible and have 
enriched the liberal arts. Mathematics, as intellectual history, has changed our world 
view and altered our modes of thought. The traditional text introduces Cartesian 
coordinates in order to present the equations of a straight line and a circle, followed 
by exercises. The equation of a straight line and the location of its x and y 
intercepts are of little concern to liberal arts students. The enormous influence of 
Descartes in mathematics as well as in philosophy throughout Europe ((3], [8)]), 
however, is completely ignored. An opportunity to connect mathematics with liberal 
arts is lost. 

A valid question is whether poorly understood technicalities are more valuable 
for liberal arts students than important concepts and discoveries of mathematics. If 
straight lines are introduced, they could be followed by the idea of parallel lines and 
a discussion of Euclid’s fifth postulate. A valuable contribution to liberal arts from 
mathematics would flow from a discussion of axiomatic systems. The story of the 
search for a proof of the fifth postulate is an exciting chapter of mathematics as a 
human adventure. The discovery of Non-Euclidean geometry, which resolved the 
search, has been compared to the Copernican revolution. Here are mathematical 
adventures and liberal arts combined. 


Relevance to Liberal Arts. Liberal arts students are interested in their place in 
the world and a sense of how they got there. The principles underlying our world 
view are intimately related to mathematics at least since Pythagoras. The role of 
mathematics in modern science is relevant, and the place of modern science in 
coloring our ways of thought and mentality is fundamental. Writing about the rise 
of modern science, Whitehead [8] declares, “Since a babe was born in a manger, it 
may be doubted whether so great a thing has happened with so little stir... The 
new mentality is more important even than the new science and the new technology. 
It has altered the metaphysical presuppositions and the imaginative content of our 
minds, so that now the old stimuli provoke a new response.” 

Mathematics is more than facts to be memorized or techniques to be mastered. 
Most texts, however, give us only facts and techniques. S. K. Stein described that 
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state of affairs in his talk, “Gresham’s Law: Algorithm Drives out Thought” [6]. 
“Gresham’s law in economics says, ‘Bad money drives good money out of circula- 
tion...’ Gresham’s law in mathematical pedagogy can be stated—‘Algorithm drives 
out thought’. ‘The robotic displaces humanistic’. ‘Cultivation of algorithms replaces 
concern for thinking and writing’... Algorithms, of course, are good and must be 
taught.... But the temptation to emphasize drill over understanding is almost 
irresistible” — and algorithms are easier to teach and test. 


Imagination and Invention. Albert Einstein is quoted on posters, tee shirts and 
bumper strips: 


Imagination is More Important Than Knowledge. 


That sentiment should be evident in mathematics, especially for liberal arts 
students. Imagination and invention are what make mathematics part of the liberal 
arts. Whitehead [8] describes Pure Mathematics as the most “original creation of the 
human spirit.” But our students see neither creation nor spirit. Regarding the basic 
topics that we teach, the question is never raised, ‘Why so?’ or ‘How does one arrive 
at them?’ “Yet all these matters must at one time have been goals of an urgent 
quest, answers to burning questions, at the time, namely, when they were created. If 
we were to go back to the origins of these ideas, they would lose that dead 
appearance of cut-and-dried facts and instead take on fresh and vibrant life again” [7]. 


Thematic Continuity. The value of mathematics to liberal arts students is not in a 
few facts and techniques. The statement and proof of the Pythagorean theorem are 
not as important as the idea of proof. The invention and development of deductive 
reasoning is one of the triumphs of intellect. What is the purpose of proof in 
mathematics and how does it compare with proof in physics or anthropology? These 
are questions that are in the mainstream of liberal arts—certainly more so than 
Venn diagrams or BASIC programming. 

The Pythagoreans offer the student much more than a famous theorem. Toeplitz 
[7] relates two proofs of the irrationality of /2 that he attributes to the Pythagoreans. 
The second proof uses an elementary geometrical construction instead of facts 
about even and odd numbers, and proves that it is impossible to find a common 
measure for the side and diagonal of a square. The meaning of an impossibility 
proof is certainly worthy of discussion. Toeplitz writes of this first impossibility 
proof. “This great discovery, more than anything else, inaugurated the character of 
modern mathematics.” 

Two thousand years later Fermat uses the same method, now called “infinite 
descent,” to prove several theorems, including a simple case of his last theorem. 
Thematic continuity is evident in mathematics, science and liberal arts. 

Almost three hundred years after Fermat, in his Nobel Prize speech in 1922 Niels 
Bohr describes how the concept of the quantum had led gradually to a systematic 
classification of any electron in an atom, offering a complete explanation of the 
remarkable relationships between the physical and chemical properties of the 
elements, as expressed in the periodic table of Mendeleev. “Such an interpretation 
of the properties of matter appeared as a realization; even surpassing the dreams of 
the Pythagoreans of the ancient ideal of reducing the formulation of the laws of 
nature to considerations of pure number.” [1] 
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Examples. Using appropriate mathematics as illustrations, texts and teachers can 
trace the development of vague ideas to polished concepts. The discovery, evolution 
and meaning of rigor is an example that might show that mathematics involves more 
than memorization and might indicate some reasons why mathematics has endured 
so vigorously. The role of human needs and invention might become evident with a 
description of Zeno’s paradox and a discussion of the infinite from Pythagoras to 
Cantor to Hilbert. A discussion about the paradoxes and other crises of set theory 
would contribute to appreciation of mathematics as a human activity. Technical 
definitions and routines should include meaning and context. Liberal arts students 
are not automatons who can proceed without understanding. 

It would be worth introducing some elementary calculus so that the class could 
divide into two debate teams and a jury to consider Bishop Berkeley’s attack on 
Newton’s ideas on one side versus the ideas of Newton’s defenders on the other. The 
debate-discussion would clarify the issues and ideas and be in the spirit of liberal 
arts. 


Influence of Newton. Should the formulas for derivatives be the extent of the 
liberal arts students’ acquaintance with Newton? E. A. Burtt describes Newton as 
the one Englishman whose authority and influence in modern times has rivaled that 
of Aristotle over the late medieval epoch. Newton was even more honored and 
respected in the eighteenth century than he is today. He not only brought glory to 
his country, “but has done honor to human nature by having extended the greatest 
and most noble of our faculties, reason, to subjects, which till he attempted them, 
appeared to be wholly beyond the reach of our limited capacities.” Newton set the 
scientific program for two hundred years. The whole orientation of philosophy was 
shifted by his ideas [3]. It would seem unfair, especially to liberal arts students, to 
represent Newton’s contributions by formulas that can now be produced by 
calculator keys. 

Whitehead reviews the genesis of modern science in the Century of Genius [8]. 
“The subject of the formation of the three laws of motion and of the law of 
gravitation deserves critical attention. The whole development of thought occupied 
exactly two generations. It commenced with Galileo and ended with Newton’s 
Principia; and Newton was born in the year Galileo died. Also the lives of Descartes 
and Huyghens fall within the period occupied by these great terminal figures. The 
issue of the combined labours of these four men has some right to be considered as 
the greatest single intellectual success which mankind has achieved...” The law of 
inertia “contains the repudiation of a belief which had blocked the progress of 
physics for two thousand years. It also deals with a fundamental concept which is 
essential to scientific theory; I mean, the concept of an ideally isolated system. This 
conception embodies a fundamental character of things, without which science or 
indeed any knowledge on the part of a finite intellect would be impossible.” 

Those are heady sentiments that are ignored by many authors of mathematics for 
liberal arts texts. Although mathematics is central to modern science and modern 
western culture, most texts bypass these important aspects of mathematics and 
offer, instead, essentially no intellectual content or challenge. The mind is certainly 
not stretched or inspired by the contents of these texts. Is math or science literacy 
better enhanced by memorizing algorithms without understanding, or by consider- 
ing how. certain mathematical and scientific ideas changed the direction of civiliza- 
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tion? Of course, it may be necessary to study preliminary ideas and concepts in 
order to understand the pivotal moments. But then the preliminary study is given a 
purpose and direction. 


Report of MSEB. It is difficult to break with tradition, as I advocate. But the 
tradition has not served us well. SIAM News (May 1987) reports on the conclusions 
of the Mathematical Sciences Education Board (MSEB). The mathematical skills of 
American precollege students, when compared to comparable students in other 
industrialized countries, rank near the bottom. “Boring and repetitious curriculum, 
low expectations, and negative attitudes have been identified as leading culprits.” 
Other disappointing conclusions are: “An average European or Japanese student is 
typically better prepared than the average American; U.S. standardized tests mea- 
sure largely irrelevant skills; U.S. students spend about 40 hours a year more than 
Japanese students on mathematics; and the U.S. curriculum’s focus on computation 
is one of its biggest weaknesses.” 


The Project Physics Course. If breaking with tradition for texts is considered 
radical or just avant-garde among mathematicians, it is old hat among physicists. 
While mathematicians were creating and refining the New Math, physicists were 
creating and publishing The Project Physics Course in high school and college 
versions. In 1952 Gerald Holton wrote Introduction to Concepts and Theories in 
Physical Science (Addison-Wesley). This was followed by Foundations of Modern 
Physical Science in 1958 (with Duane H. D. Roller) and the second edition of 
Concepts and Theories (with Stephen Brush) in 1972. In 1970 Holt, Rinehart and 
Winston published The Project Physics Course which was an integrated combination 
of text, laboratory experiment films, transparencies, and teacher resource books. 

The following quotes are from the prefaces of Concepts and Theories. [5] 


Perhaps the most striking characteristic of this book is the emphasis it puts on 
the nature of discovery, reasoning, and concept formation in science as a 
fascinating topic in its own right. This means that the historical and 
philosophical aspects of the exposition are not merely sugar-coating to enable 
the reader to swallow the material as easily as possible, but are presented for 
their own inherent interest. Indeed this is not intended to be an “easy” 
book... by presenting neither an encyclopedic account of all details nor a 
survey treatment, by concentrating instead on the meaning and the power of 
some basic ideas...one can hope to leave the student with a view of physical 
science as an interconnected achievement; one can engage his attention in those 
lengthier, more rigorous, penetrating arguments and derivations without which 
an honest understanding of the processes of science is hardly attainable...” 


Quoting from the Report on General Education in a Free Society (Harvard 
University Press, 1945, pp. 220—222), 


From the viewpoint of general education the principal criticism... (is that) 
most of the time...is devoted to developing a technical vocabulary and 
technical skills ... Comparatively little serious attention is given to the examina- 
tion of basic concepts, the nature of scientific enterprise, the historical 
development, ...its great literature, or its interrelationships with other areas... 
What such courses frequently supply are only the bricks of the scientific 
structure. The student who goes on to more advanced work can build 
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something with them. The general student is more likely to be left simply with 
bricks... The emphasis on historical development...is in no sense to 
constitute a humanistic garnishing of its factual material. On the contrary, it is 
introduced to illuminate and vitalize the content with which it is integrated. 


I have quoted extensively from the prefaces of Holton’s books because I think 
that their point of view would very well serve the needs of liberal arts and other 
students taking mathematics courses. The ignoring or denying of that point of view 
by authors and teachers, in my opinion, will lead to further isolation and alienation 
of mathematics from liberal arts students. 


Medical Education. When we consider extending our teaching beyond facts and 
techniques, we are confronted with felt time pressure. “There is so much to cover, 
that there is no time for extraneous considerations,” is a common response. Medical 
education, which is notorious for its demands for the memorization of facts, 
however, is changing. 

A former chairman of a Department of Medicine returned to medical school as a 
full-time student because he was troubled by the problems that he perceived in 
medical education. He wrote about his experience in the New England Journal of 
Medicine. [4] 

Learning is a thinking, problem solving process that requires time. Medical 

school education today involves too little thinking and problem solving. It 

consists largely of too much fact in too little time.... 

Let me be clear and not misunderstood. Facts are essential. Problems cannot 
be solved without the sequential arrangement of facts. But in medicine, the 
answer may not be there even after the facts are arranged. Students must learn 
to handle uncertainty: that too is medicine. Emphasis on facts does not teach 
this aspect of medicine any more than it teaches problem solving.... It is vital 
to replace the concept of learning as fact gathering to pass examinations with 
the concept of education as inquisitiveness, sequential thought, problem 
solving, and the satisfactions that result. 


The Association of Medical Colleges issued a report in 1984 (“Physicians for the 
Twenty-First Century”). Recommendation 1 states, 


In the general professional education of the physician, medical faculties should 
emphasize the acquisition and development of skills, values and attitudes by 
students at least to the same extent that they do their acquisition of knowledge. 
To do this, medical faculties must limit the amount of factual information that 
students are expected to memorize. 


Harvard Medical School (New Pathways) and McMaster University Medical 
School are two that have decreased lectures and coverage in their effort to improve 
medical education. 


Humanistic Mathematics. Some mathematicians are searching for new ap- 
proaches to teaching. In March 1986 The Exxon Education Foundation supported a 
three day Conference to Examine Mathematics as a Humanistic Discipline at 
Harvey Mudd College, Claremont, California. A common response of the thirty-six 
mathematicians who participated was “I was startled to see so many who thought as 
I do”. 
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One theme that emerged was the need to place the student more centrally in the 
position of inquirer than is generally the case, while at the same time acknowledging 
the emotional climate of the activity of learning mathematics. A second theme was 
the need to reconstruct the curriculum and the discipline of mathematics itself. The 
reconstruction would relate mathematical discoveries to personal courage, relate 
discovery to verification, mathematics to science, truth to utility, and in general, to 
relate mathematics to the culture in which it is embedded. There is a movement to 
continue and broaden the network of mathematicians who would be mutually 
supportive of efforts to teach and study mathematics according to the themes of the 
conference. 

Mathematics is open ended. Its concepts, methods and contacts with other 
disciplines are expanding. Easily understood unsolved problems and unanswered 
questions abound. Philosophic controversy swirls around it. But most students 
perceive mathematics as a dead, unchanging, irrelevant subject. 

Students at every level could come to appreciate that there are unanswered 
questions as well as unanswerable questions (and to know the difference) by 
exploring and inventing questions. Mathematics is more than solving problems or 
proving things. Finding a new idea or a metaphor to describe an old idea, evaluating 
the significance of concepts and seeing new connections are also reasonable activi- 
ties in mathematics. [2] 
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General, L. Quantum Implications: Essays in Hon- 
our of David Bohm. Ed: B.J. Hiley, F. David Peat. 
Routledge & Kegan Paul, 1987, vii + 455 pp, $49.95. 
[ISBN: 0-7102-0806-5] Thirty essays on topics rang- 
ing from “Hidden variables and the implicit order” 
(by Bohm), to “Meaning and being in contemporary 
physics,” to “The automorphism group of C4,” to 
“Wholeness and dreaming.” Quite a statement on 
the breadth and depth of Bohm’s influence on mod- 
ern science. Contributors include Penrose, Feynman, 
and R. Weber. MR 


Finite Mathematics, T(18). Mathematics with 
Applications for the Management, Life, and Social 
Sciences, Third Edition. Howard Anton, et al. Har- 
court Brace Jovanovich, 1988, xv + 1029 pp, $32. 
[ISBN: 0-15-555237-6] A good introductory book 
which presents nice motivation for definitions. In- 
cludes standard precalculus material; in the final 
chapters, the authors take an informal and intuitive 
approach to develop the fundamental ideas and ap- 
plications of differential and integral calculus, differ- 
ential equations, and functions of several variables. 
Exercise sets include drills as well as many word 
problems. (Second Edition, TR, April 1984.) LW 


Education, P*, L*. Computers and Mathemat- 
ics: The Use of Computers in Undergraduate In- 
struction. Ed: David A. Smith, et al. MAA Notes 
No. 9. MAA, 1988, xi + 147 pp, (P). [ISBN: 0-88385- 
059-1] Nineteen surveys by pedagogical pioneers who 
are exploring the use of computing across the entire 
undergraduate curriculum—from calculus to statis- 
tics, from differential equations to abstract algebra. 
A project of the MAA Committee on Computers in 
Mathematics Education (CCIME), this is no mere 
committee report, but a stimulating anthology of the 
best current exemplars on this important frontier of 
teaching. LAS 


History, S, L. Studies in the History of Mathemat- 
tcs. Ed: Esther R. Phillips. Stud. in Math., V. 


26. MAA, 1987, 308 pp, $32. (ISBN: 0-88385-128-8] 
“Collection of [ten] articles on modern and contem- 
porary mathematics ...whose purpose is to provide 
a sample of recent scholarship in the history of math- 
ematics and to suggest additional sources for further 
study.” Each article analyzes a particular event or 
development in the history of mathematics. BH 


History, P, L**. Gesammelte Abhandlungen, Col- 
lected Papers. Friedrich Hirzebruch. Springer- Ver- 
lag, 1987, $195 set [ISBN: 0-387-18087-7]. Band I, 
1951-1962, viii + 814 pp; Band II, 1969-1987, iv 
+ 818 pp. Virtually all of Hirzebruch’s papers, in 
chronological order, with added commentary (at the 
end of each volume) by the author keyed to occa- 
sional marginal indicators to indicate progress since 
original publication. LAS 


Number Theory. 1017 Problems. A.S. Moiseenko 
(10-12 Kimball St., Belleville, NJ), 1987, 527 pp, 
(P). The problems all consict of specific Diophantine 
equations and their solutions. A photocopy of the 
author’s handwriting. No text; just equations and 
solutions. CEC 

Linear Algebra, T**(14-15: 1, 2), S, L*. Linear 
Algebra and Its Applications, Third Edition. Gilbert 
Strang. Harcourt Brace Jovanovich, 1988, xii + 
505 pp, $30. (ISBN: 0-15-551005-3] “Linear alge- 
bra is a fantastic subject.” This new edition of a 
path-breaking text includes a new section on the 
Fast-Fourier Transform, many new exercises, and a 
greatly expanded appendix on computer codes, pack- 
ages, and experiments. In addition, linear transfor- 
mations are now integrated throughout the text. An 
exciting book that “allows pure mathematicians to 
teach applied mathematics” and, in the process, dis- 
plays as few other texts do the eye-opening power 
and beauty of linear algebra. (First Edition, TR, 
June-July 1976, Extended Review, January 1978; 
Second Edition, TR, October 1980.) LAS 


Algebra, T(18: 2), S, P. Structure of Blocks of 
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Group Algebras. Gregory Karpilovsky. Mono. & 
Surv. in Pure & Appl. Math., V. 33. Longman 
Scientific & Technical (US Distr: Wiley), 1987, xviii 
+ 427 pp, $179.95. [ISBN: 0-470-20747-7] A self- 
contained, up-to-date account of the ring-theoretic 
structure of blocks of modular group algebras. Ties 
together the various developments of the last twenty 
years, and gives a picture of the current state of the 
subject. Gives a thorough list of references. No ex- 
ercises. Expensive! CEC 


Real Analysis, T(15: 2), S, L*. Real Analysis 
with Point-Set Topology. Donald L. Stancl, Mildred 
L. Stancl. Pure & Appl. Math., V. 113. Mar- 
cel Dekker, 1987, x + 287 pp, $45. [ISBN: 0-8247- 
7790-5] Gives a basic introduction to point-set topol- 
ogy and metric spaces. Then, covers continuity, se- 
quences and series, sequences of functions, integra- 
tion, and differentiation in the more general topolog- 
ical or metric space setting while applying the results 
to the real line. Well motivated with good selection 
of problems in each section. Good choice for better 
students. BH 


Complex Analysis, T(15-16: 1). Elements of 
Complez Analysis. William J. Adams (Dept. of 
Math., Pace U., NY 10038), 1987, ix + 337 pp, 
$27.50 (P). Rigorous approach to first course in com- 
plex analysis. Begins with background material on 
real numbers, sequences of real numbers and com- 
plex numbers, basic topological facts, and continuous 
functions, then carefully develops theory of complex 
functions. Perhaps too much emphasis on rigorous 
details and too little on the “big picture” and on giv- 
ing examples. May be appropriate for more gifted 
students. BH 


Partial Differential Equations, P. Numerical 
Simulation in Oil Recovery. Ed: Mary F. Wheeler. 
IMA, V. 11. Springer-Verlag, 1988, xi + 283 
pp, $25.95. (ISBN: 0-387-96653-6] Proceedings of 
a December 1986 symposium at the Institute for 
Mathematics and Its Applications, University of 
Minnesota. Seventeen papers address a variety 
of research fields connected with reservoir mod- 
elling, including the modelling of fractures, hetero- 
geneities, viscous fingering, and diffusion-dispersion 
effects. DFA 


Numerical Analysis, P. Lecture Notes in Mathe- 
matics-1287: Approzimation Theory, Tampa. Ed: 
E.B. Saff. Springer-Verlag, 1987, v + 228 pp, $21.20 
(P). [ISBN: 0-387-18500-3] Twelve papers from an in- 
ternational seminar held during 1985-86 at the Insti- 
tute for Constructive Mathematics at the University 
of South Florida. RWN 


Functional Analysis, P. Lecture Notes in Mathe- 
matics-1282: Positive Polynomials, Convez Integral 
Polytopes, and a Random Walk Problem. David E. 
Handelman. Springer-Verlag, 1987, xi + 136 pp, 
$13.10 (P). [ISBN: 0-387-18400-7] Concerned with 
results and interconnections in a number of areas in- 
cluding positive polynomials, a class of special ran- 
dom walk problems on lattices, convex integral poly- 
topes, reflection groups, and commutative algebras. 
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The central problems arise from special actions of 
tori on C*-algebras. Techniques include those of 
functional analysis, ordered rings, commutative al- 
gebra, and convex analysis. CEC 


Analysis, P, L. A Handbook of Fourier Theorems. 
D.C. Champeney. Cambridge U Pr, 1987, xi + 185 
pp, $39.50. [ISBN: 0-521-26503-7] “Intended to as- 
sist those scientists, engineers, and applied mathe- 
maticians who are already familiar with Fourier the- 
ory and its applications in a non-rigorous way, but 
who wish to find out the exact mathematical con- 
ditions under which particular results can be used.” 
Begins with an introduction to ideas and terminol- 
ogy of Lebesque integration and then “gives rigor- 
ous statements of the most important theorems in 
Fourier theory with explanatory comments and ex- 
amples.” No proofs; well written. BH 


Algebraic Geometry, P. Algebraic Groups and 
Class Fields. Jean-Pierre Serre. Grad. Texts in 
Math., V. 117. Springer-Verlag, 1988, ix + 207 pp, 
$36. (ISBN: 0-387-96648-X] A translation of the 1975 
French edition. SG 


Differential Geometry, T(16-17: 2), L. Differ- 
ential Geometry: Manifolds, Curves, and Surfaces. 
Marcel Berger, Bernard Gostiaux. Transl: Silvio 
Levy. Grad. Texts in Math., V. 115. Springer- 
Verlag, 1988, xii + 474 pp, $52. (ISBN: 0-387-96626- 
9] Translation of a French text in differential geom- 
etry in two parts: an introduction to manifolds and 
curves with attention to applications of Stokes’ for- 
mula, non-trivial examples of new notions, and in- 
terplay between geometry and analysis (with exer- 
cises); and a terser “travel guide” to local and global 
properties of surfaces in R?. Presumes “good calcu- 
lus background, including multivariable calculus and 
some knowledge of forms in R”.” RB 


Differential Geometry, P. Stochastic and Inte- 
gral Geometry. Ed: R.V. Ambartzumian. D Reidel 
(US Distr: Kluwer Academic), 1987, 136 pp, $34.50. 
(ISBN: 90-277-2543-8] Seven papers selected from an 
October 1985 symposium held in Armenia. A reprint 
of Acta Applicandae Mathematicae, Vol. 9, Nos. 1-2, 
1987. LAS 


Differential Geometry, P. Global Analysts on Fo- 
liated Spaces. Calvin C. Moore, Claude Schochet. 
Math. Sci. Res. Inst. Pub., V. 9. Springer-Verlag, 
1988, 337 pp, $34. [ISBN: 0-387-96664-1] A foli- 
ated space is an n-dimensional manifold which may 
be partitioned into an (n — p)-dimensional family of 
p-dimensional submanifolds. Foliations arise in the 
study of flows and dynamics, in group representa- 
tions, in automorphic forms, and in groups acting on 
spaces. The book develops a variety of aspects of 
analysis and geometry on foliated spaces, and con- 
tains an exposition of the Connes index theorem for 
foliated spaces. AM 


Differential Geometry, T(17-18: 2), P, L. Dif- 
ferential Geometry. Tanjiro Okubo. Pure & Appl. 
Math., V. 112. Dekker, 1987, xviii + 788 pp, $125. 
[ISBN: 0-8247-7700-X] Intended as an introduction 
and reference for local and global differential geom- 


1988] 


etry. Written for graduate students enrolled in dif- 
ferential geometry courses as well as a more general 
audience which includes geometers, topologists, and 
mathematical physicists. Among other topics, the 
book treats differentiable, complex, and Riemannian 
manifolds, connections, Lie groups, de Rham coho- 
mology, and characteristic classes. Does contain ex- 
ercises. AM 


Differential Geometry, P. Differential Geometry: 
The Interface between Pure and Applied Mathemat- 
tcs. Ed: Mladen Luksic, Clyde Martin, William 
Shadwick. Contemp. Math., V. 68. AMS, 1987, ix + 
273 pp, $29 (P). (ISBN: 0-8218-5075-X] Proceedings 
of a conference held in San Antonio, Texas, April 23- 
25, 1986. The book contains sixteen papers covering, 
among other topics, numerical analysis, mechanics, 
and control theory. AM 


Geometry, S*(14), P, L. Groups: A Path to Ge- 
ometry. R.P. Burn. Cambridge U Pr, 1987, xii + 
242 pp, $16.95 (P). [ISBN: 0-521-34793-9] A paper- 
back edition (with corrections) of the 1985 text (TR, 
June-July 1987). INC 


Geometry, S(14-16), P, L. The Geometrical Work 
of Girard Desargues. J.V. Field, J.J. Gray. Springer- 
Verlag, 1987, x + 237 pp, $68. (ISBN: 0-387-96403- 
7] Primarily devoted to a translation of Desargues’ 
Rough Draft on Conics, although the authors also 
provide details of the Greek legacy upon which this 
work is built. The three-and-one-half century-old 
work is rendered eminently readable by authors who 
have used modern notation, given a glossary of De- 
sargues’ “botanical vocabulary,” and provided a run- 
ning commentary through elaborate footnotes. They 
have also introduced their own illustrations, for De- 
sargues’ appear to have been lost. Translations are 
presented of the lesser works on perspective and the 
sun dial, and a chapter is devoted to Desargues’ work 
on applied geometry. SS 


Geometry, S*(16-17), P, L*. Lectures in Ge- 
ometry, Semester V: Lie Groups and Lie Algebras. 
M. Postnikov. Transl: Vladimir Shokurov. MIR (US 
Distr: Imported Pub), 1986, 437 pp, $10.95. [ISBN: 
0-8285-3296-6] Twenty-one lectures for an elective 
university course by a great Russian mathematician, 
with orally-omitted arguments filled in. Basic no- 
tions and examples; “local theory” of Lie groups; ex- 
tending local to global theory; subgroups quotient 
groups (G2, Fs presented); proof of Ado’s theorem. 
Inexpensive yet valuable: a compact, accessible com- 
pilation of the major results. RB 


Algebraic Topology, P. Homotopy Theory. Ed: 
E. Rees, J.D.S. Jones. London Math. Soc. Lect. 
Note Ser., V. 117. Cambridge U Pr, 1987, vii + 
247 pp, $29.95 (P). (ISBN: 0-521-33946-4] Proceed- 
ings of a symposium at Durham University, August 
1985, featuring expository papers on recent advances 
in higher homotopy theary. Ten papers: Burnside 
ring conjecture; stable splittings; localization and pe- 
riodicity; global methods, K-theory and homotopy 
theory; computational complexity of rational homo- 
topy; unstable homotopy since 1950; and symbolic 


TELEGRAPHIC REVIEWS 


681 


calculus. RB 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1286: Algebraic Topology. Ed: H.R. Miller, 
D.C. Ravenel. Springer-Verlag, 1987, vii + 341 pp, 
$30.30 (P). [ISBN: 0-387-18481-3] Proceedings of a 
University of Washington workshop during 1984-85 
school year in which a total of 29 topologists par- 
ticipated at various times in four topic courses and a 
very active seminar. One set of course notes (by F.R. 
Cohen, on aspects of classical homotopy theory), and 
eleven research papers are included (Kervaire invari- 
ant, loop spaces of spheres, stability, Adams spectral 
sequences). RB 


Topology, P. Lecture Notes in Mathematics-1283: 
Geometric Topology and Shape Theory. Ed: S. 
MardeSi¢é, J. Segal. Springer-Verlag, 1987, 261 pp, 
$25.80 (P). [ISBN: 0-387-18443-0] Proceedings of the 
Postgraduate School and Conference on the sub- 
ject held at the University of Zagreb, Yugoslavia, 
October 1986. Twenty research papers on topics 
including decomposition theory, cell-like mappings, 
infinite-dimensional spaces, shape fibrations, fibered 
shape, LC"™-compacta, generalized manifolds, em- 
bedding continua into manifolds, complement the- 
orems in shape theory, shape-theoretic methods in 
group theory, exact homologies, strong shape the- 
ory. RB 


Topology, S(16-18), P. On Knots. Louis H. Kauff- 
man. Annals of Math. Stud., No. 115. Princeton U 
Pr, 1987, xv + 480 pp, $50; $18.95 (P). [ISBN: 0-691- 
08434-3; 0-691-08435-1] Expanded version of notes 
from a seminar held at the University of Zaragoza, 
Spain, Winter 1984. Elementary concepts of diagram 
moves, linking numbers; knot cobordism and the Arf 
invariant; geometric knot theory with covering spaces 
and branched covers; topology of singularities and 
Brieskorn varieties; collection of miscellaneous top- 
ics compiled throughout the course. RB 


Control Theory, T(18), P. Game- Theoretical 
Control Problems. N.N. Krasnovskii, A.I. Subbotin. 
Transl: Samuel Kotz. Springer-Verlag, 1987, xi + 
517 pp, $125. [ISBN: 0-387-96389-8] Translated, 
substantially-revised version of the authors’ Russian 
monograph Positional Differential Games (1974). 
“[A] formalization of differential games is presented, 
existence theorems for equilibrium are proved, ..., 
attention is devoted to the construction of optimal 
strategies, ..., stability of proposed solutions ....” 
RB 


Probability, P. Multivariate Empirical Processes. 
J.H.J. Einmahl. CWI Tract, V. 32. Math Centrum, 
1987, iv + 99 pp, Dfl. 14.10 (P). [ISBN: 90-6196-312- 
5] A research tract dealing with a classical approach 
to weighted empirical processes based on indepen- 
dent, identically distributed random variables. In- 
cludes a discussion of weak convergence and strong 
limit theorems. Extensive bibliography. TAV 


Elementary Statistics, T**(7-12), S**. Quan- 
titative Literacy Series. Dale Seymour Pub (POB 
10888, Palo Alto, CA 94303). Ezploring Data. 
James M. Landwehr, Ann E. Watkins. 1986, 161 pp, 
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$9.95 (P), [ISBN: 0-86651-321-3]; Ezploring Proba- 
bility. Claire M. Newman, Thomas E. Obremski, 
Richard L. Scheaffer. 1987, 63 pp, $7.95 (P), [ISBN: 
0-86651-333-7]; The Art and Techniques of Simula- 
tion. Mrudulla Gnanadesikan, Richard L. Scheaf- 
fer, Jim Swift. 1987, 53 pp, $7.95 (P), [ISBN: 
0-86651-336-1]; Exploring Surveys and Information 
from Samples. James M. Landwehr, Jim Swift, Ann 
E. Watkins. 1987, 96 pp, $8.95 (P). [ISBN: 0-86651- 
339-6] Four volumes of innovative workbooks devel- 
oped by ASA and NCTM to introduce statistical 
methods into mathematics courses in grades 7-12. 
Ezploring Data covers stem-and-leaf plots, box plots, 
scatterplots, smoothing, and other visual representa- 
tions of data; examples fit the world view of young 
teenagers. LEzploring Probability uses experiments 
(e.g., tree diagrams, guessing numbers, throwing 
darts to estimate areas) to develop intuition about 
chance leading to simple examples of complemen- 
tary and compound events. Simulation introduces 
computer simulation of coin tosses and random num- 
bers, then follows with mini-projects involving a wide 
variety of waiting-time applications. Ezploring Sur- 
veys, intended for high school students, explores sam- 
pling, confidence intervals, capture-recapture meth- 
ods, and other techniques in the context of extensive 
case studies and student projects. Steep discounts 
available for class purchases; teacher’s edition, soft- 
ware, and videotape supplement also available. LAS 


Computational Statistics, T(14-17: 1), S. A 
Guide to SPSS/PC+. Neil Frude. Springer-Verlag, 
1987, xvi + 279 pp, $27 (P). (ISBN: 0-387-91312-2] 
Tutorial introduction to the Statistical Package for 
the Social Sciences (SPSS) as adapted for use on IBM 
PC/XT or IBM PC/AT or equivalent personal com- 
puters. Focuses on data collected in a questionnaire 
survey, following the process through to the produc- 
tion of a final report. Prepares the reader to cope 
with the more technical SPSS/PC+ Manual. RSK 


Statistics, P. Cancer Modeling. Ed: James R. 
Thompson, Barry W. Brown. Stat.: Textbooks & 
Mono., V. 83. Marcel Dekker, 1987, xii + 422 pp, 
$89.75. [ISBN: 0-8247-7773-5] A series of ten arti- 
cles in which fifteen leading experts elucidate meth- 
ods to describe the genises, progression, results, and 
the treatment of cancer. Methodologies that range 
from purely descriptive to highly sophisticated are 
demonstrated. Includes suggestions for the exten- 
sion of current methods. CEC 


Statistics, P. Coordinate-Free Multivariable Statis- 
tics: An Illustrated Progression from Halmos to 
Gauss and Bayes. Mervyn Stone. Stat. Sci. Ser., V. 
2. Clarendon Pr, 1987, xiv + 120 pp, $29.95. [ISBN: 
0-19-852210-X] Uses a coordinate-free geometric ap- 
proach to provide a theoretical framework for linear 
and affine multivariate statistical methods. Relies 
heavily on linear algebra, particularly dual vector 
spaces, aided by a novel way to represent these con- 
cepts pictorially. RSK 


Statistics, T(16-17: 1, 2), P, L. Linear Models for 
Unbalanced Data. Shayle R. Searle. Prob. & Math. 
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Stat. Wiley, 1987, xxiv + 536 pp, $49.95. [ISBN: 0- 
471-84096-3] First half presents linear model method- 
ology without using matrix algebra for data having 
unequal numbers of observations in the subclasses. 
Second half uses matrices to extend these concepts. 
Deals only with the fixed effects model, except for 
the last chapter which gives a summary review of the 
mixed model. Includes comments on computer out- 
put from five of the major statistical packages. RSK 


Statistics, P. Topics in Multivariate Approzima- 
tion. Ed: C.K. Chui, L.L. Schumaker, F.I. Utreras. 
Academic Pr, 1987, x + 335 pp, $39.95. [ISBN: 0- 
12-174585-6] Twenty-one survey papers delivered at 
a December 1986 workshop at the University of Chile 
in Santiago. Topics include multivariate splines, fit- 
ting of scattered data, tensor approximation meth- 
ods, multivariate polynomial approximation, numer- 
ical grid generation, finite element methods, con- 
strained interpolation and smoothing. Also includes 
a bibliography of over 1100 entries by L.L. Schumaker 
and R. Franke. DFA 


Statistics, T*(17: 1, 2). Applied Multivariate Sta- 
tistical Analysis, Second Edition. Richard A. John- 
son, Dean W. Wichern. Ser. in Stat. Prentice-Hall, 
1988, xvi + 607 pp, $50.33. [ISBN: 0-13-041146-9] 
Revision of the authors’ 1982 First Edition (TR, De- 
cember 1982). Main changes include the addition of 
a chapter on canonical correlation analysis, and ex- 
pansion of the chapter on factor analysis to include 
some recent material on computer-aided analysis of 
structural equations. RSK 


Languages, P. Lecture Notes in Computer Science- 
285: Semantics of Digital Circutts. Carlos Delgado 
Kloos. Springer-Verlag, 1987, ix + 124 pp, $16.40 
(P). (ISBN: 0-387-18540-2] “The theory of formal se- 
mantics for programming languages was developed 
for precisely defining the function of computer pro- 
grams .... It has now become evident that a similar 
treatment is necessary for digital circuits.” A lan- 
guage for describing concurrent systems is defined 
and applied, with different domains, to various levels 
of VLSI design. RB 


Languages, P. Lecture Notes in Computer Science- 
283: Category Theory and Computer Science. Ed: 
D.H. Pitt, A. Poigné, D.E. Rydeheard. Springer- 
Verlag, 1987, v + 300 pp, $27.30 (P). [ISBN: 0-387- 
18508-9] Proceedings of a conference held at Edin- 
burgh, United Kingdom, September 1987, describ- 
ing links being established between category the- 
ory and computer programming. Sixteen papers; in 
some, category theory relates different approaches to 
a topic (e.g., axiomatic and denotational semantics); 
others introduce category theoretic models (e.g., of 
logic programming, of distributed computing). RB 


Languages, P. Lecture Notes in Computer Science- 
288: MetaSoft Primer. Andrzej Blikle. Springer- 
Verlag, 1987, xiii + 140 pp, $19.10 (P). [ISBN: 0-387- 
18657-3] In denotational semantics, a function which 
associates a value with each syntactic construct in 
a programming language determines the meaning of 
a program. This book covers mathematical frame- 
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work for the author’s simplified denotational seman- 
tics theory, discusses application to an example lan- 
guage (Pascal), and reports on development of an 
automated system MetaSoft for implementing appli- 
cations. RB 


Computer Systems, P. TRON Project 1987: 
Open-Architecture Computer Systems. Ed: Ken 
Sakamura. Springer-Verlag, 1987, xi + 312 pp, 
$59.50. [ISBN: 0-387-70027-7] The editors proposed 
‘The Realtime Operating System Nucleus’ concept 
five years ago. These are proceedings of the third 
symposium of the international multicorporate asso- 
ciation subsequently formed to carry out the project. 
Twenty-three papers report on phases of this effort 
“to bring the dream of computerized society ...into 
reality:” industrial applications (extended to the 
home); workstations; servers, gateways; networks; 
the unifying VLSI chip. RB 


Computer Systems, P. The Amoeba Distributed 
Operating System: Selected Papers 1984-1987. Ed: 
Sape J. Mullender. CWI Tract, V. 41. Math Cen- 
trum, 1987, vi + 309 pp, Dfl. 46.70 (P). [ISBN: 
90-6196-325-7] A collection of previously-published 
articles on Amoeba, which is a joint effort of the 
Center for Mathematics and Computer Science and 
Vrije University, both in Amsterdam. Nineteen pa- 
pers by Tanenbaum, van Renesse, Mullender, and 
others comment on aspects of protection, protocols, 
reliability, file systems, wide area networking, appli- 
cations, experience. RB 


Computer Systems. PasRo: Pascal and C for 
Robots, Second Eztended Revision. C. Blume, W. 
Jakob, J. Favaro. Springer-Verlag, 1987, ix + 239 
pp, $35 (P). [ISBN: 0-387-18093-1] Aspects of this 
robot programming system are new since the First 
Edition (TR, April 1986). Includes the PasRo de- 
scription of that edition (based on Pascal), revised 
and extended, and a new PasRo description based 
on the programming language C. DFA 


Computer Systems, T(13-17: 1), S, P, L. Com- 
puter Communications, Second Edition. Robert 
Cole. Springer-Verlag, 1987, viii + 173 pp, $22 (P). 
[ISBN: 0-387-91306-8] How do computers communi- 
cate? Beginning with transmission in wires and end- 
ing with computer network protocols and end-to-end 
services, this brief text surveys its topic with an eas- 
ily followed mixture of theory, technique, and exam- 
ple. Intended for undergraduates in computer sci- 
ence, advanced computer studies students; suitable 
for computer professionals. RB 


Computer Systems, L, P. Computing and Change 
on Campus. Ed: Sara Kiesler, Lee Sproull. Cam- 
bridge U Pr, 1987, xiii + 255 pp. [ISBN: 0-521-34431- 
X] A series of research papers profiling the impact of 
computing at Carnegie-Mellon University during the 
period 1982-1986, examining attitudes, behavior, ef- 
fect on employees (secretaries, librarians, managers), 
and on students. Little is said about the impact on 
curriculum or on learning. Concluding chapter sum- 
marises present CML policies based on this extraor- 
dinary period of rapid computer growth. LAS 
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Computer Graphics, P, L. Color and the Com- 
puter. Ed: H. John Durrett. Academic Pr, 1987, 
x + 299 pp, $59.50. [ISBN: 0-12-225210-1] Moti- 
vated by experience with inconsistent, incompatable, 
distracting use of color, the editor assembled funda- 
mental information on color, color computer displays 
and human color perception, as a step toward more 
effective use. In addition, eleven application areas 
(including medical images, cockpit instrumentation, 
map making, business graphics) are explored. RB 
Computer Graphics, T(15-18: 1, 2), S, P, L. 
An Introduction to Solid Modeling. Martti Mantyla. 
Princip. of Comput. Sci. Ser., V. 13. Computer 
Science Pr, 1988, xiv + 401 pp, $42.95. [ISBN: 
0-88175-108-1] Solid modeling refers to represent- 
ing and processing geometric information on three- 
dimensional solid objects. Part one presents back- 
ground, problems, models employed in solid model- 
ing. Case studies of algorithms are given. The sec- 
ond part is the The Geometric Workbench, a simple 
solid modeling system based on the boundary mod- 
eling approach. Sample programs in C. Problem sec- 
tions. Appendices on homogeneous coordinates and 
point-set topology. Bibliography, list of programs, 
index. RJA 

Artificial Intelligence, $(16-17), P, L. Artificial 
Intelligence and Tutoring Systems: Computational 
and Cognitive Approaches to the Communication of 
Knowledge. Etienne Wenger. Morgan Kaufmann, 
1987, xxiii + 486 pp, $32.95. [ISBN: 0-934613-26-5] 
Survey and conceptual analysis of intelligent tutoring 
systems (ITS). Focus on the perspective of commu- 
nication of knowledge (warranted belief) in a shift 
from CAI (where emphasis is on programming of ex- 
pert’s decisions) to the programming of knowledge 
necessary for ITS. Underlying models of the student, 
of knowledge representation, and communication are 
explored as a basis for a cognitively-oriented software 
engineering. RM 

Artificial Intelligence, P. Ezpert Judgment and 
Ezpert Systems. Ed: Jeryl L. Mumpower, et al. 
NATO ASI Ser. F, V. 35. Springer-Verlag, 1987, viii 
+ 361 pp, $59.50. (ISBN: 0-387-17986-0] Proceed- 
ings of a NATO advanced research workshop, August 
1986. Researchers in psychology, decision analysis, 
and artificial intelligence gathered to compare the 
ways their disciplines study expert judgment, to eval- 
uate relative strengths and weaknesses, and to pro- 
pose profitable linkages among the approaches. Four 
keynote addresses, fifteen invited presentations. RB 


Artificial Intelligence, P. Machine Intelligence 
and Knowledge Engineering for Robotic Applica- 
tions. Ed: Andrew K.C. Wong, Alan Pugh. NATO 
ASI Ser. F, V. 33. Springer-Verlag, 1987, xiv + 
486 pp, $79. [ISBN: 0-387-17844-9] Proceedings of 
the NATO Advanced Research Workshop held at 
Maratea, Italy, May 12-16, 1986. A collection of 
papers which review recent advances in machine in- 
telligence and knowledge engineering for robotic ap- 
plications. Covers robot vision, knowledge represen- 
tation and image understanding, robot control and 
inference systems, task planning and expert systems. 
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and integrated software/hardware systems. Presen- 
tations include system implementation and indus- 
trial applications issues. PS 

Computer Science, P. Lecture Notes in Computer 
Science-282: Visualization in Programming. Ed: P. 
Gorny, M.J. Tauber. Springer-Verlag, 1987, 210 pp, 
$21.80 (P). [ISBN: 0-387-18507-0] Proceedings of a 
Fifth Workshop on Informatics and Psychology held 
in Scharding, Austria, May 1986. Thirteen papers by 
European and American authors, including a survey 
of visual languages (those programmed in visual im- 
ages and those for producing visual images), several 
psychology research papers (e.g., experiment com- 
paring BOXER visual LOGO with ordinary LOGO; 
cognitive analysis of visual interfaces), interface de- 
sign issues. RB 

Computer Science, S(18), P. Lecture Notes 
in Computer Science-281: Trends, Techniques, and 
Problems in Theoretical Computer Science. Ed: Al- 
ica Kelemenovd, Jozef Kelemen. Springer-Verlag, 
1987, vi + 213 pp, $21.80 (P). [ISBN: 0-387- 
18535-6] Selected contributions to the Fourth In- 
ternational Meeting of Young Computer Scientists, 
Smolenice, Czechoslovakia, October 1986. VLSI al- 
gorithms, problems in formal language theory, var- 
ious aspects of the theory of formal grammars, 
biologically-motivated structures, topics in artificial 
intelligence. DFA 

Applications, P. Mazimum-Entropy and Bayesian 
Spectral Analysis and Estimation Problems. Ed: C. 
Ray Smith, Gary J. Erickson. Fundament. Theor. 
of Physics. D Reidel (US Distr: Kluwer Academic), 
1987, ix + 322 pp, $59. (ISBN: 90-277-2579-9] Pro- 
ceedings of a workshop held at the University of 
Wyoming in August 1983; includes papers apply- 
ing Bayesian methods in quite diverse areas (e.g, 
signal processing, theory of the brain, tomography, 
databases). LAS 

Applications (Electrical Engineering), T(16- 
17: 1), L. The Theory of Electromagnetic Flow- 
Measurement. J.A. Shercliff. Cambridge U Pr, 1987, 
xiii + 146 pp, $14.95 (P). [ISBN: 0-521-33554-X] 
Reprint without changes of the text first published 
in 1962. The theory of electromagnetic flow mea- 
surement concerns the determination of fluid flow 
rates through measurement of the electromagnetic 
flow induced by a fluid moving through a magnetic 
field. AM 

Applications (Management ), T(13: 1-3). Math- 
ematics with Applications in Management and Eco- 
nomics, Sizth Edition. Earl K. Bowen, Gordon D. 
Prichett, John C. Saber. Irwin, 1987, xxiii + 993 pp, 
$37.95. [ISBN: 0-256-03140-1] Accessible to students 
with only one year of secondary school algebra, this 
text includes the prerequisite mathematics required 
for its coverage of linear programming, mathemat- 
ics of finance, simple optimization, and elementary 
probability and statistics. Computer programs in 
Basic and Pascal and the use of computer packages 
such as LINDO and Minitab are included. (Third 
Edition, TR, May 1973; Fourth Edition, TR, Decem- 
ber 1976; Fifth Edition, TR, October 1980.) JNC 
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Applications (Physics), P. A Commentary on 
Thermodynamics. William Alan Day. Tracts in Nat. 
Philo., V. 32. Springer-Verlag, 1988, ix + 96 pp, 
$48. [ISBN: 0-387-96615-3] The objective here is to 
set some topics from thermoelasticity in the context 
of field theories which incorporate the effects of heat 
conduction and inertia. The topics include linear and 
nonlinear thermoelasticity. MU 


Applications (Physics), T(18: 1, 2), P. Path 
Integral Methods in Quantum Field Theory. R.J. 
Rivers. Mono. on Math. Physics. Cambridge U Pr, 
1987, xi + 339 pp, $69.50. [ISBN: 0-521-25979-7] A 
concise graduate-level introduction to analytic func- 
tional methods in quantum field theory. The unusu- 
ally well-written, clear, and engaging text is heavily 
laced with elegant mathematics. MU 


Applications (Physics), $(16-18), P, L. Spinors 
and Space-Time, Volume 1: Two-Spinor Calculus 
and Relativistic Fields. Roger Penrose, Wolfgang 
Rindler. Mono. on Math. Physics. Cambridge U Pr, 
1987, x + 458 pp, $29.95 (P). [ISBN: 0-521-33707- 
0] The two-spinor calculus replaces the usual tensor 
calculus in this formal analysis of four-dimensional 
space-time geometry. The text begins at a relatively 
elementary level and progresses carefully through 
such topics as Einstein- Maxwell equations and Yang- 
Mills fields. MU 


Applications (Physics), T*(16-17: 1), S, P, L*. 
Introduction to Modern Statistical Mechanics. David 
Chandler. Oxford U Pr, 1987, xiv + 274 pp, $21.95 
(P); $39.95. [ISBN: 0-19-504277-8] An excellent in- 
troduction emphasizing major modern topics such as 
Monte Carlo sampling, renormalization groups, and 
the fluctuation-dissipation theorem. Exercise 2.18 is 
a must: “Put some water in a bowl. Sprinkle pepper 
on its surface. Touch a bar of soap to the surface 
in the middle of the bowl. What happened? Touch 
the surface with the soap again, and again. What 
happened and why? (Hint: Consider the formula for 
the Gibbs adsorption isotherm.)” BC 
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MATH TEXTS FROM SAUNDERS! 


Coming Soon! 


COLLEGE ALGEBRA 
TRIGONOMETRY 
ALGEBRA AND 
TRIGONOMETRY 


All by Stanley I. Grossman, 
University of Montana 


For your college algebra and trigonometry courses, 
this series of three texts stresses real-life applica- 
tions, exercises and lively historical anecdotes to 
help your students more readily absorb and truly 
enjoy the material. 
COLLEGE ALGEBRA 
TRIGONOMETRY 


ALGEBRA AND 
TRIGONOMETRY 


ISBN 0-03-007089-9 480pp. hardcover 
ISBN 0-03-007103-8 384 pp. hardcover 


ISBN 0-03-007129-1 512pp. hardcover 


FINITE MATHEMATICS 
WITH APPLICATIONS 
CALCULUS WITH 
APPLICATIONS 

FINITE MATHEMATICS 
WITH CALCULUS: AN 
APPLIED APPROACH 


All by Raymond F. Coughlin and 
David E. Zitarelli, both of Temple University 


Each text in this series emphasizes the application 
of concepts through the use of case studies in each 
chapter. Each case study is accompanied by exam- 
ples and exercises related directly to that case. Chap- 
ter summaries, worked examples and graded 
problem sections aid in students’ understanding of 
the material. 


FINITE MATHEMATICS ISBN 0-03-011292-3 432pp. hardcover 
CALCULUS WITH APPS. ISBN 0-03-011283-4 640pp. hardcover 
FINITE MATHEMATICS 

WTIH CALCULUS ISBN 0-03-011274-5 800pp. hardcover 


FUNDAMENTALS OF MATH 
ELEMENTARY ALGEBRA 
INTERMEDIATE ALGEBRA 


All by Denny Burzynski and 

Wade Ellis, both of West Valley College 

Used separately or in sequence, this new series of 
texts will take your students from fundamentals of 
mathematics to basic algebra to an intensive study 
of intermediate algebraic topics. Each text uses 
“Sample Sets” that show students how to work 
problems and “Practice Sets” to develop students’ 
problem-solving abilities. 

FUNDAMENTALS OF 

MATH ISBN 0-03-063901-8 550pp. paper 
ELEMENTARY ALGEBRA _ ISBN 0-03-063906-9 608 pp. paper 


INTERMEDIATE 


ALGEBRA ISBN 0-03-063903-4 640 pp. paper 


Available October 1988! 


INTRODUCTION 
TO TOPOLOGY 


Frederick H. Croom, University of the South 
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Special Functions and the Bieberbach Conjecture 


NICHOLAS D. KAZARINOFF, SUNY at Buffalo 


NICHOLAS D. KAZARINOFF: I received my Ph.D. for a thesis on special 
functions under the direction of the late R. E. Langer in 1954 at the 
University of Wisconsin. I have taught at Purdue University, the University 
of Michigan and the State University of New York at Buffalo. My current 
interests include fluid dynamics, dynamical systems, iteration of real and 
complex valued maps, and the geometry of theta series. 


In [6] J. Korevaar showed how the inequality 
sFy(-n®* t+k+1k+hyn®+kt+1;k + 3,2k + 1;e7') > 0, 
(t>0,n* >2,k =1,2,...,n*-1), (1) 
which is implied by the result 
F(x,a,n) =3,F,(-n,nta+2,4(at+1);a+1;4(a + 3);4(1 — x))>0 
(-l1<x<l1l,n=0,1,2,....a > —2) (2) 
of R. Askey and G. Gasper [1], is used by L. de Branges in his proof [4] that the 
celebrated Bieberbach conjecture is correct. My goal in this article is to give a 
largely self-contained exposition of Askey and Gasper’s proof of (2), accessible to 
all mathematicians and graduate students who are interested in learning the crucial 
role that special functions played in the proof of Bieberbach’s conjecture. In their 
paper [1] Askey and Gasper actually prove much more than (2). Here I prove less; it 
suffices to prove (2) for a > —1. 


The proof of (2) I give in section 4 below for a > —1 reduces to three essential 
ingredients. The first one of these is the identity 


3F,(2a, 2b, at+b;2a+2b,a+b+ 34; x) = |F,(a, b;a+b+ 4; x)’, (3) 


proved in 1828 by Th. Clausen in Altona (now a suburb of Hamburg, F.R.G.) in a 
short paper in Crelle’s Journal [2]. The second is an identity for the Gegenbauer (or 
ultraspherical) polynomials found in L. K. Hua’s book [5, p. 141]; namely, for 
y>A>O0 


[n/2] 
C(x) = » 4G) 9, (X), (4) 
k=0 


where [n/2] is the greatest integer in n/2 and 
(n- 2k +A)T(A)(v-A),P(n+0—-k) ; 
kIT (v)T(n+A—-—k +1) | (>) 


This identity likely can be traced back to Gegenbauer. His polynomials are 


c.(y, A) = 
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conveniently defined by the generating function relationship 
(1—2xt+2?7) = YVC'(x)t”. (6) 
0 


Since, by ({7, p. 141], [8, p. 220]) 
Cr(x) = [(27),/n!],F,(-2, n + 2v;» + 4;3(1 — x)), (7) 


(4)-(5) can be rewritten as 


»F,(—n,n + 2v; 7+ 4;4(1 - x)) 
[n/2] 
= Vi d,(v,A).F(27 — n,n - 27+ 2A;4 445401 -~x)), (8) 
j=0 


where 
dj(v, A) = [n!(20) n-2j/(m — 25127) q] 6): (9) 


The third ingredient, implicit in Askey and Gasper’s proof, is a transformation 
T,, that acts as follows: 


F(a, b,c; d,e;t) =T, .|,F(a,b;d;t)]  (e>e>0). (10) 


With a = 2k, n=n* —k—1, and x = 1 — 2e~', (2) becomes (1). The condi- 
tions on ¢t, k and n®* for the inequality (1) to hold are clearly fulfilled if —1 < x < 1, 
a > —1, and n is a nonnegative integer. I shall define 7, , and prove (2) in section 
(4) after I prove Clausen’s identity (3) in section (2) and Gegenbauer’s identity 
(4)-(5) in section 3. Upon completing this Introduction, I invite the reader to skip 
sections (2) and (3) at first reading and go directly to section (4) to see how the 
ingredients described above are simply combined to establish the inequality (2). 

For those unfamiliar with the special functions used above and later I close this 
Introduction with a presentation of the leading characters. First, the generalized 
factorial (a), =a(a+1)---(a+n-—1) for n=1,2,..., and (a),=1. The 
Gaussian hypergeometric function , F,(a, b; d; x) appearing above is the solution of 
Gauss’s hypergeometric differential equation 


x(1—x)y” + [d-(a+6+1)|xy’ — aby =0 (11) 
with y(0) = 1 and y’(0) = ab/d. Similarly, the generalized hypergeometric function 
3F,(a’, b’, c’; d’, e’; x) is the solution of 

x7(1—x)z” + [(34+a' +b +a’)x?-(1t+c' + e’)x|z” (12) 
+[c’e’ —(lt+a’+b'+d'+a'b' + a'd' + b'd’')x|z' — a’'b'c'z = 0, 
with y(0)=1, y’(0) = a’b’c’/d’e’, and y”(0) =(a’ + 1)\(b' + 1)(c’ +: 1)/(d’ + 


1)(e’ + 1). These functions have the power series representations implied by the 
definition 


pKa (dy,+++5 Gps By, -+ +5 bg3 X) = E Mearo"/|mt Eos} (13) 


I show at the end of the next section that appropriate specializations of the , F,’s in 
(13) satisfy the equations (11) and (12). 
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It follows from (6) (see also [7, p. 139], [8, p. 219]) that 


(v),(2xt — t?)"/n! 


| 
hs 


(l—-2xt+??) = 
0 


n 


X (v),(2x)"e"**/[nt(n — I, 


| 
Ms 


00 [n/2] 
= Yt Ye (HI) na 2x)" 7 LAM — 2k)!]. 
Thus, 
[n/2] 
Ci (x) = D(H) naa)" / LE — 2k)!]. (14) 


The Gegenbauer polynomials are orthogonal over (—1,1) with weight function 
(1 — x*)’"!/2, so that, in particular, any positive integral power of x may be 
expanded in a finite sum [5, p. 141; 7, p. 227; 8, p. 281-283]: 


[p/2] 
(2x)"/pt = Ye (=A + = 2k)Gae()/[ RA) peer} (15) 


I outline a proof of (15) at the beginning of section 3. Actually orthogonality is not 
what makes an expansion like (15) hold, but rather it is that the C’(x) (n = (0,1,...) 
form a simple set. A set of polynomials is simple if there is exactly one polynomial 
of each nonnegative degree in the set. Hence any polynomial P,,(x) of degree N can 
be written as a linear combination of the polynomials in the set { p,(x)}: just 
subtract the multiple c of p, that cancels the leading term in P,,. Then do the same 
with Py — cpy, ete. 


2. Clausen’s proof of (3). Clausen’s identity (3) lies at the root of R. Askey and 
G. Gasper’s proof that the members of a certain family of ,F,’s are positive. The 
proof of (3) given by Clausen in [2] is straightforward. He also proved in [2], [3] that 
(3) is the only case where a[,F,]’ of argument x is a constant multiple of a ,F, with 
the same argument. Thus, one cannot expect to find a simpler proof of (2) than the 
one given in section 4. Clausen’s proof of (3) is as follows. (I omit some tedious 
elementary algebra.) 

Clausen’s basic idea is: Suppose that u and v satisfy a linear second order 
differential equation L[ y] = py” + gy’ + ry = 0 and answer the question “what 
third order equation is satisfied by u*, uv, and v7?” by determining A, B, P, Q, R, 
and S so that with z=u?  (Au+ Bu’)L[u] + 2u(L[u]’ = M[z] = Pz” + Qz” 
+ Rz’ + Sz is an identity. For the case considered by Clausen this procedure is 
changed a little, because the third order equation to be satisfied by u’, uv, and v* is 
to have the particular form of an equation satisfied by a ,F,. Nevertheless, it 
succeeds. 

To carry out Clausen’s idea, multiply both sides of the differential equation (11) 
for y =,F,(a, b; c; x) by x to obtain 


xL[ y] = x2(1 — x)y” + [ex — (a +b + 1)x?] y’ — abxy = 0. (16) 
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Then differentiate (16) to obtain 
(xL[y])’ =x?(1—x)y” + [(c + 2)x — (a+ b+ 4)x?] y” 
+[c— (2a + 2b + ab + 2)x] y’ — aby = 0. (17) 
On the other hand, z =,F,(a’, b’, d’; c’, e’; x) satisfies the differential equation 
(12). Call its left-hand side M[z]. But if z = y7, then 
z= 2 yy’, = 2 yy” + 2(y’)’, and val — 2yy mt 4 6y’y”. (18) 
Replace z, z’, 2’, and z” in (12) by their equivalents in (18). The left-hand side of 
the result is M[y7]. Then consider 
(2Ay + By’) L[ y] + 2y(xL[y]y = M[y’]. (19) 
If y =,F,(a, b; c; x), the left-hand side is zero. Then M[y*] = 0, provided (19) is 
an identity; and we obtain (3). To force (19) to be an identity, think of both sides of 
(19) as being polynomials in y, y’, y”, y’”. These polynomials have no common 
factors. Thus (19) is an identity if the coefficients of corresponding monomials in 
powers of y, y’, y”, y”” are equal. Call the resulting system of equations (S). To 
simplify the algebra used for computing a’, b’, c’, ... . Clausen introduced auxiliary 
variables r=a’+bU+ec,s=a'b’+a'ce+b’c',u=a'b'c’,v =d’+e'’,and w= 
d'e’. In terms of these variables (S) becomes 
34+r=At+b4+4, l+v=At+cH2, 


1+r+s=A(at+b+1)+4Bab+2(a+b+1)+ ab, 


w=c(A+1), u=2ab(A+4+1), 6=B, 2(134+7r)=B(at+b+1), 
2(1 + v) = Be. 
It easily follows that 
B=6, A=2(a+b)-1, r=3(a+b), c=a+b+t}, v= 3c-1, 
w=(2c-—1)c, u=4(a+b)ab, and s=2(a+b) + 4ab. (20) 


One now concludes from the definitions of 7,..., w and (19) that a’, b’ and c’ are 
the zeros 2a, 2b, and a + b of the cubic y? — ry? + sy — u. Further, d’ and e’ are 
the zeros c and 2c — 1 of the quadratic y* — vy + w. Thus (3) is established. 

In terms of the operator 6 = xd/dx the differential equation (11) becomes 
[0.6 +c— 1) —x(6 + a)(@ + b)|w = 0. This suggests that the function w =, F, in 
(13) satisfies 


w=0. (21) 


oT (0+ 6,-1) - x11 (0+ a) 


To see this is the case observe that since 6x* = kx*, 


q fore) 


g[[(0@+6,-1)w= > GHG + b,- DTT (adex"| |e TT. 


i=1 k=1 


| 
Ms 
ls 


Ha)e|/[e - DT Ode} 
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Replace k by k + 1 to get 


q 
6[[(6+ 5,-1)w 
i=1 


[Fade] fe Tle 


. sts sus 


tM + Tax] () 


“fe + a,)w. 


This shows that the ,F, in (13) satisfies (21) and that its specializations for 
(p,q) = (3, 2) and (2,1) satisfy (12) and (11), respectively. I learned this derivation 
from my, now deceased, professor E. D. Rainville; see [8, Chapter 5]. 


3. Two proofs of the identity (4)-(5). First consider (15). To establish this 
identity one may either use the orthogonality of the Gegenbauer polynomials C/(x) 
with weight function (1 — x”)’~1/* or proceed as follows. On one hand, it follows 
easily from the generating function relationship (6) that 


dC?(x)/dx = 2vC?7*}(x). (22) 
If we use this relation in the expansion 
. [n/2] 
(2x) Jn! = » ay ,(V)Ci_o4(x), (23) 
k=0 
we get 
Ay n(? +1)= Vax n+i(?). (24) 


This relation implies that if a, ,(1) were known, then we could compute the values 
of a, ,(/) for all positive integers j. Since a, ,(v) must be a rational function of », 
we would then know a, ,(v) for all v > 0. 

On the other hand, since C}(cos 6) = sin[(n + 1)@]/sin @ and the orthogonality 
of the set {sin(k@)} on (—2/2, 7/2) is known from the elementary calculus, we 
have 


[G&G — x2)? dx = fr sin[(n + 1)6]sin[(m + 1)6] a0 


= (7/2)58 n,m? 


and 
J [@x)"/ni] Chae) — x?) dx 
= f° [2s 9)" /n!|sin[(n — 2k + 1)6]sin 6d, 
—17/2 


= (7/2)(n — 2k +1)/[k'\(n —k 4+ 1)!]. 
Thus, a, ,(1) =(n-—k+1)/[k\(n —k + 1)!], and the evaluation a, ,(v) = 
(n — 2k + v)/{k\(v),, 441] of the coefficients in (23) follows, which gives (15). 
Both Hua’s proof of (4)—(5) and the alternative one that follows it take advantage 
of (15). Using (15) in (14), we find that the coefficients c, in (4) are given by (their 
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dependence on (», A) is suppressed) 


[n/2]—k 
c, = [(=1)*() n-1/k!] yi (A +n —2k — 2i)/[iMA)n—2p-iaal, 
= ne OLETC +n—-2p- 2i)/| pLiMA) n—2n-i41l, 
T(r) /TA)] . (25) 
= |(A +n —- 2k)/k!| E (-1"(5}PC +n—p)/T(A+n-pt+k+1). 


Let Ag(x) = g(x + 1) — g(x). To obtain (5) from (25) Hua uses the iterated finite 
difference formula 


A‘| g(a + x)/g(b+x)] = ¥ (-1)"(p) ae +xt+k—p)/g(b+x+k—p), 


with x =n, a= v-—k and b=A — 2k + 1. Then the sum in (25) becomes 


y (-9"(K)(ro + n—p)/V(A+n—-p-—k+1)| 
pe (26) 


=A |T(vtn—k)/V(At+n-2k +1). 
The functional equation for the gamma function, xI'(x) = I(x + 1), and some 
algebra yield the identity 
A[T(a+n)/T(b+n)| =(a-—b)l(atn)/T(b+n+ 1). 
This result and the functional equation for I’ imply in turn that 
A7[T(a+n)/T(b+n)| (27) 
=T(a-—b+1)l(atn)/T(a-b-—g+1)T(bt+n+4q). 
Finally, if we use (27) with g =k, a=v—k and b=A — 2k +1, we conclude 
that the right-hand side of (27) becomes 
T(iv-A+k)T(vtn—-—k)/Tv-A)T(nt+A-—k +1), 


which implies (5). 

I now give a second proof of (4)—(5) modeled on arguments of Rainville [8, p. 
179-181]. It is short and sweet. However, it does depend upon the formula 
,F\(—n, a; c;1) = (c — a),/(c),, [8, p. 69], which I do not derive here. Using it, 
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one may rewrite the sum in (26) as 


DL Uk) ele + —p)/[p(A+n-p-—k+1)| 


=[T(vtn)/TAAtn-—k+1)] 
x E(k) p(k m2) /[ pt ~ 0 ~ ») 


=|[T(vtn)/T(Atn—k+4+1)|,F(-k,k -A-—n;1-—v-—n;1); 
=|T(ivtn)/TAt¢n-k+)D]Q+A-k-»7),/1+vr-n),;, 
which reduces to [(v -A+k)Tv+n—k)/T(vo-A)T(n+A—k 4+ 1). 
4. Askey and Gasper’s proof of (3). Askey and Gasper actually proved that 
S(x,a,n)= )Y) P§»%(x) > 0 
k=0 


for -—1 <x <1 and a> —2 in [1, Theorem 3], where the P{**)(x) are Jacobi 
polynomials and the sum S(x, a, n) is a constant multiple of F(x, a, n). However, 
for our purposes it suffices to establish the inequality (2), for a > —1 and 
—1 < x < 1, which we proceed to do. 

The operator T, ., implicitly used by Askey and Gasper, as applied to ¢” is 


Tt = {T(e)f-*/T(e)T(e — ¢)} f(e-y) Sty dy. (28) 
0 
If we let y = ts in (28), note that for the beta function 
B(a,B) = ['(1 —5)*'s®1 ds = F(a)T(B)/T(a + B) 
0 


and use the obvious consequence [(a + n)/I'(a) = (a), of the functional equation 
for the gamma function, then we find that 


Tt = (c) gt*/(e)k; 
hence, (10) holds. 

We now prove (2). First, observe, that (2) is not trivial; (3) cannot be directly 
applied to the ,F, in (2) to give the desired inequality. However if we use the result 
(8)—(9) and the transformation T, , with properly chosen e, and c, we can apply (3) 
to obtain (2). By (2) and (10), 


F(x, 4,2) = Tyr, aya+1) 2h(-n,-nt+at2a+ 4; t), (29) 


where ¢ = 3(1 — x). We apply (8) to (29) with » = 5(a + 2); and A = 5(a + 1). 
The result is 
[n/2] 


F(x, Qa, n) = Ty 41,0/2(a41) » d,(4(a + 2), 5(a + 1)) 
j=0 


X,F(2j-—n,n-2j+at1;4(a+t 2); 2); 
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or, since we may reverse the order of Ty 4.1 (¢+1)/2 and the summation 
[n/2] 
F(x,a,n)= > Ty 41,0./2(0+1) »F, (2) —n,n-2j+at+1;4+(a+ 2); t), 
j=0 


(30) 


where we have suppressed the dependence of the d , on a. We next evaluate 
Ty 4+1,(a+1)/2 acting on the ,F,’s in (30) using (10). The result is: 
[n/2] 
F(x,a,n)= )) d; 3F(2j—n,n-2jtat+1,3(at1);a+1,4(a + 2); 2). 
j=0 
But these ,F,’s now precisely fit (3) with 2a =2j;-—n and2b=n—-—2j+a+1. 
This comes about since we had a free choice of A in (0, v) in (8)! Thus using (3) we 
find that 


[n/2] 
F(x,a,n)= >> d,[.F(4(2/ —n),3(n-—2j+a+1);4(a+4 2); 1)’, (31) 
j=0 


By (5) and (9), the coefficients d AGC: + 2),5(a@ + 1)) in (31) are all positive for 

a > —1. At least one of the ,F;’s in (31) is always positive. To see this we observe 

that if n is even, then the ,F, in (31) corresponding to 7 = [n/2] is (recall that ¢ 
1 

= ,(1 — x)) 


»F,(0, 4(a + 1); 4(a + 2);4(1 —x)) =1>0; 
and if n is odd, then the ,F, in (26) corresponding to j = [n/2] is 
»F,(—4,4(a + 2); $(@ + 2); 4(1 — x)) = [1 + x)/2]' > 0 
for —1 < x < 1. This completes the proof of (2). 
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etc. ein Quadrat von der Form z = 1 + 


The Strong Law of Small Numbers 


RICHARD K. Guy 
Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada T2N 1N4 


This article is in two parts, the first of which is a do-it-yourself operation, in 
which ll show you 35 examples of patterns that seem to appear when we look at 
several small values of n, in various problems whose answers depend on n. The 
question will be, in each case: do you think that the pattern persists for all n, or do 
you believe that it is a figment of the smallness of the values of n that are worked 
out in the examples? 

Caution: examples of both kinds appear; they are not all figments! 

In the second part I'll give you the answers, insofar as I know them, together with 
references. 

Try keeping a scorecard: for each example, enter your opinion as to whether the 
observed pattern is known to continue, known not to continue, or not known at all. 

This first part contains no information; rather it contains a good deal of 
disinformation. The first part contains one theorem: 


You can’t tell by looking. 


It has wide application, outside mathematics as well as within. It will be proved by 
intimidation. 
Here are some well-known examples to get you started. 


Example 1. The numbers 2? + 1 = 3, 27 +1=5, 27 +1=117, 2?) +1 = 257, 
27" + 1 = 65537, are primes. 


Example 2. The number 2” — 1 can’t be prime unless 7 is prime, but 2? — 1 = 3, 
27? —1=7, 2° —1 = 31, 2’ — 1 = 127, are primes. 


Example 3. Apart from 2, the oddest prime, all primes are either of shape 4k — 1, 
or of shape 4k + 1. In any interval [1, ”], the former are at least as numerous as the 
latter (4k — 1 wins the “prime number race’’): 


VISA WILLIE 


Example 4. Pick several numbers at random (it suffices just to look at odd ones). 
Estimate the probability that a number has more divisors of shape 4k — 1, than it 
does of shape 4k + 1. For example, 21 has two of the first kind (3 & 7) and two of 
the second (1 & 21), while 25 has all three (1, 5, 25) of the second kind. 


Example 5. The five circles of Fic. 1 have n = 1,2,3,4,5 points on them. These 


points are in general position, in the sense that no three of the (7) chords joining 
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them are concurrent. Count the numbers of regions into which the chords partition 


each circle. 


Fic. 1. How many regions in each of these circles? 


I’ve been trying to formulate the Strong Law of Small Numbers for many years 
[9]. The best I can do so far is 


There aren’t enough 


small numbers to meet the 
many demands made of them. 


It is the enemy of mathematical discovery. When you notice a mathematical pattern, 
how do you know it’s for real? 


Superficial similarities 
spawn spurious statements. 


Capricious coincidences 
cause careless conjectures. 


On the other hand, the Strong Law often works the other way: 


Early exceptions 
eclipse eventual essentials. 


Initial irregularities 
inhibit incisive intuition. 
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Here are some misleading facts about small numbers: 
Ten per cent of the first hundred numbers are perfect squares. 
A quarter of the numbers less than 100 are primes. 
Except for 6, all numbers less than 10 are prime powers. 
Half the numbers less than 10 are Fibonacci numbers 


0,1,1,2,3,5,8,... 


and alternate Fibonacci numbers, 1,2,5,... are both Bell numbers and Catalan 
numbers. 


Example 6. The numbers 31, 331, 3331, 33331, 333331, 3333331, are each prime. 


Example 7. The alternating sums of factorials, 

3!—2!+ 1! =5 
44 — 3!+ 2!-—1!=19 
5! — 4! + 3! — 2! + 1! = 101 
6! — 5! + 4! — 3! 4+ 2! — 1! = 619 
7! — 6! + 5! — 41 4+ 3! — 2!4 1! = 4421 
8! — 7! + 6! — St 4+ 4! — 3! 4 2! — 1! = 35899 
are each prime. 


Example 8. In the table 


row 1 1 1 2 
row 2 1 2 1 3 
row 3 1 3 2 3 1 5 
row 4 1 4 3 2 3 4 1 7 
row 5 1 5 4 3 5 2 5 3 4 5 1 11 
row 6 1 65 4 3 5 2 5 3 4 56 1 13 
row 7 1 765 473 5 72 = 475 3 7 4 5 67 #1 = 19 
row8 1 8 765 47 3 85 7 2 75 8 3 7 4 +5 67 8 1 23 


row9 19 8 76 5 9 47 3 85 7 929 75 8 3 749 5 67 89 129 


row n is obtained from row n — 1 by inserting n between each pair of consecutive 
numbers which add to n. The number of numbers in each row is shown on the right. 
Each is prime. 


Example 9. Is there a prime of shape 7013 x 2” + 1? 
Example 10. Are all the numbers 78557 X 2” + 1 composite? 


Example 11. When you use Euclid’s method to show that there are unboundedly 
many primes: 


2+1=3 
(2x 3)+1=7 
(2x 3x5)+1=31 
(2X*3x5x7)+1=211 
(2x3xX5xX7X11)+1= 2311 
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you don’t always get primes: 


(2X3xX5X7X11 X13) + 1 = 30031 = 59 x 509 
(2X 3x*5xX7xX11 x 13 x 17) + 1 = 510511 = 19 x 97 X 277 
(2x*3x5xX7xX11 X13 x17 x 19) + 1 = 9699691 = 347 x 27953 


but if you go to the next prime, its difference from the product is always a prime: 
5-2=3 
11 — (2x 3)=5 
37 -(2X 3X5) =7 
223 —-(2X3xX5X7)=13 
2333 —- (2X 3X5 X7xX 11) = 23 
30047 - (2X 3X 57X11 xX 13) =17 
510529 —- (2 3X 5X 7X11 X13 xX 17) =19 
9699713 —- (2X 3X 5X 7X11 xX 13 x 17 X 19) = 23 


Example 12. From the sequence of primes, form the first differences, then the 
absolute values of the second, third, fourth,... differences: 


235 7 11 13 17 19 23 29 31 37 41 43 47 53 59 61 67 
1224 2 4 2 4 6 2 6 4 2 4 6 6 2 6 
102 2 2 2 2 2 4 4 2 2 2 2 0 4 4 2 


Is the first term in each sequence of differences always 1? 


Example 13. 2” is never congruent to 1 (mod n) for n > 1. 2” is congruent to 2 
(mod n) whenever n is prime, and occasionally when it isn’t (nm = 341, 561,...). Is 
2” ever congruent to 3 (mod n) for n > 1? 


Example 14. The good approximations to 5!/°, namely, the convergents to 
1 1 1 1 1 1 13 4 7 11 29 
+ ———— — — —.... are -, ~, 5,5, —>,5... 
2+4+1+14+14+24+ °°". 1°2°3'5’ 821 


which have Fibonacci numbers for denominators and Lucas numbers for numera- 
tors. 
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Example 15. 

(x+y) =x + y? + 3xy(x + y)(x?2 + xy + y?) 

(x+y) =x + p> + S5xy(x + y)(x? + xy + y?) 


(x + y)! =x’ + y! + Txy(x t+ y)(x* + xy + y?) 


Example 16. The sequence of hex numbers (so named to distinguish them from the 
hexagonal numbers, n(2n — 1)) are depicted in FIG. 2. 
The partial sums of this sequence, 1, 8, 27, 64,125, appear to be perfect cubes. 


Fic. 2. The hex numbers. 


Example 17. Write down the positive integers, delete every second, and form the 
partial sums of those remaining: 


12 3 €£# 5 £6 7 B® 9 YW 11 
1 4 9 16 25 36 


Example 18. As before, but delete every third, then delete every second partial sum: 


12245 67 8 £§ 10 11 YW 13 14W 16 
1% FY 19 4 37 @ 61 B 9] 
1 8 27 64 125 216 


Example 19. Again, but delete every fourth, then every third partial sum, then every 
second of their partial sums: 


123 "5 6 7 &® 9 +10 11 Ww 13 #14 «15 Ww 17 
3 6 11 17 24 33 43 54 67 81 96 113 


1 
1 £ 15 32 65 108 175 256 369 
1 16 81 256 625 
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Example 20. Again, but circle the first number of the sequence, delete the second 
after that, the third after that, and so on. Form the partial sums and repeat: 


Q224567 89W 11 12 13144 16 17 18 19 207 
@ 6YU 18 6% 46 58 7185 101 118 136155 %5 
© 24 56 96 154 245 326 444 580 735 


120 274 600 1044 1644 
720 1764 


Example 21. Write down the odd numbers starting with 43. Circle 43, delete one 
number, circle 47, delete two numbers, circle 53, delete three numbers, circle 61, and 
so on. The circled numbers are prime (FIG. 3) 


@) x (@) x *@) XX £ KA) KKH ADA A A 
KX Lt KK KAAS A BBA BAS 
ok Bt ho Bt A () ee ot KM 
B/S /SSSSSSSSLSIALLLS SS 
SY SSLSLALLASASASSLLLLSLLSL 
@M/LSSSSSASSLLSALSSALSS 
SY SLASLASSLLLL LCOBS LS SSS 
SY SSPLLASLSLOBALLASS SST 
SY SSLLASSLASLCA@LLSLLLLS LS 
SP SLLSSLLSCESL LSS SLA SLS 
SY SSLALSLLS SALSA LASAASSLS 
SP SSASSLALASLSLELLS SLT 
SP LSLASLASASLLALSLSSOLS LS 
PSLSLSLLLLLLALAASSASALS SS 
@M/SSSSLSSSSASSASLSSS 
SL SSSSEAMLASLSLSSASLLLLSSS 
SP LSSLLLLLLLIE® 


Fic. 3. Parabolas of primes remain. 
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Example 22. In Table 1 the odd prime values of n4 +1 and of 17 x 2” — 1 are 
printed in bold. They occur simultaneously for n = 2, 4, 6, 16, 20. 


TABLE 1 


17 X 2" - 1 


82 =2x 41 
257 
626 =2 x 
1297 
2402 =2 x 2175 = 3 Xx 
4097 = 17 x 4351 = 19x 
6562 =2 x 8703 = 37 x 
10001 = 73 x 17407 = 137 x 
14642 = 2 x 34815 = 3 x 
20737 = 89 x 69631 = 179 x 
28562 = 2 Xx 139263 = 3 X 
38417 = 41 x 278527 = 223 x 
50626 = 2 x 557055 = 37 x 
65537 1114111 
83522 = 2 Xx 2228223 = 3 x 
104977 = 113 x 4456447 = 59 x 
130322 = 2 x 8912895 = 3 x 
160001 17825791 
194482 = 2 x 35651583 = 3* x 
234257 = 73 X 71303167 = 13 x 
279842 = 2 x 142606335 = 3 x 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


Example 23. In Table 2 the prime values of 21 X 2” — 1 and of 7 xX 4"+ 1 are 
printed in bold. They occur simultaneously for n = 1,2, 3,7, 10, 13. 


TABLE 2 


20 =27 x5 
41 
$3 
167 
335 =5X 1793 = 11 x 
671 = 11 x 7169 = 67 X 
1343 = 17x 28673 = 53 Xx 
2687 114689 
5375 = 5° x 458753 = 79 x 
10751 = 13 x 1835009 = 11 x 
21503 7340033 
43007 = 29 x 29360129 = 37 x 
86015 = 5 Xx 117440513 = 3907 x 
172031 469762049 
344063 = 17 x 1879048193 = 11 x 
688127 = 114 x 7516192769 = 292 x 
1376255 = 5 X 30064771073 = 113 x 
2752511 = 19 x 120259084289 = 379 x 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
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Example 24. Consider the sequence 

Xo= 1, Xqap = (Lt xh tx? 4+--- +x2)/(n +1) (nz 0). 
n 0 1 2 3 4 5 6 7 8 9 
x, Ll 2 3 5 10 28 154 3520 1551880 267593772160 


Is x, always an integer? 


Example 25. The same, but with cubes in place of squares: y) = 1, y,,,; =(1 + y 
+ yp +--+ +y3)/(n + 1) (n > 0). Same question. 


nm QO 1 2 3 4 5 
y Ll 2 5 45 22815 2375152056927 


Example 26. Also for fourth powers, z,,, =(1 + zg +zit+--: +2¢)/(n + 1). 
n QO 1 2 3 4 


Zr lL 2 9 2193 5782218987645 


And for fifth powers, and so on. 


Example 27. The irreducible factors of x” — 1 are cyclotomic polynomials, 1.c., 
x" —1=1[1,4,(%), so that ®)(x) = x — 1, ®,(x) =x + 1, ®3(x) = x?+x 41, 
®,(x) = x* + 1. The cyclotomic polynomial of order n, ®,(x), has degree p(n), 
Euler’s totient function. It is easy to write down ® (x) if n is prime, twice a prime, 
or a power of a prime, and for many other cases. Are the coefficients always +1 or 
0? 


Example 28. If two people play Beans-Don’t-Talk, the typical position is a whole 
number, n, and there are just two options, from n to (3n + 1)/2*, where 2* means 
the highest power of 2 that divides the numerator. The winner is the player who 
moves to 1. For example, 7 is a Aposition, a previous-player-winning position, 
because the opponent must go to 
(3X 74+1)/2=llor(3 xX 7-1)/2?=5 

and 11 and 5 are positions, next-player-winning positions, since they have the 
options (3 X 11 — 1)/2° = 1 and (3 X 54+ 1)/24 = 1. 

If + is the probability that a number is an position, and there are no 
0-positions (from which neither player can force a win), then the probability that a 
number is a # position is 1 — 7. This happens just if both options are positions, 
so 1 — r = 7’, and f is the golden ratio, (V5 — 1)/2 = 0- 618. 

So it is no surprise that 5 out of the first 8 numbers are positions, 8 out of the 
first 13, 13 of the first 21, 21 of the first 34, and 34 of the first 55, since the ratio of 
consecutive Fibonacci numbers tends to the golden ratio. 


Example 29. Does each of the two diophantine equations 
2x?(x* — 1) = 3(y? — 1) and x(x —1)/2 =2”-1 


have just the five positive solutions x = 1, 2, 3, 6, and 91? 
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Example 30. Consider the sequence a, = 1, a,,, = | /2a,(a, + 1) | (n > 1) 
n1234567 8 9 101112 13 14 15 16 17 18 19 20 21 
a, 123469 13 19 27 38 54 77 109 154 218 309 437 618 874 1236 1748 

1 2 4 8 16 32 64 128 256 512 


Are alternate differences, a5, ,, — @5,, the powers of two, 2*? 


Example 31. In the same sequence, are the even ranked members, a,,,,, given by 
2a,, + €,, where e, is the kth digit in the binary expansion of /2 = 
1.01101010000010... ? 


Example 32. Is this the same sequence as a, = 1, a, = 2, a, = 3, a,,, =a, +4,_> 
(n > 3)? 


Example 33. The nth derivative of x*, evaluated at x = 1, is an integer. Is it always 
a multiple of n? Values for n = 1,2,3,... are 
1x1,2x*1,3x1,4 x 2,5 X 2,6 x 9,7 X (—6),8 X 118,9 x (—568), 
10 X 4716,11 xX (—38160),12 x 358126,13 x (— 3662088), 14 x 41073096, 


15 X (—500013528), 16 x 6573808200, 17 x (— 92840971200), 
18 X 1402148010528,.... 


Example 34. In how many ways, c,, can you arrange n pennies in rows, where every 
penny in a row above the first must touch two adjacent pennies in the row below? 


nO123 45 6 7 8 9 10 11 12 13 «14 ~=— «+15 16 
c, 1 1 1 2 35 9 15 26 45 78 135 234 406 704 1222 2120 


To throw more light on such sequences, partition theorists often express their 
generating function 


io @) 
Yoox "=Lt+xt+x24+ 2x? + 3x4 + 5x? + 9x® 4+ 15x’ + --- 


O 00 000 & 
occo Go a 00000 Atco oft0 00D & 


cococo Boco 000 con 000 Pin BR SH Le 


Fic. 4. Propp’s penny partitions. 
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as an infinite product, 


(1 _ x”) —a(n) 


18 


n=l 


In this case, a(n) are consecutive Fibonacci numbers: 
n 12 3 4 5 6 7 8 9 10 
a(n) 1 0 1 1 2 3 5 8 #13 21 


Example 35. If p, is the kth prime, p, = 2, p, = 3,..., does 
00 ' > x PitPrat - PK 
1—x%*)  =1+4+ 2 
LEG") %7-00-) G4 
Answers 


1. No less a person than Fermat was fooled by the Strong Law! Euler gave the 
factorization 2°* + 1 = 641 x 6700417. All other known examples of Fermat num- 
bers are composite; Jeff Young & Duncan Buell [32] have recently shown that 
2?” + 1 is composite. 
2. There are very few Mersenne primes, 2? — 1. No one can prove that there are 
infinitely many; 2'' — 1 = 23 x 89 is not one. See A3 in [12] and sequence 1080 in 
[28]. 
3. In the “prime number race,’ 4k — 1 and 4k +1 alternately take the lead 
infinitely often. This was proved by Littlewood [18]. For many papers on this 
subject see N—12 of Reviews in Number Theory, for example, Chen [4]. 
4. A theorem of Legendre (see [6], for example) states that if D, and D_ are the 
numbers of divisors of n of shapes 4k + 1 and 4k — 1, then the number of 
representations of n as the sum of two squares is 4(D,— D_). So D,> D_ for 
every number! 
5. Before we reveal all, here is a circle (FIG. 5) with ten points to further confuse 
you. It has 256 regions. 

If the circle has n points, there are (") intersections of chords inside the circle, 
since each set of four points gives just one such intersection. The number of vertices 


in the figure is V =n + ("). To find the number of edges, count their ends. There 
are n+ 1 at each of the n points and four at each of the (") intersections, so 
2E =n(n+1)+ 4("). By Euler’s formula, the number of regions inside the circle 


1S 


n 


1 n 
E+1-V=2(])+ sn(nti)+1-(4)—2 


=(%) + n(n -1) 41 


ir ee ee a 


A direct proof, by labelling the regions with at most four of the numbers 1, 2,..., 
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Fic. 5. Circle partitioned into 2° regions. 


n — 1, will appear in [5]. The answer is just five of the n terms in the binomial 
expansion of (1 + 1)""1. For n < 6, this is all the terms, and the number is a power 
of 2. For n = 6, only 1 is missing. For n = 10 just half the terms are missing, and 
the number of regions is 4 - 2? = 256. 


# of point =1234 5 6 7 8 9 10 11 12 13° 14 
# of regions = 1 2 4 8 16 31 57 99 163 256 386 562 794 1093 


Some other famous numbers, e.g. 163 and 1093, also occur in this sequence, number 
427 in [28]. 
6. No member of this sequence is divisible by 2, 3,5, 7,11, 13, or 37, as may be seen 
immediately from well known divisibility tests. On the other hand, 17, 19, 23, 29, 
31,... divide 33...331 just if the number of threes is respectively 16k + 8, 
18k + 11, 22k + 20, 28k + 19, 15k + 1,..., while 41, 43, 53, 67, 71, 73, 79,... 
divide no members of the sequence. I don’t think that there is a simple description 
of which primes do, and which primes don’t, divide. The next member, 33333331, is 
also prime, but 333333331 = 17 x 19607843 
7. We’ve again given ourselves a good start, since £"_,(—1)"~*k! is not divisible by 
any prime < n. However, 

Of — 8! + 7! — 6! + 5! — 4! 4 3! — 2! + 1! = 326981 = 79 x 4139. 
8. This example, as well as example 5., was first shown to me by Leo Moser, a 
quarter of a century ago. Row n is the list of denominators of the Farey series of 
order n, i.e., the set of rational fractions r, 0 < r < 1, whose denominators do not 
exceed n. In getting row n from row n — 1, just y(n) numbers are inserted, where 
y(n) is Euler’s totient function, the number of numbers not exceeding n which are 
prime to n. It is fortuitous that 1 + U7_ (xk) is prime for 1 < n < 9. As (10) = 4, 
the number of numbers in row 10 is 29 + 4 = 33, and is not prime. 
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9. The expression 7013 X 2” + 1 is composite for 0 <n < 24160 [15]. Duncan 
Buell & Jeff Young have sieved out 325 further candidates n < 10° which might 
yield a prime. None is known, though it’s likely that there is one. 

10. The number 78557 X 2” + 1 is always divisible by at least one of 
3,5, 7, 13,19, 37, 73 [26, 27]. For this and the previous example, see also B21 in [12]. 
11. R. F. Fortune conjectured that these differences are always prime: see [8], [9] 
and A2 in [12]. The next few are 37,61, 67, 61, 71, 47, 107, 59, 61, 109, 89, 103, 79. 
There’s a high probability that the conjecture is true, because the difference can’t be 
divisible by any of the first k primes, so the smallest composite candidate for 
P =[Ip, is p;,,, which is approximately (k In k)? in size. The product of the first 
k primes is about e*: to find a counter example we need a gap in the primes near N 
of size at least (In N Inln N)?. Such gaps are believed not to exist, but it’s beyond 
our present means to prove this. 

12. This is N. L. Gilbreath’s conjecture, which has been verified for k < 63419 [16]. 
Hallard Croft has suggested that it has nothing to do with primes as such, but will 
be true for any sequence consisting of 2 and odd numbers, which doesn’t increase 
too fast, or have too large gaps: A10 in [12]. In an 87-08-03 letter, Andy Odlyzko 
reported that he had verified the conjecture for k < 102°. 

13. D. H. & Emma Lehmer discovered that 2” = 3 (mod n) for n = 4700063497, 
but for no smaller n > 1. 

14. The kth Lucas number and the (k + 1)th Fibonacci number are 


145)" 1— 5 \* 1 14 y5\"" 1—y5\"" 
+| 5 ) ana | -| 5 | . 


2 2 


Their ratio, as k gets large, approaches (5 — /5)/2 = 1 - 381966011, whereas 
51/5 = 1 - 379729661. The next few convergents to 5!/°, 


40 109 912 1021 26437 27458 


eee 


do not involve Fibonacci or Lucas numbers. Compare sequences 256 & 260 and 924 
& 925 in [28]. This example goes back to 1866 [25]. 

15. This is quite fortuitous [30]. Put x = y = 1, giving 27"! -2 = (2n + 1)x2 x 
3”~! It’s true that 


27-1=3x 3°, 24#-1=5x3}, 2©-1=7~x 3? 


but it’s clear that the pattern can’t continue. 

16. The (n + 1)th hex number, 1 + 6 + 12 + --- +6n = 3n? + 3n + 1, when ad- 
ded to n°, gives (n + 1)°, so the pattern is genuine. It is instructive to regard the nth 
hex number as comprising the three faces at one corner of a cubic stack of n° unit 
cubes (FIG. 6). 

17, 18, 19, and 20 are examples of Moessner’s process, which does indeed produce 
the square, cubes, fourth powers and factorials. Moessner’s paper [20] is followed by 
a proof by Perron. Subsequent generalizations are due to Paasche [22]: see [19] for a 
more recent exposition. 

21. A thinly disguised arrangement of Euler’s formula, n* + n + 41, which gives 
primes for —40 <n < 39. For n = 40, n*+n+ 41 = 417. See Al and Fig. 1 in 
[12]. For remarkable connexions with quadratic fields, continued fractions, modular 
functions and class numbers, see [29]. 
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Fic. 6. The fifth hex number. 


22. The initial pattern is explained by the facts that if n is odd, n* + 1 is even, and 
17 X 2" — 1 is a multiple of 3. Thereafter it’s largely coincidence until n = 24, for 
which n* + 1 = 331777 is prime, while 17 x 2” — 1 = 285212671 = 149 x 
1914179. See [17], [24] and sequences 386 and 387 in [28]. 
23. This is also a coincidence, until we reach n = 18, for which 21 x 27-1 = 
5505023 is prime, while 

7 X 4" + 1 = 481036337153 = 166609 x 2887217. 
See [31], [23] and sequences 314 & 315 in [28]. 
24. A sequence introduced by Fritz Gdbel. A more convenient recursion for 
calculation is (n + 1)x,,, = x,(X, + ),(n > 1). If you work modulo 43, you'll 
find that for 


n =0123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 
x, =1 235 10 28 25 37 10 20 15 38 19 42 36 34 2 35 39 31 13 2 
n= 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 
x, =6 26 28 29 4 14 42 5 2017 4 20 16 29 42 13 42 20 8 23 33 


and xX4)(X4 + 42) = —10(—10 + 42) = —320, which is not divisible by 43, so x4, 
is not an integer, although x, is an integer for0 <n < 42. 
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25. Similar calculations, mod 89, using the relation (n + 1)y,,, =y,(y2 +n), 
show that y,. is not an integer. For this, and the previous example, see E15 is [12]. 
26. Since this question was asked, Henry Ibstedt has made extensive calculations, 
and found the first noninteger term, x,, in the sequence involving kth powers, to be 


kK 2 3 4 5 6 7 8 9 10 11 
n 43 89 97 214 19 239 37 79 83 239 


He also found corresponding results with different initial values. The longest to hold 

out (n = 610) are the cubes (k = 3, Example 25) with x) = 1, x, = 11. 

27. The first cyclotomic polynomial to display a coefficient other than +1 and 0 is 
Bigs(x) = x48 + x47 + x4 — 3 — 2 — 74 — 0 — 39 4p 86 4 5 4 34 

433 4 32 4 y3l — 428 — x4 _ 722 p24 IT 4 yo 4g YIS 4 14 
+ x33 4 x2 — 79 — x8 — 2x7 — x8 — x t+ x2? 4x41 

Coefficients can be unboundedly large, but require n to contain a large number of 

distinct odd prime factors; see [8]. More recently, Montgomery & Vaughan [33] have 

shown that if ®, = La(m,n)x”™ and L(n) = Inmax, |a(m, n)| then, for m large, 
m/2 mi/2 

Caan SEU") Se 

28. This game was misremembered by John Conway from John Isbell’s game of 

Beanstalk [13]. The Fibonacci pattern is not maintained: only 52 of the first 89 

numbers, 81 of the first 144, 126 of the first 233, and 201 of the first 377, are 

A-positions. The probability argument is fallacious: the probabilities of the status of 

the two options are not independent. 

29. True, but why the coincidence? 

30 and 31. The patterns of powers of 2 and of binary digits of 2 both continue; 

see [11], [14] and sequence 206 in [28]. 

32. A different sequence, number 207 in [28], which agrees for n < 9, but then 

continues 28, 41, 60, 88, 129, 189, 277, 406, 595, 872, 1278,... 

33. If y = x* and y,(1) denotes the value of d”y/dx” at x = 1, then 


Yneill) = Yn) + (T) Ina) — (3) y-20) + 215 )on—a(1) — 31(9) 5 -«(D 


+o... +(-1)"(n _ 1)!. 


This was not known to be a multiple of » + 1 when it was submitted to the 
Unsolved Problems section of this MONTHLY by Richard Patterson & Gaurar Suri. 
But in an 87-05-28 letter, Herb Wilf gives a proof, using the generating function for 
Stirling numbers of the first kind. His proof in fact shows that n(n — 1) divides 
y, (1) just if n — 1 divides (n — 2)!, which it does for n > 7, provided that n — 1 is 
not prime. 

34. This sequence was investigated by Jim Propp. Except that a(12) = 55, the 
pattern of Fibonacci numbers does not continue: 


n=11 12 #13 ~~ =«=14 15 16 17 18 
a(n) = 35 55 93 149 248 403 670 = 1082 


Since this was written, Wilf [21] has linked the generating function with Ramanujan’s 
continued fraction, and he observes that the numbers of propper partitions with k 
coins in the lowest row are yet another manifestation of the Catalan numbers, 
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1, 2,5, 14, 42,... [7]. These partitions are a variant of some considered by Auluck 
[1]. Auluck’s partitions have the pennies contiguous in every row, not just the 
lowest. Their numbers 1,1,2,3,5,8,... are another good example of the Strong 
Law. 

35. The expansion of the product as a power Series, is 


Ltxrt+ x2 txtt 2x5 + 2x8 + 3x7 + 3x8 + 4x? + 5x! + 6x! + 7x}? + 9x19 
10x!4 + 12x) + 14x'© + 17x)” + 19x18 + 23x)? + 26x79 + 30x71 + 35x” 
+40x?? + 46x74 + 52x + 60x7© + 67x27 + 77x78 + 87x +--+. 
The sum is the same, until... + 31x71 + 35x72 
+41x7? + 46x74 + 54x? + 60x7© + 69x?” + 78x78 + 89x29 + ---. 


This was entry 29 in Chapter 5 of Ramanujan’s second notebook [2], [3]: but he had 
crossed it out! 
Let me know if ve missed out your favorite example! 
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A BIRTHDAY GREETING 


The readers and editors of the MONTHLY send their warm best wishes to Professor I. J. 
Schoenberg, who has contributed so much to mathematics in general, and to the MONTHLY, 
in particular, on the occasion of his 85th birthday. 

The American Romanian Academy has recognized the occasion by conferring on him an 
award, “In recognition of his distinguished contributions to the advancement of scholarship 
in the sciences, in the spirit of free exchanges of values and ideas.” His publisher, Birkhauser, 
has recognized the occasion by publishing two volumes of Dr. Schoenberg’s mathematical 
writings, edited by Carl de Boor. 
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Dedicated to the memory of Irving Reiner 


Abstract. Letn > 1 and F: R" > R" be any C! map (into). Then F may be written F = C° E, where 
E is an expansion (i.e. ||E(x) — E(x’)|| > ||x — x’|| for all x, x' © IR") and C is a contraction (i.e. 
t 
|C(x) — C(x’)|| < ||x — x’|| for all x, x’ © R".) For n > 2, there is a diffeomorphism F: R" > R" that 
cannot be written F = EC. Some related results are proved. For example, it is shown that the “typical” 
continuous map F: R! > R! can neither be written F = Co E nor F = E oC. 


We prove some theorems about factoring maps of Euclidean n-space into itself as 
a composition of an expansion and a contraction. Our results can be given a 
cosmological flavor (if we believe the universe is R°*), by saying that the present 
state of the universe could have been achieved as an expansion followed by a 
contraction, but possibly not as a contraction followed by an expansion. The proofs 
of our Theorems 5 and 6 show why we require that the map F to be factored satisfy 
some differentiability requirements, for if F is everywhere badly nondifferentiable, 
no such factorization is possible (see Remark 3). 


THEOREM 1. Let F be a C! mapping of R" into R". Then F is a composition of a 
C! expansion E followed by a C} contraction, F = Co E. 


Proof. Writing d( p,q) = || p — q||, we must show that there is a C! expansion E 
of R” for which Fe E~' is a C! contraction. (It is easy to verify that if E is an 
expansion, then it must be a homeomorphism.) That is, for all p,qeR”, 
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d( F(E~'(p)), F(E~(q))) < d(p, q), which is equivalent to: for all P,Q € R", 
d(F(P), F(Q)) < d(E(P), E(Q)). 

Set P = E~\( p), Q = E~(q). So we need to find an expansion E that expands 
more than F does. For this purpose, it suffices to find an expansion E which is 
infinitesimally more expanding than F is, by which we mean that the differential E, 
of E increases the length of each tangent vector to R” more than the differential of 
F does. For if we have found such an E, then E increases the length of any curve 
more than F does. Then, if we denote by y the inverse image under E of the straight 
line segment joining E(P) to E(Q), then 


d(F(P), F(Q)) < length (F(y)) < length (E(y)) = d(£(P), E(Q)). 
This first part of the proof holds quite generally—for example on any Riemannian 
manifold. It remains to show that there is, for any n > 1, a C! expansion E which 
is infinitesimally at least as expanding as the given C! mapping F. 
We may choose E to have the form 


E(p) =A(r)-p, 
where r = || p|| is the distance to the origin, and A(r) is any smooth function defined 
for r > 0 and satisfying 
(i) A(r) is increasing 
(ii) A(r) > max{||F,(v,)||: ||x|| = 7, v, is a unit tangent vector to x in R"}, and 
(iii) A(r) > 1. 


That such functions A exist is obvious. That the associated mapping E is 
expanding (infinitesimally and hence globally also since E will be one : one) follows 
from (ii), just as does the fact that E expands more than F does, follows from (11), 
once we See that (1) implies that 


| Ee(v,) |] > ACIxll) - Hoxll 


for any tangent vector v, at x. To prove this last fact, we may assume that v, points 
outward, i.e. that v, - x > 0, since E, is linear. (Otherwise replace v, by —v,.) Now 
if y € R” satisfies || y|| > ||x||, then E may be thought of as acting on the pair x, y 
by first multiplying by A(||x||) so that 


x>X=X(i[x||)-x and yoy =A(llx|l) -y, 


and then pushing y out even farther, since’ A(||y||) > A(||x||). The homothety by 
A(||x||) iticreases distances, since \ > 1, and pushing y out farther increases its 
distance from <x still more, since || y|| > ||x||, as a simple geometrical argument in the 
plane through 0, x, and y shows. So d(x, y) is increased at least by a factor A(||x||) 
if || y|| = ||x||. For y infinitesimally close to x, this says that || £,(v,)|| > A(||x|) - |le,|| 
for any tangent vector at x, and we are done. 

Notice that we may choose A(r) to be a constant for small r, which will 
guarantee that E is differentiable at the origin. Also, A(7), and hence E, may be 
chosen to be C™, so that the contraction C will have as many derivatives as F does. 


Remark 1. The same method of proof will work on any complete simply-con- 
nected open Riemannian manifold of nonpositive sectional curvature. Notice how- 
ever that some open Riemannian simply-connected manifolds do not admit any 
expanding map that strictly expands some distance—for example a manifold with 
finite volume cannot admit such an expansion. 
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THEOREM 2. There exists, ifn > 2, a C® diffeomorphism F of R” onto R” which 
may not be written as F = EC, where C is a contraction of R” and E is an 
expansion. 


Proof. Define, for each t > 1, H,: S"~' > S"~' to be the dilation by the factor ¢ 
on R”~', transferred to S"~! by stereographic projection. Thus, points get pushed 
away from the South pole, towards the North pole. Now define F: R” — R” to act 
as the application of H,,, to the sphere S”~‘(t) of radius ¢ centered at the origin in 
R”. (Thus, we think of R” as an onion, with the ¢-th layer being S”~‘(r).) If 
necessary, smooth F near the origin. 

If we were to have F = EoC, then we would have E~! = Co F~}, so that the 
contraction E~! would be more contracting than F~' is. Consider now what E7! 
does to S”~'(t), the sphere of radius ¢ centered at 0, when ¢ is large. Since F7' 
contracts the complement of some small neighborhood of the North pole on 
S"~*(t) to a small neighborhood of the South pole, and since E~' must contract at 
least as much as F, the (n — 1)-volume of E~' of this complement must be at most 
equal to the (n — 1)-volume of F~' of this complement, which is very small. Now 
since E~' is a contraction, the (n — 1)-volume of E~' of that small neighborhood 
of the North pole is also very small. Consequently, the (n — 1)-volume of 
E~+(S"~'(t)) is very small if ¢ is large. By the isoperimetric inequality, we conclude 
that the n-volume of E~'(B"(t)) is very small if ¢ is large, where 


B"(t) = {x ER”: ||x|| < ¢}. 
Thus 
lim vol(E~(B"(t))) = 0, (*) 


But the B”(t)’s increase as sets as ¢ increases, so the same is true of E~‘(B"(t)). 
Consequently, vol( E~*(B"(t))) must increase with t. However, vol E~1(B"(t)) > 0, 
for all ¢, since E~* is one: one and continuous, and thus E~! is an open map. This 
contradicts (*) and shows that no such E exists. 


Remark 2. The above proof actually shows that F cannot be written as F = e °c, 
where c reduces the (n — 1)-volumes of hypersurfaces and e locally expands the 
(n — 1)-volumes of hypersurfaces. (Here we think of the domain of e as the range 
of c.) Notice that, by the isoperimetric inequality, such mappings c and e would 
have to reduce, respectively expand, n-volumes locally. This raises the question of 
whether we can write such an F as F = ec y, where y reduces n-volumes, and ¢e 
locally expands them. 


THEOREM 3. There exists a C® map F: R' — R' that cannot be written F = E°C 
where C is a contraction and E is an expansion. 


Proof. Let p; = i, gq, =i+1/i, i = 2,3,4,.... Let F bea C® map on R’ such 
that F( p,) = 0 and F(q;) = 1 for all i. (Actually, we may take F real-analytic.) If 
we could write F = E oC, then we would have E(C( p;)) = 0, E(C(q;)) = 1 for all 
i. Since E would be a homeomorphism, we would have C( p,;) = E~‘(0), C(q;) = 
E~\(1). But d(E~'(0), E~*(1)) < d(p,, q;) > 0, and we thus have a contradiction. 


THEOREM 4. Let F: R'— R! be a one: one C! map. Then there exist a C' 
contraction C and a C' expansion E such that F = EeC. 
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Proof. It clearly suffices to find g,h © C\(R*) such that g’(x) > 1 for all x, 
|h’(x)| < 1 for all x and F = goh. To do this, let h’(x) = F(x) where |F’(x)| < 1, 
and define A’(x) = sgn F’(x) where |F’(x)| > 1, and let h(0) = 0. This defines 
h(x). lf |F’(x)| < 1, define g’(h(x)) = 1. If |F’(x)| > 1, let g’(h(x)) = | F(x). 
Let g(0) = F(0). This defines g(x). Now F=geh since F(0) = g(h(0)) and 
F’(x) = g"(h(x))h(x) for all x. Q.E.D. 


THEOREM 5. There exists a continuous map F: R — R that cannot be written 
F = CoB, where C is a contraction and E is an expansion. 


Proof. Let F be any nowhere differentiable continuous function, and suppose 
F=CcoE, Since E is an expansion, it is a monotonic function on R, and 
consequently its derivative E’(x) exists and is finite on a set A whose complement 
has zero measure. Since C is a Lipschitz function, C’(x) exists and 1s finite off a set 
B of zero measure. As x runs over A, E(x) certainly runs over a set of positive 


measure. Consequently am © of )\(x) = C’(E(x))E(x) exists and is finite for 
x 
some x © R, which is impossible if F = Co E. 


THEOREM 6. There exists a continuous map F: R — R that cannot be written 
F = EC, where E is an expansion and C is a contraction. 


Proof. Let F be the Weierstrass function: 


00 3 
F(x) = ¥ a’cos(ab"x);0 <a<1, ab> x7 
—a 


n=1 
where b is an odd integer (see [1, pp. 401—404]). We will actually prove that we may 
not write F = He°C where H is a homeomorphism, and C is a contraction. For 
there must exist an x © R such that H’(C(x)) exists and is finite, because H is 
monotone and the range of C must contain a nontrivial interval. For this x (see [1, 
pp. 404]—note that one actually has there N, > 6 > 0 and not merely N, > 0, for 
all n) there exists a sequence x, — x such that |f(x,) —f(x)|/|x, — x| > oo. 
Clearly, C(x,,) = C(x) for at most finitely many n, which exceptions we discard. 
Then 


C(x,) — C(x) 


X,— Xx 


ae — f(x) 


_|H(C(x,)) = H(C(x)) | 
C(x,) — C(x) 
As n > oo, the left-hand side approaches oo, while the first term on the right-hand 


side approaches |H’(C(x))| < oo. But the last term is bounded by 1, which is clearly 
impossible. 


Remark 3. By a result of Jarnik [2], every continuous function F: R > R, with 
the exception of a set of functions of the first category in C(R), has every extended 
real number as a derived number at every point. So the above proofs show that the 
‘typical’ continuous map F: R — R cannot be written either as F = EoC or 
F=CoE., 
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The William Lowell Putnam Mathematical Competition 
LEONARD F. KLOSINSKI, Santa Clara University 

G. L. ALEXANDERSON, Santa Clara University 

LOREN C. LARSON, St. Olaf College 


The following results of the forty-eighth William Lowell Putnam Mathematical 
Competition, held on December 5, 1987, have been determined in accordance with 
the governing regulations. This annual contest is supported by the William Lowell 
Putnam Prize Fund for the Promotion of Scholarship, left by Mrs. Putnam in 
memory of her husband, and is held under the auspices of the Mathematical 
Association of America. 

The first prize, $5,000, was awarded to the Department of Mathematics of 
Harvard University. The members of the winning team were: David J. Moews, 
Bjorn M. Poonen, and Michael Reid; each was awarded a prize of $250. 

The second prize, $2,500, was awarded to the Department of Mathematics of 
Princeton University. The members of the winning team were: Daniel J. Bernstein, 
David J. Grabiner, and Matthew D. Mullin; each was awarded a prize of $200. 

The third prize, $1,500, was awarded to the Department of Mathematics of 
Carnegie-Mellon University. The members of the winning team were: Petros I. 
Hadjicostas, Joseph G. Keane, and Karl M. Westerberg; each was awarded a prize 
of $150. 

The fourth prize, $1,000, was awarded to the Department of Mathematics of the 
University of California, Berkeley. The members of the winning team were: David 
P. Moulton, Jonathan E. Shapiro, and Christopher S. Welty; each was awarded a 
prize of $100. 

The fifth prize, $500, was awarded to the Department of Mathematics of the 
Massachusetts Institute of Technology. The members of the winning team were: 
David T. Blackston, James P. Ferry, and Waldemar P. Horwat; each was awarded a 
prize of $50. 

The six highest-ranking individual contestants, in alphabetical order, were David 
J. Grabiner, Princeton University; David J. Moews, Harvard University; Bjorn M. 
Poonen, Harvard University; Michael Reid, Harvard University; Constantin S. 
Teleman, Harvard University; and John S. Tillinghast, University of California, 
Davis. Each of these students was designated a Putnam Fellow by the Mathematical 
Association of America and awarded a prize of $500 by the Putnam Prize Fund. 

The next five highest-ranking individuals, in alphabetical order, were Daniel J. 
Bernstein, Princeton University; Constantine N. Costes, Harvard University; Jeremy 
A. Kahn, Harvard University; Rav Kumar Ramakrishna, Cornell University; and 
Japheth Wood, Washington University, St. Louis. Each was awarded a prize of 
$250. 

The following teams, named in alphabetical order, received honorable mention: 
Rice University, with team members Charles R. Ferenbaugh, Thomas M. Hyer, and 
John W. McIntosh; Stanford University, with team members Thomas H. Chung, 
John A. Overdeck, and Joshua R. Zucker; the University of Toronto, with team 
members Gary F. Baumgartner, Edward I. Doolittle, and Jeffrey S. Rosenthal; 
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Washington University, St. Louis, with team members Daniel N. Ropp, Peter 
Shawhan, and Japheth Wood; and Yale University, with team members Kamal F. 
Khuri-Makdisi, Robert S. Manning, and William M. Nelson. 

Honorable mention was achieved by the following thirty-eight individuals named 
in alphabetical order: Thomas R. Amoth, Oregon State University; Leland F. 
Brown, California Institute of Technology; Emory F. Bunn, Princeton University; 
Yeow Meng Chee, University of Waterloo; Timothy Y. Chow, Princeton University; 
William P. Cross, California Institute of Technology; Galia D. Dafni, Pennsylvania 
State University; Douglas R. Davidson, Princeton University; Frank M. D’Ippolito, 
University of Waterloo; Edward J. Doolittle, University of Toronto; Serge Elnitsky, 
Carleton University; Chenteh Kenneth Fan, Harvard University; Petros Hadjicos- 
tas, Carnegie-Mellon University; Thomas R. Hagedorn, Princeton University; 
Thomas S. Harke, University of Alberta; Waldemar P. Horwat, Massachusetts 
Institute of Technology; William C. Jockusch, Carleton College; Alex T. Kachura, 
University of Waterloo; Joseph G. Keane, Carnegie-Mellon University; Kamal F. 
Khuri-Makdisi, Yale University; Jordan Lampe, University of California, Berkeley; 
Daniel D. Lee, Harvard University; John W. McIntosh, Rice University; Robert S. 
Manning, Yale University; David P. Moulton, University of California, Berkeley; 
Matthew D. Mullin, Princeton University; Du Nguyen, Ottawa University; David 
L. Petry, University of Oregon; Daniel N. Ropp, Washington University, St. Louis; 
David B. Secrest, University of Illinois, Urbana-Champaign; Robert G. Southworth, 
California Institute of Technology; Glenn P. Tesler, California Institute of Technol- 
ogy; Martin Trudeau, Université de Montréal; Christopher S. Welty, University of 
California, Berkeley; Karl M. Westerberg, Carnegie-Mellon University; Glen T. 
Whitney, Harvard University; Russil Wvong, University of British Columbia; and 
Joshua R. Zucker, Stanford University. 

The other individuals who achieved ranks among the top 101, in alphabetical 
order of their schools, were: Amherst College, Peter H. Anspach; Bethel College, 
Jonathan P. McCammond; University of British Columbia, Wayne J. Broughton; 
Brown University, Kevin S. McFarland, David J. Morin; California Institute of 
Technology, Jared C. Bronski, Philip W. Nabours; University of California, Berke- 
ley, Jonathan E. Shapiro; University of California, Los Angeles, Joseph M. Rojas; 
University of California, San Diego, David L. Ruhm; Carleton University, Michael 
J. Bradley, Stephen A. Smith; Case Western Reserve University, Patrick T. Headley, 
William E. Kirby; University of Chicago, Linda E. Green, Robert P. Stingley, 
Andrew S. Yeh; Dalhousie University, Daniel J. Peters; Harvard University, Michael 
J. Callahan, David Cook, Michael P. Mitzenmacher; University of Hawaii, Jor-Kuen 
E. Lo; University of Houston (Clear Lake), John Ken Burton, Jr.; University of 
Illinois, Urbana-Champaign, James M. Grochocinski; University of Maryland, 
Catonsville, Michael J. Johnson; Massachusetts Institute of Technology, David T. 
Blackston, Claudio C. Chamon, James P. Ferry, Mark Kantrowitz, James R. Rauen; 
University of Michigan, Ann Arbor, William F. Doran IV, Matthew A. Klimesh, 
Paul Kominsky; Oberlin College, David B. Carlton; University of Pennsylvania, 
Michael Albert; Princeton University, Rahul V. Pandharipande; University of 
Regina, Simon H. Lee; Rice University, Charles R. Ferenbaugh, Thomas M. Hyer; 
Siena College, Gregory S. Spradlin; University of South Carolina, Roger B. Milne; 
Stanford University, John C. Loftin, John A. Overdeck; University of Texas, 
Austin, Jared L. Levy; University of Toronto, Gary F. Baumgartner, Jeffrey S. 
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Rosenthal; Virginia Polytechnic Institute and State University, Patrick R. Brown; 
Washington and Lee University, John D. Boller; Washington University, St. Louis, 
David S. Shobe; University of Waterloo, Marc A. Chamberland, Bryan K. Feir, 
Giuseppe Russo. 


There were 2170 individual contestants from 359 colleges and universities in 


Canada and the United States in the competition of December 5, 1987. Teams were 
entered by 277 institutions. 


The Questions Committee for the forty-eighth competition consisted of Harold 
M. Stark (Chairman), Gerald A. Heuer, and Abraham P. Hillman; they composed 


the problems listed below and were most prominent among those suggesting 
solutions. 


PROBLEMS 


Problem A-I 
Curves A, B, C, and D, are defined in the plane as follows: 


x 
A= Sy) x7- t= =I, 
{e y) x y x2 + y? 


y 
{Cx y):2xy ay 


(x, y): x° — 3xy? + 3y = 1}, 


C 
D = {(x, y): 3x*y — 3x — y? = 0}. 


Prove that AN B=CQOD. 
Problem A-2 
The sequence of digits 


123456789101112131415161718192021... 


is obtained by writing the positive integers in order. If the 10”th digit in this 
sequence occurs in the part of the sequence in which the m-digit numbers are 
placed, define f(n) to be m. For example, f(2) = 2 because the 100th digit enters 
the sequence in the placement of the two-digit integer 55. Find, with proof, /(1987). 
Problem A-3 
For all real x, the real-valued function y = f(x) satisfies 


py’ —2y’ +y = 2e”. 
(a) If f(x) > 0 for all real x, must f’(x) > 0 for all real x? Explain. 


(b) If f’(x) > 0 for all real x, must f(x) > 0 for all real x? Explain. 


Problem A-4 
Let P be a polynomial, with real coefficients, in three variables and F be a 
function of two variables such that 
P(ux,uy,uz) =u’?F(y—x,z—x) for allreal x, y, z,u, 


and such that P(1,0,0) = 4, P(0,1,0) = 5, and P(0,0,1) = 6. Also let A, B,C be 
complex numbers with P(A, B,C) = 0 and |B — A| = 10. Find |C — A]. 
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Problem A-5 
Let 


- —y x 
G(x, y) = apr eee): 


Prove or disprove that there is a vector-valued function 


F(x, y, z) = (M(x, y,z), N(x, y,z), P(x, y,z)) 
with the following properties: 


(i) M, N, P have continuous partial derivatives for all (x, y, z) # (0,0, 0); 
(ui) Curl F=0 for all (x, y, z) # (0,0, 0); 


(iii) F(x, y,0) = G(x, y). 
Problem A-6 


For each positive integer n, let a(n) be the number of zeros in the base 3 
representation of n. For which positive real numbers x does the series 


0 a(n) 
nat 

converge? 

Problem B-1 

Evaluate 

4 In(9 — x) dx 
Problem B-2 
Let r, s, and ¢ be integers with O < r,0 < s, andr+s <t. Prove that 


(5 t+] 


to), G) ,_G) ., G). 


(4) C4) (£5) L.4s) eet) 


(Note: (” ) denotes the binomial coefficient n(n — 1):--(n+1-k)/ 
k(k —1):--3-2-1,) 


Problem B-3 
Let F be a field in which 1 + 1 # 0. Show that the set of solutions to the 
equation x? + y* = 1 with x and y in F is given by (x, y) = (1,0) and 
r-—1 2r 


(x,y) = ret 1? ret]? 


where r runs through the elements of F such that r? # —1. 


Problem B-4 
Let (x1, y,) = (0.8, 0.6) and let x,,, = x,cosy, — y,siny, and y,,, = x,siny, + 
y,cos y, for n = 1,2,3,.... For each of lim, _,,,x, and lim, _,,y,, prove that the 


limit exists and find it or prove that the limit does not exist. 
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Problem B-5 
Let O, be the n-dimensional zero vector (0,0,...,0). Let M be a2n Xn matrix 
of complex numbers such that whenever (2), 2,,..., Z2,)M = O,, with complex z,, 
not all zero, then at least one of the z,; is not real. Prove that for arbitrary real 
numbers rj, ’,..., %,, there are complex numbers w,, w,,..., w, such that 
Wi ry 
Re| M| - | |= | - 
Wy, yn 


(Note: If C is a matrix of complex numbers, Re(C) is the matrix whose entries are 
the real parts of entries of C.) 


Problem B-6 

Let F be the field of p? elements where p is an odd prime. Suppose S is a set of 
(p” — 1)/2 distinct nonzero elements of F with the property that for each a ¥ 0 in 
F, exactly one of a and —a is in S. Let N be the number of elements in the 
intersection SM {2a: a © S}. Prove that N is even. 


SOLUTIONS 


In the 12-tuples (149, Ng,..., No, M1) following each problem number below, n, 
for 10 > i > 0 is the number of students among the top 204 contestants achieving ij 
points for the problem and n_, is the number of those not submitting solutions. 


A-1. (72, 24, 22, 0, 0, 0, 0, 0, 5, 0, 45, 36) 


Solution 1. First note that (0,0) doesn’t belong to either 4d N B or C/N D, so in 
what follows suppose that (x, y) # (0,0). 

Let Eqg(i), i= 1,2,3,4, denote the equation that defines the set A, B,C, D 
respectively. Also, let f(x, y)Egq(i) denote the equation obtained by multiplying 
each side of Eq(i) by f(x, y). 


The matrix product 
(; | Eq(1)\ _ | £q(3) 
Y *}\ Eq(2) Eq(4) 
shows that A 1 B GC CN D, and 


vere -(3 *) (no 
Eq(2) vy *} \ Eq(4) 


x y 
7 x*>+y? x*+y? |/ Eq(3) 
| *__ }\ Bq(4) 


x? + y? x? + y? 


shows that CON DCANQB. 
These two inclusions show that 4N B= COD. 
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Solution 2. Let z = x + iy. Then 
(x,y) €ANB iffz?=3i+1/z 
iff z> = 3iz + 1 
iff (x,y) ECND. 


A-2. (117, 28, 22, 2, 0, 0, 0, 3, 10, 5, 10, 7) 

The r-digit numbers run from 10’~! to 10” — 1, so there are 10” — 10’~' of them. 
Thus, the total _ number of digits in numbers with at most r 
digits is g(r) = Le_,k(10* — 10*7*) = —1 + Derk(k — (k + 1))10* + 710” = 
—P7at10* + 710” = "0" — (10’ — 1)/9 for r > 1. But 0 < (10” — 1)/9 < 10’, so 
(r — 1)10” < g(r) <rl0". Thus, (1983) < 1983 - 101983 < 104 - 101983 = 101987, 
and g(1984) > 1983 - 101984 > 103 - 10184 = 10187, It follows that f(1987) = 1984. 


A-3, (119, 32, 1, 0, 1, 4, 2, 0, 0, 5, 11, 29) 

The general solution to the differential equation is f(x) = (x* + bx + c)e* with 
b and c real. For such a function f(x) = (x* + (b + 2)x + (b+ c))e*. Clearly 
f(x) > 0 for all x if and only if D = b* — 4ac < 0, and f’(x) > 0 for all x if and 
only if D’ =(b+2)?- 4b+c)=6b°+4b+4-4b-4c=D+4<0. The 
answer to (a) is “no” because D < 0 does not imply D’ < 0. (For example, take 
b=c=1; then f(x) > 0 for all x but f’(—1) = 0.) The answer to (b) is “yes” 
because D’ < 0 implies D < 0. 


A-4. (14, 8, 6, 4, 0, 0, 0, 1, 28, 9, 25, 109) 

Letting u=1 and x=0, we have F(y,z) = P(0, y,z) is a polynomial. 
F(uy, uz) = P(0, uy, uz) = u*F(y, z), so F is homogeneous of degree 2. Now 
P(x, y, Z) = F(y — x, z — x) implies that 


P(x, y,z) =a(y—x)+b(y—x)(z—x) + e(z-— x) 
with a,b,c real. Then 4= P(1,0,0)=a+b+c, 5 = P(0,1,0) =a, and 6= 
P(0,0,1) = c. It follows that4 =a+b+c=5+b5+ 6, andso b= —7. 
For the complex numbers A, B,C of the hypothesis, we have 5(B — A)* — 
1(B — AC — A) + (C — A)? =0. Let m=(C — A)/(B — A). Then 5 — 7m + 
6m? = 0. The roots of 6m? — 7m +5 = 0 are complex, so |m| = /5/6. Hence, 


|C — A| = ¥5/6 |B — A| = (5/3)V30. 


A-5. (8, 0, 2, 0, 0, 0, 0, 0, 1, 1, 57, 135) 
Note that Curl G = 0 unless (x, y, Z) is on the z-axis. If F exists, then by Stokes’ 
Theorem, 


G(x, y)-dr= | F(x, y,z)-dr= [| [ (Curl F) - ads = 0, 
[G(x ») [ Fs v2) f J (Cutt F) 
where C is the ellipse x? + 4y* = 1, z = 0, in the xy-plane and S is the part of the 


ellipsoid x* + 4y?+z?=1 with z > 0. However, the integral is not zero. For 
example, on C, x? + 4y? = 1 and thus, 


[ Gx ») a= [(-yi+ xj ) -dr= [ [2aa - 2 Area( E), 


where E is the interior of C. Thus F does not exist. 
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A-6. (5, 10, 2, 0, 1, 0, 0, 3, 1, 4, 80, 98) 
For each integer k > 0, the integer n in base 3 has k + 1 digits iff 3% <n < 
3*+1 _ 1, Among the integers in this interval there are (* jokti—s for which 
a(n) = i, so 
3kt1_y k 

> xa”) = y (A) xigkers _ 2( x 4 2)*. 
n=3* i=0 

Thus 


2(x + 2)* x") -2(x + 2)* 
re ne < I 


3k+3 3 3k ? 
3 a 3 


and therefore 


mix+2 amb da) mix+2)\* 
< <2 
27 5 h(a ) 2X n pa 27 | 


It follows that the series converges (for x > 0) iff (x + 2)/27 < 1; that is, 0 < x < 
25. 


B-1. (148, 5, 4, 1, 0, 0, 0, 0, 1, 0, 15, 30) 
Solution 1. Let I denote the value of the integral. The substitution 9 — x = y + 3 


gives 
_ f° yin(y + 3) dy 
2 yIn(y + 3) + yin(9- y) ” 
SO 


=2, and J=1. 


2 yin(x + 3) + jIn(9 - x) 
Solution 2. More generally, if 
x) dx 
s=f bx) de 
f(x) + f(a+b-x) 
then 
a+b-—x)dx 
s- pr A ) 
a f(x) + f(a+b- x) 
=(b-a)- | — feat) 
po f(atb—t)+f(t) 
so S = (b — a)/2. In this problem f(x) = /In(9 — x), a= 2, b= 4, s0 S = 1. 
B-2. (47, 3, 4, 0, 0, 0, 0, 0, 1, 3, 62, 84) 
Solution 1. Let 


rirs,1) 40), —) 


=(b-a)-S 
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Then 


(if 
— Gar) 2a) Gea) 
aca) (4s) 


= F(r,s—1,t)+ F(r+1,s-— 1,1). 
The proof now follows easily by induction on s. 


Solution 2. We find that 


By ec) 
r+i 


Using the binomial expansion, we see that 


j=0 t—s+l (l1—x)'°*? 


1 1 
(1 _ x)"*? (1 _ x) 


E("Ea)er] (res Pat 


| 
TT 
g 
I 
fan) 


ELEC PCT le 


Equating coefficients of the x° term, we obtain 


(ith j-x(rtj(tsre). 


i=0 
Hence 
s 
’ (5) sirl(t—r-—s)! (t+ 1)! t+1 
i=0 ( i t! s'(t-—s+1)! (r—s+(*7 8) 
r+i 


Solution 3. The problem is equivalent to proving that 


s _ 
(le) 
NX t tt 1-s! 

r+i 
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It is straightforward to show that 


and, therefore, letting / = r + i, we have 
pu) We). eal eens 
Ss . Ss 1 ts _ 
a — ' t—l 
ri 5 5 


Now, Ljz7 ( ! )(, G ‘ ; is the number of ways to choose a sequence of ¢ binary 
digits, interrupted immediately after the /th digit by a comma, such that there are r 
1’s preceding the comma (among the first / digits) and t — (s + r) 1’s following the 
comma (the final ¢ — / digits). 

But, interpreting the comma as another symbol in the sequence, we see that this 
is the number of sequences of ¢ + 1 symbols containing s 0’s, where the comma 
must be the (7 + 1)st nonzero symbol. Thus, the sequences are completely de- 


termined by the positions of the 0’s, and so the numbers of sequences is (: i" 1), 


Thus, 
I t-t \_ (‘ + 1) 
E (MGs | s 
The result follows. 


B-3. (27, 31, 5, 0, 0, 0, 0, 0, 3, 11, 57, 70) 
Let 
r7—1 2r 


= d = 
*r r>4+1 ane ar r>+1 


for r any element of F such that r? # —1. 

It is easy to check that (1,0) and (x,, y,) satisfy x7 + y? = 1. 

The problem is thus to show that if x, y are in F with (x, y) # (1,0) and 
x*+ y?=1, then x =x, and y=y, for some r. We observe that for x, ¥ 1, 
r=y,/(1 — x,), and this suggests that we set r = y/(1 — x). Then we have 


y \ y> + (1— x)’ 
Pere (2ayyi-2eGes 
1-x (1 — x) 
yr +x*-2x4+1 2—2x 2 
= or CF, 
(1 — x) (j-x)? 1-» 
and, therefore, r* — 1 = 2/(1 — x) — 2 = 2x/(1 — x). It follows that 
r?>— 1 2r 
Xx, = 3 =x and sy i rr 


“ 
+ 
— 
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B-4. (13, 14, 15, 1, 0, 0, 0, 0, 11, 23, 79, 48) 

Let y, = Arccos0.8 and 6,=y,+ y, + -:: +y,. Then x,4, = cos@, and y,,, 
= sin@,. Since sin@ = sin(w — 80) <a-—8@ for 0<6@< 7, one easily proves by 
induction that 0 < 8,< 6,,, <7 for n= 1,2,3,.... Hence L = lim, _, ,.0, exists 

= (). 


since 4, 8,,... is a monotonic bounded sequence. It follows that lim 
Since cost and sin ¢ are continuous for all real ¢, 


n-> oon 


0= lim y,,; = lim sin@, = sin( lim @,) = sin L. 
n— oc nu- oo nu-—- 0 


As 0 < L < 7a, this implies that L = 7. Then 
lim x,+4, = lim cos @, = cos( lim 6, = cos L = cos7 = —1. 
B-5. (10, 6, 3, 2, 2, 0, 0, 0, 8, 3, 16, 154) 
Solution. Write M = A+iB where A and B are real 2n Xn matrices and 
N = (A, B), areal 2n X 2n matrix. If 
Wy 


w=|[|-|=utiv 


where u and v are real column vectors of length n, then Re( Mw) = Re[(A + iB) 
(u + iv)] = 
u u 
Au- Bo=(A B)( “,)=N( 4) 

and so we need to show that N is invertible. Suppose that x = (x,,...,X5,) 18 a 
real vector of length 2” such that xN = O,,. Then xA = O,, xB = O, and hence 
xM = x(A + iB) = O,. Therefore, by hypothesis, x = O,, and hence N has an 
inverse. 


B-6. (8, 1, 1, 1, 0, 0, 1, 2, 1, 3, 62, 124) 


Solution 1. For a in S, let 2a = €,s, where s, isin S and e, = +1. Set M= 
(p? — 1)/2 — N so that M is the number of a in S such that e, = —1. If a and b 
are in S and s, = s, then a = +b and hence a = b. Therefore, as a runs through 
S, s, runs through S as well. Hence in F, 


aP-)? Tl a= TI] (es.) =(-)"T1s,=(-1)"T14. 
aes aes aes aes 
Hence, (— 1)” = 2‘°-/2, Using Lagrange’s Theorem for finite groups or the Euler 
Theorem or the little Fermat Theorem, one has 


YWPP-V/2 we (QP-L/PTDM = (qyPtD? _ 1 


and, therefore, M is even. But ( p? — 1)/2 is also even and therefore N is even. 


Solution 2. Let Z, denote the finite field of p elements {0,1,2,..., p — 1}, and 
the elements of F by {a + bx: a,b © Z,}. 

Let H = {1,2,...,(p — 1)/2} and Sy = {a + bx: bE H, a & Z,, or b = 0 and 
a © H}, and set T) = 2S). Then S, satisfies the conditions of the problem and 
| So A To| 1s even. 
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Observe that any other set S with the conditions of the problem can be obtained 
from S, by a succession of exchanges of the form “take a out of S, and replace it 
with — a.” 

Suppose then that S is a set which satisfies the conditions of the problem, that 
T = 2S and that |S M T| is even. Suppose that a € S and S’ = (S — {a}) U 
{—a}, T’ = 28S". 

If a/2 © S and 2a € S then |S’ N T’| = |S Q T| — 2 (neither a nor 2a are in 
S’O T’). If a/2 © S and —2a € S, then |S’ N T’| = |S A T| (ais not in S’ N T’ 
but —2a is). If —a/2 © S and 2a € S, then |S’N T’| = |SQT| (a is not in 
S’O T’ but —a is), andif —a/2 © S and —2a € S then |S’N T’| = |SQ T| + 2 
(both —a and —2a are in S’N T’). 

In each case the net change in the cardinality is 0 (mod 2), and the result follows 
by the preceding remarks. 


Letters to the Editor 


Editor: 


Concerning The partial order of iterated exponentials, 93 (Dec 86) with notation 
as defined therein: 
(1) the alleged “Theorem 6”, stated without proof, is in error. For instance, in 
(S;, ~<), the elements [23415] and [23145] are not covers, since [32145] is in 
between them. 
(2) The conjectures at the end of the paper fail. Their truth would strongly suggest 
that the partial order induced on S, coincides with the Bruhat order on S,. They 
do coincide for n < 4; but, for instance in S, again, [43215] and [34512] are 
comparable in the former, but not in the latter. 
Professor John Stembridge has written a very interesting paper which explores the 
relationship between the two partial orders on S,, and which reveals (1) and (2) in 
the process. I am greatly indebted to him for sharing his results with me. 


Barry W. Brunson 
Western Kentucky University 


Editor: 


It was with great interest that I read Stan Wagon’s article, “Fourteen Proofs of a 
Result about Tiling a Rectangle,” in the August/September, 1987, issue of the 
MONTHLY. I first heard of this result while a graduate student in a complex analysis 
class taught by Paul Cohen. He gave the proof using the double integral of the 
complex exponential function, which seemed to be a triumph of analysis over 
combinatorial reasoning. I have given this problem to my analysis classes sporadi- 
cally over the years and, to my surprise, in the spring of 1987 a proof was found by 
an undergraduate, Tom Amoth. His approach, given below, appears to be different 
than the fourteen proofs in the article. 
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Amoth’s proof that a rectangle tiled with rectangles, each of which has at least one 
integer side, must itself have an integer side: 
Position the large rectangle so that the origin is at the lower left-corner. Then draw 
all diagonal lines of the form x + y = integer. Let L(T) be the length of these lines 
that lie in rectangle T, divided by ¥2 . Then: 
1. If T has an integer side, then L(T) equals the area of 7. This is readily shown 
by cutting T into rectangles of unit width and then noting that diagonal lines run 
the entire length of such a rectangle only once. 
2. If T has its lower-left corner on a lattice point and L(T) equals the area of T, 
then 7 must have an integer side. Suppose the contrary, and assume T has 
dimensions m + a by n+ b where m,n EN and 0 < a, b <1. We need only 
consider U, the upper-right corner of JT having dimensions a X b. If a+ b> 1, 
then L(U) =a+b-—1=ab-— (1 —a)(1 — b) < ab, while if a + b <1, then 
L(U) = 0. In either case, L(U) < ab, the area of U, contradiction. 
Since L and area are both additive for any interior-disjoint configuration of 
rectangles, these two facts imply that the large rectangle has an integer side. 


Bob Burton 

Department of Mathematics 
Oregon State University 
Corvallis, Oregon 


UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N IN4. 


Is Selfduality Involutory? 


BRANKO GRUNBAUM* 
University of Washington, GN-50, Seattle, WA 98195 


G. C. SHEPHARD 
University of East Anglia, Norwich NR4 7TJ, England 


The concept of duality occurs in many parts of mathematics. Details may vary 
from one field to another, but the essential feature of duality is that two objects of 
the same kind are related in a symmetric manner. Here we shall be interested 
primarily in geometric duality, though similar considerations and problems arise in 
other contexts. 

Possibly the most familiar example of duality concerns convex polyhedra. We 
restrict the discussion to the three-dimensional case; there are immediate generaliza- 
tions to higher dimensions. Two polyhedra P, and P, are said to be duals of each 
other if there exists a bijection 6 from the family of vertices and faces of P,, to the 
family of faces and vertices of P,, which reverses inclusion. The bijection 6 is called 
a duality map between P, and P,. In FIGURE 1 we give an example of such a map 
between a cube and an octahedron. We follow the convention that 6 is specified by 
labelling each element (vertex or face) of P, with the same symbol as the element of 
P, into which it is mapped by 6. 

If there exists a duality map 6 from a polyhedron P to itself, we say that P is 
selfdual. An example of a selfdual polyhedron is given in FIGURE 2, and a duality 
map 46, is indicated in FIGURES 2(a) and (b). In the selfdual case, the range of a 
duality map is equal to its domain of definition, so the duality map is a permutation 
on the set of vertices and faces of P. In the example, the permutation corresponding 
to 6, is given in the caption to FIGURE 2. The rank r(6) of a selfduality map 6 is 
defined as its order (period), that is, as the smallest positive integer n such that 5” is 
the identity. Thus for the map 6, defined above we have r(d,) = 8. However, the 
same polyhedron admits another selfduality map 6, indicated in FIGURES 2(a) and 
(c) which is of rank 2, that is, it is an involution. This suggests the following 
definition of the rank r(P) of a selfdual polyhedron P: it is the minimum value of 
r(8) over all selfduality maps 6 of P. 


*The last stages of this research were supported in part by National Science Foundation grant 
DMS-8620181. 
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Fic. 1. Sketches of a dual pair of convex polyhedra. Vertices and faces corresponding to each other have 
been marked with the same label. 


Problem 1. Does every selfdual convex polyhedron P have rank 2? In other 
words, does every selfdual P admit an involutory selfduality map? 

If 6 is a selfduality map, then 6? clearly induces a selfisomorphism of P. Hence 
the problem can be restated as asking whether 6 can always be chosen so that this 
selfisomorphism is the identity. It is worth noting that if P admits a selfduality map 
of rank 2k where k is odd, then 6* is of rank 2, so for such a polyhedron P an 
involutory selfduality map necessarily exists. 

If a selfdual polyhedron P has rank 2 then we may use the same labels for the 
vertices and facets, which makes the special character of such polyhedra much more 
evident (see FIGURE 2(d)). 

The definition of duality given above belongs to the combinatorial theory of 
convex polyhedra; similar considerations in the metric theory of polyhedra arise 
through the geometric transformation known as polarity. Assuming that the origin 
O is an interior point of P, the polar P* = m(P) is defined to be the convex 
polyhedron 


P* = {x © E*|(x, y) < 1forall ye P}. 


To each vertex v of P corresponds a face of P*, namely that which lies in the plane 
with equation (x, v) = 1, and to every face of P corresponds a vertex of P*. This 
correspondence is easily seen to be a duality map (see, for example, Grunbaum 
[3, Section 3.4]). We shall say that a convex polyhedron P is selfpolar if P* is the 
image of P under a projective map 7, and then we shall call the mapping o = 7747 
a selfpolarity map of P. Clearly a selfpolarity map o induces a selfduality map 6 of 
P, and the rank of o is defined as the rank of the corresponding map 64. It can be 
shown that the rank of o does not depend on the choice of the origin O. 


Problem 2. Does every selfpolar convex polyhedron possess an involutory 
selfpolarity map? 

An affirmative answer to this problem would imply the same answer to Problem 
1 for selfpolar polyhedra—but not for selfdual polyhedra in general, since the 
following question is still open. 
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G J 
F 
(b) 
1 
[ES 
F H 8 6 
G 7 
(c) (d) 


Fic. 2. (a) A convex polyhedron P. (b) A selfduality map 6, which shows that P is selfdual; 5, can be 
represented by the permutation (1A)(2ES5D4C3B)(6J9H8G7F), hence r(6,) = 8. (c) Another selfduality 
map 6, of P; 6, can be represented by the permutation (1A)(2D)(3C)(4B)(5E)(6H)(7G)(8F)(9J), hence 
r(6,) = 2 and also r(P) = 2. (d) Another representation of the selfduality 6, of P, in which correspond- 
ing vertices and faces have been labelled by the same symbols. 


Problem 3. Is every selfdual convex polyhedron P isomorphic to a selfpolar one? 

Analogous problems arise in another example of geometric duality, namely that 
of configurations of points and lines in the (Euclidean or projective) plane. Here, 
following tradition, we say that C is an (n,) configuration if C consists of a set of n 
distinct points and n distinct lines, such that each point of C lies on precisely k of 
the lines, and each line of C passes through exactly k of the points. A configuration 
is selfdual if there is an inclusion-reversing map 6 of the family of points and lines 
of C onto itself. Examples of selfdual configurations are shown in FIGURE 3, where 
the points of the configurations are indicated by solid dots in order to distinguish 
them from those intersection points of the configuration lines which are not points 
of the configurations. It is well known (see, for example, Schroeter [7], Dorwart [2]) 
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that there exist three distinct configurations (9,) and nine configurations (10,), each 
of which is selfdual. (This fact may have led to the misguided terminology of some 
writers who call every configuration (n,) selfdual; our definition coincides with that 
which is generally accepted—see, for example, Steinitz [8].) The rank of a selfduality 
map, and the rank of a configuration, can be defined in complete analogy to the 
case of polyhedra. The configurations in FIGURE 3 have rank 2, as indicated by the 
labels, and the same is true for all the other selfdual configurations we examined, 
including all configurations (9,) and (10;). 


_\S 


(¢) @ 


FiG. 3. (a) The selfdual Pappus configuration (9, ). (b) The selfdual Desargues configuration (10;). 
(c) Another selfdual configuration (10, ). (d) A selfdual configuration (24, ). 


Problem 4. Does every selfdual configuration have rank 2? 

We conjecture that both Problems 1 and 4 have affirmative answers. It is clear 
that analogous questions can be asked in other areas of mathematics in which a 
duality relation can be defined, for example in the case of balanced designs. 

There is a geometric counterpart to Problem 4. Let the origin be any point of the 
plane that does not lie on a line of the (n,) configuration C in E’. For each point v 
of C construct the line 


{x € E*|{x,v) = 1}, 
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and for each line (x, s) = 1 of C construct the point s. The set of points and lines 
so obtained forms a configuration C* = 7(C) of type (n,) called the polar of C. 
The configuration C is called selfpolar if C* is the image of C under a projective 
transformation 7. Then the mapping o: C —> C defined by o = 7 “a is a selfpolarity 
map of C. 


Problem 5. Does every selfpolar configuration possess a selfpolarity map of 
rank 2? 

An affirmative answer to Problem 5 would imply the same for Problem 4 if it 
could be established that every selfdual configuration is isomorphic to one that is 
selfpolar. 

If the answer to Problem 1 is negative, the following more detailed question can 
be asked: 


Problem 6. Does there exist, for every k = 2,3,..., a selfpolar polyhedron P, 
such that min r(6) = 2k, the minimum being taken over all selfpolarities 6 of P? 

Clearly, analogous problems can be posed if the answer to any of Problems 2, 4, 
or 5 is negative. 

Since selfdual polyhedra and configurations have been under active consideration 
for more than a century, it is strange that problems such as those stated here not 
only remain unsolved, but do not even seem to have been previously asked. We can 
only suggest that this may be due to a confusion of thought. In all accounts we have 
been able to locate, the definition of selfduality for convex polyhedra is immediately 
followed by a completely unjustified statement which, in essence, asserts that 
“obviously, a polyhedron P is selfdual is another way of saying that r(P) = 2” (the 
precise wording depends, naturally, on each author’s terminology). As examples of 
this substitution of meaning we may mention Bruckner [1, p. 76, lines -2, -1], 
Hermes [4, p. 132, line 17], Jucovic [5, p. 5, lines 3-6]; in the presentation due to A. 
Cayley (see Kirkman [6, p. 185]) the precise meaning of “selfdual’” (“autopolar’’) is 
not made clear, but the notation indicates that the same idea was intended. 
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EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND ANITA E. SOLOW 


A Simple Verification of Ilieff’s Conjecture 
for Polynomials With Three Zeros 


G. L. COHEN AND G. H. SMITH 
School of Mathematical Sciences, University of Technology, Sydney, New South Wales 2007, Australia 


Ilieff’s Conjecture has been discussed by Marden [1], [2], who in fact attributes 
the conjecture to the Bulgarian mathematician B. Sendov in 1962. It states: If P(z) 
is any polynomial with all its zeros in |z| < 1, and Zz, is any one such zero, then P'(z) 
has a zero in |z — Z,| < 1. 

Proofs of this have been given for polynomials of degree at most five and in a 
number of other special cases for polynomials of any degree. References to these 
results may be found in Marden [1] and Schmeisser [4]. Five or six different 
methods of verification of Ilieff?’s Conjecture have been given for cubics, and two 
(by Saff and Twomey [3] and Schmeisser [4]) also include the result for polynomials 
P(z) = (2 — Zo)"°(z — 2,)"(z — 22)”, considered here. Our justification for giving 
another proof is that our approach is fully self-contained and extremely simple. 

The result is quickly achieved for polynomials P(z) as just given (where, by 
hypothesis, Zo, Z,, Z, lie in |z| < 1), when n =n, +n, + n, > 4. We set 


P'(z) =n(z ~ 29)" (2-4) "(z- 22)" (zm )(z— my), (1) 


and wish to show that P’(z) has a zero in |Z — Z)| < 1. This is immediate if n, > 1, 
SO Suppose ny = 1. Writing P(z) = (z — Z))Q(z), we see that 


P’(Z9) = Q(Z9) = (Zo — 21) "(Zo — Zz)". (2) 
From (1), with n, = 1, and (2), 
n|Zy — W,||Zo — Wal = 120 — 21l120 — 22] < 4- (3) 


Hence, if n > 4, then |Z) — wy||Z) — w,| < 1, so that |z)7 — w| < 1 for j = 1 or 2. 
Thus Ilieff’s Conjecture is verified when P(z) has a multiple zero. 


Remark. It follows in the same way that Ilieff’s Conjecture is true for any 
polynomial with p zeros if its degree n satisfies n > 2?~'. 

In the only case remaining for the polynomial P(z) above, we have ng = n, = ny 
= 1. For this, we use the general 


LEMMA. Let P(z) be any polynomial of degree n > 2. Let 2, be a simple zero of 
P(z) and let w,, Wy,...,W,-1 be the zeros of P’(z). If |P"(Z)| = (n — 1)|P’(Z9)/ 


then |Z) — w,| < 1 for at least one j = 1,2,...,n — 1. 
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Proof. We may assume P(z) is monic. Then P’(z) = nII7Z{(z — w,). If P’'(z) # 
0, we have, on taking logarithms and differentiating, 
P’(z) "=! 1 


P'(z) 


Since Z) is a simple zero of P(z), zo#w, for any j. Suppose |P’(2Z9)| > 
(n — 1)|P(Z9)|. Then if |z9 — w,| > 1 for all j, we have 


P"(Z9) n—1 1 
n-1</|— < ——— <n-l, 
P’(zZo) j= [20 — 
a contradiction. Hence |Z) — w,| < 1 for some j. 
Notice also that, for the same polynomial with zeros zp, Z,,...,Z,—1, We may 


write P(z) = (z — Z,)Q(z) and obtain 
P'(z) O29) _ tI 4 
P'(z9) Q(z) j=1 70 — Z 
Returning now to the cubic P(z) = (z — z))(z — z,)(z — Z,), we set x = 
Zo — Zib Y = [20 — Za 4 = [Zo — 3(%1 + 22), B= G4, — ZL SOO <x <2,0<y 


< 2,0 <a<2,0 <b < 1. See FIGURE 1 in which the vertices of the triangle ABC 
are at Zo, Z,, Z>, respectively. 


A, 2 


Fic. 1. 


From (4), we have 
1 1 


Zo 41 Zo ~ 22 


P’(zZ9) 


4a 
P'(Z9) 


xy’ 


so, by the Lemma, Ilieff’s Conjecture is true whenever xy < 2a, and in particular 
(since x < 2 and y < 2) whenever x < a or y < a. We assume therefore that x > a 
and y >a. 

From (3), we have 3|z) — w,||Z9 — w.| = xy, So Ilieff's Conjecture is true if 
xy < 3. Assuming then that xy > 3, we have x? + y? = (x — y)? + 2xy > 6. Since 
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x >a, y>a and cosA = (x? + y? — 4b?) /2xy > (6 — 4)/2xy = 1/xy > 0, tri- 
angle ABC in FIGURE 1 is acute-angled. Then the triangle’s circumradius R satisfies 
R < 1 (proved below) and, if h is the length of the perpendicular from A to BC, a 
classical theorem in Euclidean geometry (also proved below) gives us 


xy = 2Rh < 2a. 


As above, the Lemma confirms Ilieff’s Conjecture in this case, and our proof is 
complete. 

Two matters remain to be clarified. 

In FIGURE 2, O is the circumcentre of the triangle ABC, E is on the circumcircle 
and D is the foot of the perpendicular from A to BC. The right-angled triangles 
ABD, AEC are similar, so |AB|/|AD| = |AE|/|AC]|, from which, in the notation 
above, xy = 2Rh. 


FIG. 2 


We are not sure whether it is “obvious” or not, that the circumradius R of an 
acute-angled triangle lying within a unit circle satisfies R < 1. The following is a 
proof. Let ABC be the triangle. If A, B, C are on the unit circle then R = 1. If B, 
C are on the unit circle and A is not then, since the triangle is acute-angled, A is in 
the major segment, and using FIGURE 3 in which F is the midpoint of BC and A’ is 
on the unit circle, 


|BC| |BC| 


. A < . a 
sin A sin A 


2R 


By rotation and/or translation, perhaps renaming the vertices, the other possibilities 
may be reduced to this one. 


Fie. 3 
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UC Spaces Revisited 


GERALD BEER 
Department of Mathematics, California State University, Los Angeles, CA 90032 


In any exposure to metric space topology, spaces with the following properties 
are considered: (a) each continuous function on the space with values in an 
arbitrary metric space is uniformly continuous; (b) each open cover of the space has 
a Lebesgue number, i.e., there exists a number 6 > 0 such that each subset of 
diameter less than 6 lies entirely in some member of the cover. Both of these 
properties hold for compact metric spaces, but as is well known, neither char- 
acterizes compactness, and the class of spaces in which (a) holds is precisely the 
class in which (b) holds. This larger class of spaces has been called the UC spaces 
[11], the Lebesgue spaces [7], or the Atsuji spaces ({2], [3]) (in recognition of Atsuji’s 
fundamental paper [1]). This class seems to be periodically rediscovered in this 
MONTHLY ([4] and [6]). Perhaps the most visual internal characterization of a UC 
space X is the following [6]: the set X’ of limit points of X is compact, and for each 
e> 0, {x © X: d(x, X’) > €} 1s uniformly discrete, i.e., there exists 6 > 0 such that 
if d(x, X’) > « and d(y, X") > « and x # y, then d(x, y) > 6. From this descrip- 
tion, one can easily construct exotic UC spaces. For example, if {e,: n € Z*} is the 
standard orthonormal base in /,, then 


{(1/j)e,;n€Z*, jEZ*} vu {0} 


is a nonlocally compact UC space. 

Among the many characterizations of UC spaces, perhaps the most intriguing 
describe the way they sit between the compact spaces and the complete ones. A 
space X is compact if and only if each sequence with distinct terms has a cluster 
point, and it is complete if and only if each sequence with distinct terms in which 
pairs of terms are arbitrarily close eventually has a cluster point. Similarly, a space X 
is UC if and only if each sequence with distinct terms in which pairs of terms are 
arbitrarily close frequently has a cluster point ({(3] and [10]). With respect to 
decreasing sequences of nonempty closed sets, a space X is compact if and only if 
each such sequence has nonempty intersection, and a space X is complete if and 
only if each such sequence whose diameters go to zero has nonempty intersection. An 
analogous characterization holds for UC spaces [3], provided we replace the diame- 
ter functional by another geometric functional A, defined as follows: 


A(S) =0 
(A) = sup{d(a, X — {a}): aE A} (A # @). 
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Intuitively, 4 measures the maximal degree to which points of a subset A of the 
space X are isolated. 

Which metrizable spaces X admit a UC metric? The answer is quite simple: X’ 
must be compact. In essence, this result is due to Nagata [8], who showed that the 
fine uniformity for a metrizable space is a metric uniformity precisely when X’ is 
compact (see also [9]). The result also follows from a nontrivial fact about refine- 
ments of sequences of open covers of a metrizable space X: if (Q,,) is a sequence of 
open covers of X, then there exists a compatible metric p such that for each positive 
integer n, {{w © X: p(x, w) < 1/n}: x © X} refines Q, [5; p. 196]. To See this, let 
d be any compatible metric for X. For convenience, we denote the parallel body 
{x © X: d(x, X’) < €} by S,[X’; «]. For each n € Z*, let Q,, be the following open 
cover of X: 


Q,= {S,[X51/n]} Uv ({x}: x eX — X'S. 


Now let p be the metric induced by the sequence ({2,,) as described above. To show 
that p is a UC metric, we need only show that for each « > 0, (S,[X’; e])° is 
uniformly discrete. If not, then for some positive «, we can construct a sequence 


X15 Wi, X45; W, X35 W3, eee 


in (S,[ X’; €])° such that for each n, 0 < p(x,, w,) < 1/n. By construction, for each 
n> 1/e, x, must liein S,[X’; 1/n], for otherwise, the p-ball of radius 1/n about 
x, would contain no other point (in particular, it would not contain w,). As a result, 
d(x,, X’) — 0; so, by the compactness of X’, (x,) has a cluster point p. Since all 
terms of the sequence are isolated points, no term can occur infinitely often, so, the 
point p must be a limit point of X. This produces a contradiction, for our sequence 
lies entirely in the closed set (S,[ X’; €])‘. 

Neither approach above to the question of UC metrizability seems appropriate 
for the nontopologist. We produce the first utterly elementary proof. 


THEOREM. Let X be a metrizable space. If X’ is compact, then X is UC metrizable. 


Proof. If X’ = @, then the zero-one metric is compatible with the (discrete) 
topology of the space. Thus, in this case, X is UC metrizable. Otherwise let d be a 
compatible metric for X. Define p: X X X — [0, co) by 


0 ifx=y 
p(x, ¥) = ale y) + max{d(x, X’),d(y, X)} fx #y. 
Now if a, 8, and y are nonnegative, we have 
max{a,y} <a + y < max{a, 6} + max{B,y} 


Since p is the sum of a metric and a pseudometric, p is a metric. Clearly, p is at 
least as strong as d. Conversely, we show each p-ball contains a concentric d-ball. 
To this end, fix x € X and e > 0. If x © X — X’, there exists 6 > 0 with S,[x; 6] 
= {x}. Thus, S,[x; 6] C S,[x; e]. Otherwise, x © X’. If d(x, y) < €/2, then 
p(x, y) = d(x, y) + max{d(x, X’), d(y, X’)} 
<¢e/2 + max{0,d(y,x)} <e/2+¢e/2 =e. 


Thus, S,[x; €/2] C S,[x; €]. 
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To show that p is a UC metric, we again need only show that for each «e > 0, 
(S,[X’; €])* is uniformly discrete. If not, then just as before, for some positive , we 
can construct a sequence 


X15 Wy Xo, Wa, X3, Wy... 


in (S,[ X’; e])° such that for each n, 0 < p(x,, w,) < 1/n. By the definition of the 
metric p, for each n, d(x,, X’) < 1/n. By the compactness of X’, (x,,) has a cluster 
point in X’, leading us to the desired contradiction. M& 


We remark in closing that if (X,d,) and (Y, dy) are arbitrary metric spaces, 
and f: X — Y is continuous, then there always exists an equivalent metric for X 
which makes that single function uniformly continuous, namely, 


p(x,w) = dy(x,w) + dy(f(x), f(w)). 


REFERENCES 


1. M. Atsuji, Uniform continuity of continuous functions of metric spaces, Pacific J. Math., 8 (1958) 
11-16. 
2. G. Beer, Metric spaces on which continuous functions are uniformly continuous and Hausdorff 
distance, Proc. Amer. Math. Soc., 95 (1985) 653-658. 
3. , More about metric spaces on which continuous functions are uniformly continuous. Bull. 
Australian Math. Soc., 33 (1986) 397-406. 
4. M.A. Chaves, Spaces where all continuity is uniform, Amer. Math. Monthly, 92 (1985) 487-489. 
5. J. Dugundyji, Topology, Allyn and Bacon, Boston, 1966. 
6. H. Hueber, On uniform continuity and compactness in metric spaces, Amer. Math. Monthly, 88 
(1981) 204-205. 
7S. Nadler and T. West, A note on Lebesgue spaces, Topology Proc., 6 (1981) 363-369. 
8. J. Nagata, On the uniform topology of bicompactifications, J. Inst. Polytech. Osaka, 1 (1950) 
28-38. 
9. J. Rainwater, Spaces whose finest uniformity is metric, Pacific J. Math., 9 (1959) 567-570. 
10. G. Toader, On a problem of Nagata, Mathematica (Cluj), 20 (43) (1978) 78-79. 
11. W. Waterhouse, On UC Spaces, Amer. Math. Monthly, 72 (1965) 634-635. 


Line Integrals of Higher-Order Derivatives 


G. N. HILE AND R. Z. YEH 
Mathematics Department, University of Hawaii, Honolulu, Hawaii 96822 


1. Introduction. In advanced calculus one learns that a smooth function can be 
recovered from its gradient via a line integral; indeed, one has the formula 


f(x) = f(x) + f Vy) a, (1) 


where the line integral can be taken over any sufficiently regular curve running from 
X, to x. But what if only higher order derivatives of f are known, say all partial 
derivatives of order m for some integer m greater than 1? It seems to be not 
generally recognized that an analogue of (1) holds for higher order derivatives; one 
can recover f from its derivatives of order m via a line integral. Uniqueness is 
guaranteed up to an arbitrary polynomial of degree m — 1. We give the formula and 
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explain the notation later; it is 


(x)= ye f 5 Oe ale-»)"],  @) 


a! 


where for convenience we take x, = 0 as the starting point for the line integral. 

A related problem in advanced calculus is that of determining whether an n-tuple 
(u,,..., u,,) of functions in a domain in R” is the gradient of some function f/f. If the 
domain is simply connected and the functions {u;} are of class C', the so-called 
compatibility conditions, 


du,/Ox,= 0u,/dx; l<i<j<n, (3) 


are necessary and sufficient to guarantee the existence of such an f. The same 
question can be raised for higher order derivatives, with an analogous answer. If 
certain compatibility conditions are met, then a given collection of functions {u,} 
will correspond to the derivatives of order m of some function f. 


2. Recovery of a Function from Derivatives. It is important to use economical 
notation; otherwise, one is quickly lost in a sea of subscripts. We let a = (a,,..., a,) 
denote a multiindex of dimension n; thus a is an n-tuple whose components are 
nonnegative integers. The differential operator D“%, as customarily defined, is 


D*% ‘= 0105 eee Onn, 


where ¢d, denotes differentiation with respect to the ith variable. The order of D® is 
the quantity 


ja] =a, ta, +--- +a,. 
Thus if f is a real-valued function of x = (x,, x5,..., x,), D%f is the derivative 
g lal ¢ 
D°f = a aye ty yen 
Oxy} 0x5? eee Oxo 


We always assume that our functions are sufficiently smooth so that mixed partials 
do not depend on the order of differentiation; for example, 07f/0x; 0x, = 
0*f/ax ; 9x; How many distinct derivatives does f have of order m? The question 
is equivalent to asking in how many ways m indistinguishable apples can be 
distributed among n people; hence ({2], [3]) the formula is (™ tna), 

Monomials x* are defined as 


x% = xix 5? eee xOn 


n 
and the generalized factorial a! is 
al:=a,!a,!--- a,!. 
A domain 2 in R” is a connected open set. A smooth curve @ in @ is a 
parametrized set @ = { y(t): a < t < b}, such that y is a C’ function from [a, b] 
into Y, with velocity y(t) never zero. A simple smooth curve never meets itself 


except possibly at the two ends. For real-valued functions f in C(M) and g in 
C'(QZ), we have the line integral 


ff0) dg(y) = f(y) vay) dy = f.Xf0) dg(y) 


dy; 


dy,. 
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For f and g both in C\(Q), it is not difficult to verify the integration by parts 
formula, 


y=y(b) 
f fag + f gat = false) = F(y)8(y) 
e 6 6 y=y(a) 

We now verify the recovery formula (2) under the assumptions that f is a real 
valued function in C”(Q), with the origin 0 lying in ZY, and with the line integral 
taken over any simple smooth curve in Z running from 0 to x. For the case m = 1 
the formula reduces to (1) with x, = 0, and so the result is already known. We 
assume the result is valid for a given positive integer m, and verify it for the next 
integer m + 1. 

Using integration by parts we can verify the identity 


ay DFO) . px (x- = 
d|(x - y)"] = hr + f 
thus (2) may be written also as 


f(x)= ¥ Df) “af > (x ea 


' 
jaj<m — @ 9 lal\=m 


pe SOT avy): 


d[D*f(y)]. (4) 


Let e’ = (0,...,1,0,...,0) denote the ith unit multi-index of dimension n. We have 
the chain of equalities 


y FE ayy = 5 A? Faves») a 


! 
ja|=m jaj=m 


5 a 9 (x= 9)" D**°f(y) dy; 


(ao by, 


dy, (a+e')! 


I 
| 
v M = 


XL  D'f(y)=— 


1 |BJ=m+1, B>1 


B! 


0 
But since race — y)® = 0 if B, = 0, the condition 8, > 1 can be discarded, and 
Jj 


the chain continues as 


-- ¥ Glen lai-- 


|Bl=m+1 B! |Bl=mt+1 


oT a(x) ae 


This last expression can now be substituted into (4) to yield formula (2), with m in 
(2) replaced by m + 1. 


3. Compatibility Conditions. Let (u,}\—m be a collection of real-valued func- 
tions of class C! in a simply connected domain J in R”. What conditions guarantee 
the existence of a function f of class C”*t! in D such that u, = D*f for each u,? If 
such an f exists, then clearly the equations 


OUgses = OUpse, for |B| =m-— 1, l<i<j<n, (5) 


must hold, as each side of the equality reduces to D®+*+*f. For example, take 
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m = 3, n = 2; then we have the compatibility conditions 
0\UE,1) = 05U (3, 0)> Ua 2) = 05U,1)> 0 Uo, 3) = dU q,>)- 
In general (5) is a system of equations in which there are (" tno) choices for B 


with |8| = m — 1, and then (") choices of i and j with 1 <i <j < n; hence there 
is a total of 
[” +n-— *) (3) 
m—1 2 


compatibility equations listed in (5). We show that these equations are sufficient; if 
they all hold then the existence of f is guaranteed, as long as the domain J is 
simply connected. We show further that f can be obtained explicitly as a line 
integral involving the functions {u,}, and that f is uniquely determined up to a 
polynomial of degree m — 1. 

For m = 1, (5) reduces to the usual conditions (3), and again the result is already 
known ([1], [4]). We assume the result is true for a given positive integer m, and 
proceed to the case m + 1. So assume we have real-valued functions {u,} j—m4+1 all 
in C'(Z), with 


OUpses = 9 Ugs ers for |B| =m, l<i<j<n. 


These formulas allow us to apply the case m=1 to each collection ug, 2, 
Ups ¢2r+++> Ugyen, to conclude that for each B with |B| = m there exists a function 
ug in C*(Q) such that 


0 Ug = Uggs fori=1,...,n. 
Moreover, if |y| = m — 1, then |y + e'| = m, and 
OUy rer = Uyperse = Uyserszes = IUyyer- 


Thus the family (ug }\g)—,, satisfies the induction hypotheses, implying the existence 
of a function f in C™*'(Q) such that 


Df = ug, for |B| = m. 
But actually f lies in C”**(Q), since each ug is in C?(M). Furthermore, if 
|a| = m+ 1, then a = B + e' for some B with |B| = m and some e’; thus 
D°f = D’*¢f = 0,( D8f) = d;ug = Ugye = Ug. 


The induction proof is complete. 
Now that we know some f exists, we see in view of (2) that any such f must be of 
the form 


fay= De fy Malay 4), 


lal<m—-1 9 Jal=m 


where the first summation on the right provides a polynomial of degree m — 1. If 
we replace this polynomial by another polynomial of the same degree, we obtain a 
function g(x) differing from f(x) by a polynomial of degree m — 1. Then for every 
a with |a| = m, we have D%g(x) = D*f(x) = u,(x). This argument shows that the 
totality of functions having {u,} as mth order derivatives are precisely those 
represented by the above equation for f. 
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4. An Application and Further Remarks 

(a) The discovery of formula (2) arose in our study of the reduction of a higher 
order partial differential equation to a first order system. As a simple example, 
consider the biharmonic equation in the plane, 


Nb = Qyeex + 2xxyy + Pyyyy = 0. (6) 


We reduce this equation to a first order system by naming all the third order 
derivatives of 9; 


u= Pxxx> v= Px xy> P = Py yy> q = Pyyy- (7) 
Then we obtain from (6) and differentiation of (7) the first order system 


u,+2v,+q,=0 


v, — uy, = 0 

8 
Pp, — vy = 9 8) 
qx — Py = 9. 


Now assuming that we have a solution (u, v, p, g) of the system, the problem arises 
of recovering a solution @ of the biharmonic equation. The last three equations of 
(8) are compatibility conditions; therefore we may use formula (2) to obtain a 
function ¢ such that the prescriptions (7) hold. The first equation of (8) then reveals 
that ¢ is a solution of the biharmonic equation. 

A similar analysis applies to more general higher order partial differential 
equations in any number of independent variables. 

(b) Formula (2) can be viewed as a generalized Taylor formula with remainder. 
It can be used to determine sufficient conditions for the validity of the infinite 
expansion 


D°f (0) 


a! 


f(x) = 2 


a 


x, 


where the sum is taken over all multi-indices a of dimension n. 

(c) If the origin 0 is replaced by an arbitrary point x, in J, then (2) is modified 
by replacing 0 with x, and x with (x — x,)*. 

(d) Versions of (2) appear in the literature ((5], [6]), but with the line integral 
taken over a straight line running from 0 to x. They are obtained by parametrizing 
the line and applying the Taylor formula with remainder for functions of one 
variable, with f being viewed as a function of one variable along the line. Our 
formula is more versatile because we do not require that 0 and x can be connected 
by a straight line, a requirement which can be met of course only in restricted 
domains. 
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A Note on Closure Continuity 


JAMES CHEW 
Mathematics Department, N.C. A & T State University, Greensboro, NC 27411 


In their classroom note [1], Andrew and Whittlesy gave the following definitions: 

A function f: X — Y is closure continuous at x © X provided that if V is an 
open set in Y containing f(x), there exists an open set U in X containing x such 
that f(U) ¢ V. (Here context makes clear in which topological spaces the closure 
operations are taken; i.e. the bar on U refers to closure in X, while the bar on V 
refers to closure in Y.) The function f is said to be closure continuous on X if it is 
closure continuous at each point of X. 

Theorem 1 of [1] states that a function that is continuous on YX is closure 
continuous on X. An example is given showing that the converse is false. Thus 
global closure continuity is strictly weaker than global continuity. 

The purpose of this note is to exhibit a relationship between closure continuity 
and a well-known generalization of continuity. Recall that a function f: X —> Y is 
called connected if the image of each connected set in X is a connected set in Y. 


THEOREM 1. If f: X — Y is closure continuous on X, then f is connected. 


Proof. In what follows, we may assume that the given connected set in X is all of 
X and that Y = f(X). (If not, we simply consider the relevant relative topologies.) 
Suppose that Y is not connected. Then Y = V U W, where V and W are nonempty, 
disjoint open sets. Let A = f-'(V). For each x & A, there exists an open set U 
containing x such that f(U) C V by closure continuity of f on X. We claim that 
Uc A. If not, f(U) would intersect W, say at p. Then p = f(x’) for some x’ € U 
and so f(x’) € V= Vor p& VN W, contradicting the disjointness of V and W. 
Hence, A = f~!(V) is an open set in X. Similarly, we see that B = f~1(W) is open 
in X. Hence X is not connected because X = A U B where A and B are nonvoid, 
disjoint open subsets of X. 

Theorem 5 of [1] states that if X and Y are regular spaces then f: X — Y is 
closure continuous on X if and only if f is continuous on X. A result of Rowe [2] 
states that a function f: R — R is continuous if and only if f satisfies both of the 
following conditions: (1) f is connected; (2) f~'(y) is closed for each real number 
y. It is well known that for real-valued functions of a real variable the condition of 
being connected is equivalent to the Intermediate Value Property (Darboux Prop- 
erty): Given a number C between f(a) and f(b), there exists c between a and b 
such that f(c) = C. It is also well known that if f: R — R is such that f’(x) exists 
for each x € R then /’(x) satisfies the Darboux Property. Now, let f: R — R be 
any function such that f’(x) exists but is discontinuous on R. Then f(x) is a 
connected function that is not closure continuous on R. (If f’(x) were closure 
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continuous on R, then f’(x) would be continuous by Theorem 5 of [1].) Hence, the 
converse of Theorem 1 of the present note is false in general. 

A pair of conditions each strictly weaker than continuity is called a factorization 
of continuity if together the pair of conditions implies continuity. Thus, for 
real-valued functions of a real variable, {Darboux Property, closed point inverses} 
is a factorization of continuity. The example 1 in [1] shows that {closure continuity, 
closed point inverses} fails to be a factorization of continuity. We close this note 
with the following question: What are some factorizations of continuity with closure 
continuity as a factor? 
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Dimensional Analysis and Calculus Identities 


J. R. TORCZYNSKI 
Fluid and Thermal Sciences Department, Sandia National Laboratories, Albuquerque, NM 87185 


Introduction. In everyday life, we encounter a variety of units. For example, we 
might put 10 gallons of gasoline into the tank of our 2000-pound automobile and 
then proceed to use it up by driving 200 miles at a speed of 50 miles per hour during 
the next 4 hours. These common units are measures of certain physical attributes— an 
hour is a unit of time, a mile is a unit of distance, and so forth. However, we could 
have just as easily expressed the trip distance d in kilometers. There is nothing 
special about miles or kilometers: they both express the physical property of 
“length.” Similarly, hours, minutes, and seconds are measures of the physical 
attribute called “time.” 

If we denote the trip time in the above example by ¢, the units of ¢ are time, or 
[¢] = T, where we have followed the convention of denoting the units of a variable 
by [...] and the unit of time by 7. Note that here we are making a distinction 
between common units such as hours and fundamental units, or dimensions, such as 
time. When we say that a variable has dimensions of time, we mean that the 
fundamental physical property of this variable is time. Now this time may be 
measured in common units like minutes or hours, but the dimension (property, 
fundamental unit) is time. Similarly, we denote the dimensions of d by [d] = L, 
where the unit of length is denoted by L. 

Denoting the average velocity by v, we might be tempted to make the statement 
that [v] = V, where V is the fundamental unit of velocity. This statement is correct; 
nevertheless, it is more illuminating to say [v] = L/T since velocity has the 
dimensions of length divided by time. Length, time, and velocity are not three 
“independent” dimensions, so we would not wish to include all in a list of 
“fundamental” dimensions. Moreover, in a problem involving only v and the sound 
speed c, we would include only V = L/T, rather than both L and T, in a list of 
independent units since L and T appear only in this combination in all the problem 
variables and thus are not independent units in this problem. 

So far, all the physical quantities we have considered have dimensions associated 
with them. It is possible, however, to construct “dimensionless” quantities. Angles 
are a good example of this. An angle @ is defined as the ratio of s, the length of 
circular arc it intercepts, to r, the length of the radius of the circle. It is clear that 
[s] = [r] = L. We can solve for the dimension of 8 as follows: 


[9] = [s/r] = [s]/[r] = L/L = 1. 


Thus, the angle @ is a dimensionless or “pure” number. 


The Buckingham Pi Theorem. Proofs employing dimensional reasoning fre- 
quently involve and enhance our understanding of the fundamental issues in 
mathematical and physical problems [1], [2]. This method, as applied to physical 
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systems, was first rigorously described by Buckingham [3], whose Buckingham Pi 
Theorem, outlined below, expresses the basic principle of dimensional analysis. 
Given a problem described by N parameters x; with M’ units u,, then there are 
K = N — M dimensionless parameters II, which fully describe the problem, where 
M < M's the number of independent units. By fully describing the problem, it is 
meant that the equations of the problem can be recast into a form where only the 
dimensionless parameters and dimensionless variables appear. Thus, two problems 
with different values of the x, are actually the same when made dimensionless if the 
values of the II, are the same. Also, for each function q in the problem relating the 
x, such that q(x,,..., xy) = 0, there is a function p relating the II, such that 
p(il,,..., If,) = 0. In the case where one such function exists, this result is 
typically expressed as II, = p,(Il,,..., I,_,, ,43,..., I), where tacit use has 
been made of the implicit function theorem. Except for pathological cases, this 
inversion can be performed, and we shall assume this to be true for all later 
examples. 

The justification of this theorem is seen in the construction of the dimensionless 
II,. We can write the dimensions of each of the x, in the following way. 


M’ 
[x;] = I] uli . 
J=1 
Choose the following form for each of the II,. 
N 
IT, = [| xj. 
i=1 
The dimensions of the II, are then as given below. 


(1) = Tf]. 


These equations are combined with the relations for [x,] to yield explicit expressions 
for the dimensions of the II,. 


Now we wish the II, to be dimensionless, or [II,] = 1 for all k. Hence, the 
exponents of all the units must vanish. This results in the following matrix equation. 


N 
» Bix = 0. 
i=1 


Note that columns of the a-matrix are just the eigenvectors of the 6-matrix having 
an eigenvalue of zero. Let the B-matrix have rank M < M’. Thus, by definition 
there are M independent units. Since the B-matrix has M linearly independent rows 
and N columns, it has K = N — M such eigenvectors. Hence, the a-matrix has K 
columns, corresponding to the K different II,. Note that the specific choice of the 
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II, is not unique. For example, A, = II,II, is an equally good choice, correspond- 
ing to the new eigenvectors a(® = af) + af, 


Dimensional Analysis in Physical Science. A few illustrative examples are useful 
to show how dimensional-analysis ideas may be applied to problems in areas of 
physical science. 

Suppose we wish to find an expression for the angular frequency { of a simple 
pendulum. We take the pendulum bob to be a point mass m suspended from a 
frictionless pivot by a massless rod of length / in a gravitational field of acceleration 
g. The maximum angular displacement from the vertical is denoted by 8. We have 
five parameters: x, = Q, x, =m, x,=/, x, = g, and x, = 8. Appearing in this 
problem are three units: length u, = L, time u, = 7, and mass u, = M. The 
dimensions of the five parameters are given below in terms of these units: [Q] = T7}, 
[m] = M}, [J] = L’, [g] = L'T~?, and [6] = 1. Thus, in this case we have five 
parameters and three independent units, so we expect 5 — 3 = 2 dimensionless 
parameters: II, = Q(1/g)'/? and II, = @ form one possible set of dimensionless 
parameters. We presume that there is a function relating the angular frequency { to 
the other four parameters, so there exists a function p such that p(II,, Il,) = 0. 
Assuming this relation may be solved for II, we find II, = f(@) or 


Q = (g/1)'"F(8). (1) 


Here f is some unspecified function. 

At this point a few observations may be made. Note that the mass m of the 
pendulum does not enter into this relationship, so the frequency of a simple 
pendulum is independent of its mass, a point not immediately obvious. One might 
argue then that we shouldn’t have counted the mass as a parameter, which reduces 
the number of parameters to four. If this is done, however, the number of units is 
reduced from three to two (no other parameter has units of mass), so we would still 
expect 4 — 2 = 2 dimensionless parameters. 

We can also apply physical intuition to (1). If the idea of a pendulum as a 
time-measuring instrument has any validity, then f(@) should have a nonzero limit 
as @ approaches zero. [In fact, f(0) = 1, which exemplifies the fact that in physical 
problems, dimensionless parameters are often O(1).] Thus, had we neglected to 
include 6, the finite amplitude of the swing, in our parameter list, we would have 
found that p(II,) = 0. Hence, II, would be a constant, so Q = C(g//)'/*, where C 
is a constant, dimensionless number. A guess that the constant C is O(1) would be 
“physically” excellent. Reasoning along similar lines (an exercise for the reader), 
one is able to show that f(@) is an even function of @. 

Let us consider another physical problem (much less laboriously than above). 
Find the force F on a sphere of diameter D moving with velocity U through an 
incompressible fluid of density p and viscosity p. The physical quantities have the 
following units: [F] = M‘L'T~?, [D] = L}, (U] = L'T~, [p] = M'L~?, and [py] = 
M'L~'T~!. With five parameters and three units, we expect two dimensionless 
parameters, which we choose to be II, = F/pU’*D* and II, = pUD/p. Thus, the 
drag force is given by 


F = pU*D?f(pUD/), (2) 


where again f is an undetermined function. The parameter II, is referred to as the 
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Reynolds number [4], a dimensionless number of great importance in many prob- 
lems involving fluid motion. 


Physical knowledge can be used as above to Say a little more about the function 
f. It is expected from the equations of motion that the fluid density becomes 
unimportant (that is, can be left out of our list of parameters) for small values of the 
Reynolds number, whereas the viscosity becomes unimportant for large values of 
the Reynolds number. It is left as an exercise for the reader to use these facts to 
show from (2) or directly that F = C,UDp for small Reynolds numbers (Stokes’ 
law), and that F = C,pU7D* for large Reynolds numbers (turbulent drag). Here, C, 
and C, are dimensionless constants, coincidentally both O(1). 


Dimensional Analysis and Geometry. It is not widely appreciated that dimen- 
sional analysis alone is sufficient to prove some elementary mathematical theorems. 

One of the best examples is the elegant proof of the Pythagorean Theorem [5] 
employing dimensional analysis. The Pythagorean Theorem states that a? + b? = c’, 
where a and b are the legs and c is the hypotenuse of a right triangle. Denote the 
two acute angles by a and 8, as in FIG. 1. Now giving two angles and one side 
determines a triangle, and by hypothesis one of the angles here is a right angle, so 
specifying a and c determines all quantities associated with a right triangle. By this 
statement we mean that there are functions of a and c that describe a, b, B, the 
area A, and any other quantity related to the triangle. The angles a and B are 
dimensionless, [a] = [8] = 1, but a, b, and c have units of length, [a] = [b] = [c] 
= I}. Thus, from dimensional reasoning, we have the following relations: a(c, a) = 
cf, (a), b(c, a) = cf,(a), and B(c, a) = g(a). The area A has the units of length 
squared, [A] = L, so we find 


A(c, a) = c7f(a). (3) 
Here, f, f,, f,, and g are unspecified functions (although we, with our understand- 
ing of trigonometry, know what these functions are). As an aside, note that values of 
a and 6 may be interchanged (either a or B could be called “a”), so a = g(f) and 


hence the function g is its own inverse function, g = g 1. 


Cc 


Fic. 1. An arbitrary right triangle. 


Partition the right triangle into two right triangles using the altitude from the 
right-angle vertex to the hypotenuse. One triangle shares acute angle a with the 
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original triangle; hence, its other acute angle is 8 = g(a). The other right triangle 
shares acute angle B with the original triangle and therefore has a = g(f) as its 
other acute angle. Note in passing that we have just used dimensional analysis 
(apparently in a way previously unrecognized) to prove the complementarity of a 
and £ (their sum is a right angle, as shown in FIG. 1): a + B = Op, or g(a) = Op — a, 
where here we define a right angle to have a measure of 0,. The hypotenuses of 
these two new right triangles are a and b, respectively. Clearly, the sum of their 
areas equals the area of the original triangle 


A(a,a) + A(b,a) = A(c, a). (4) 


Next, we use (3), the explicit formula for the area, in (4) and divide out the common 
factor f(a) to obtain (5), the Pythagorean Theorem. 


a°f(a) + b*f(a) = c*f(a) 
a* + b* = c’. (5) 


As an aside, suppose there were an additional independent length scale R, say 
the local curvature of space. For example, we might be interested in spherical 
trigonometry. Now all triangle quantities must be functions of a, c, and R, so 
B = g(a,c/R) since c/R is a new dimensionless parameter. Although it is still true 
that a = g(8,c/R), our subdivision of the triangle runs into trouble since B # 
g(a, a/R) and a # g(8, b/R). In such a Situation, there generally will not be a 
“Pythagorean Theorem.” 

In a similar vein, it is possible to prove that the three angles of an arbitrary 
triangle are supplementary. This proof, although qualitatively the same as the 
complementarity proof above, is lengthier, so we leave it as an exercise for the 
interested reader. Hint: draw the segments joining the midpoints of the three sides 
of the triangle. 


Dimensional Analysis and Calculus. It is also possible to use dimensional analysis 
alone to prove some simple calculus identities. More specifically, we examine the 
relation 


dx dy\~? d’y P 
dy? Fa dx? (6) 


and prove it using dimensional reasoning. When compared to the conventional 
calculus proof of (6), the dimensional argument is elegant and straightforward, 
involving an understanding of the mathematical and physical basis of (6), rather 
than emphasizing the mechanical aspects of repeated application of the chain rule. 
We are interested in finding a formula relating the nth derivative of x from 
x = x(y) to the derivatives of y from y = y(x). The conventional calculus ap- 
proach to this problem is straightforward, emphasizing a mechanical use of the 
chain rule. First, we identify 
d dy\"' d 
—_ = a —. (7) 


dy dx dx 
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Then the desired relation becomes the following: 


(2) 2}. : 


Let us examine the n = 2 case. All one need do is use (8) and painlessly “turn the 
crank.” 


ey'a2 7 
dx 
z) OE) & 


dy  d*y 
-(> dx? 


Little insight goes into or comes out of this mechanical type of a proof, and there 
are ample opportunities for algebra errors. 

We now wish to use dimensional analysis to find the explicit formula for the nth 
derivative in the reversed coordinates, where 


d"x dy\"} d\" 5 

dy” ba dx{ > (9) 
Several facts become clear upon reflection. First, the explicit formula depends only 
on integer powers of the first through nth derivatives. That the formula cannot 
depend on the coordinates x and y themselves is understood from the fact that 


translating the function in the x- and y-directions does not affect slopes, curvatures, 
etc. 


; 10 
dy” dx° ” dx" (10) 


d"x (<. dy _ aly | 
Second, the explicit formula must be dimensionally correct in both x and y. Third, 
the explicit formula cannot depend on the coordinate label. In other words, the 
labels x and y may be exchanged in the explicit formula, again without altering the 
functional form, so F, must be the same function in (10) and (11). 


d"y s al 


= ee 11 
pe Ble (11) 
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Let us examine in detail the application of this dimensional reasoning to the 
proof of the n = 2 case. The second derivative is a function of only the first and the 
second derivatives. 


dx | dy dy ) (12) 


——_ — E —_,—_ 
dy? *\ dx’ dx? 
Next, we invoke dimensional correctness in x and y, essentially using the 
Buckingham Pi Theorem [3], where we take [x] = X! and [y] = Y!. The left hand 


side of (12) has one power of x in the numerator and two powers of y in the 
denominator. Thus, its dimensions are 


d*x 
Fa = X'y~?, (13) 


The arguments of F, have the following dimensions: 


Fa = xy’, (14) 
d*y -—2yl 
S| = X~y?. (15) 


There are three parameters and two independent units, so we expect that there will 
be 3 — 2 = 1 dimensionless parameters. There is only one way to combine these 
quantities to give the dimensionless variable II. 


d*x { dy \?| d? 
n= (2) (2) | (16) 
dy” \ dx} \ dx? 


Since by assumption there is a function (F,) relating the three parameters, we know 
there is a function p(II) such that p(II) = 0. Hence, II must be a constant, 
independent of whatever specific function y = y(x) we are considering. We can 
manipulate (16) into the following form. 

dx dy\~? d? 
ow) —| —, (17) 
dy dx dx 

Finally, we use the fact that we may interchange x and y. 

d*y tT dx \~* dx (18) 
dx? "\ dy} — dy? 

Comparing the above two equations shows that II* = 1, so II = 1 or —1. One can 


trivially observe (by thinking about signs of slopes and concavities) that II = —1. 
This yields the desired result. 


d*x dy\~? d*y 
, -(5 be (19) 


Note that in this derivation we have not used the chain rule or taken derivatives. 
Rather, this identity follows from purely dimensional considerations. 
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An alternate and perhaps more subtle dimensional proof of (19) involves the 
physical idea of the radius of curvature R, of a function y = y(x) (see Fic. 2). 
Since R. is determined by matching the first and second derivatives of a function to 
those of a circle, 


dy d’y 
= G| —,—,> |. 
. | dx’ dx? | (20) 
y 
xX 
Fic. 2. The radius of curvature. 
Since R., x, and y all have the dimensions of length, 
d*y\" | ay 
R.=|—>| Al— |]. 21 
c | dx2 | [= | ( ) 
But R, cannot depend on the coordinate system, so 
R ax - a 22 


Combining (21) and (22) gives the following: 
d*x 4d? d 
val (23) 
dy dx dx 
We are now essentially at (12) and can use the same line of reasoning to complete 
the proof. Thus, from our physical understanding of the radius of curvature, the 
mathematical identity is proved even more directly and elegantly. 


One may inquire whether the derivative identities may be proved by dimensional 
analysis for n > 2. The answer is yes, almost. The functional form is always 
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correctly given, but some of the dimensionless constants remain undetermined. As 
an example, we may proceed as above for the n = 3 case, using the results already 
obtained for smaller n. In this case we find 


d*x dy \~* dy C dy \~°{ dy ? 
a [ne —— + — _ 
dy? (>| dx? a dx? }’ 


where C is an undetermined constant. Dimensional analysis takes us no further. To 
determine C, we would have to use a test function (such as y = x7! or y = e”), 
which yields C = 3. For larger n, there are progressively more undetermined 
constants, and the method becomes impractically cumbersome. 


Conclusions. It is seen that dimensional reasoning alone is sufficient to prove the 
identity 
d*x dy \~? d*y 
dy? dx dx? 
This proof does not depend upon taking derivatives. Instead, it stresses insight into 
the underlying physical basis of the identity (involving the radius of curvature). 
Such a method of proof is seen to be elegant and is useful for developing physical 


and mathematical intuition rather than mechanical proficiency. 


Acknowledgments. The author wishes to acknowledge the pleasure and utility of many discussions 
with Professor Hans W. Liepmann, one of the undisputed masters of dimensional analysis. 
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Reconsidering Area Approximations 


GEORGE P. RICHARDSON 
The Rockefeller College of Public Affairs and Policy, State University of New York at Albany, 
Albany, NY 12222 


A recent survey of calculus texts for adoption revealed that all contained 
presentations of the familiar method of approximating a definite integral by 
summing the areas of n trapezoids of equal width h, 

"AC X1) + f(x; 
ge PLGed +I), 
i=1 2 
Only one text contained any mention of the simpler approximation obtained by 
summing the areas of rectangles computed at midpoints, 


< Xj;_1 + X; 
i=l 
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and there the midpoint-rectangle method was quickly dismissed in an example. The 
overwhelming tendency to ignore the midpoint-rectangle method in modern texts is 
unfortunate, and somewhat puzzling, because it is conceptually simpler than the 
trapezoid approximation and is actually more accurate. A good case can be made 
for the claim that the midpoint-rectangle method is the first numerical approxima- 
tion to the definite integral that students should see. 

Although the greater accuracy of the midpoint-rectangle approximation is well 
known, it is nonetheless surprising to many, perhaps partly because we are so used 
to seeing the trapezoid method emphasized in introductory texts. Yet, the result is 
easy to see geometrically (see FIGURE 1). The comparison of the area approxima- 
tions is facilitated by noting that the area of the midpoint-rectangle is equal to the 
area of another trapezoid determined by the tangent to the graph of f at the 
midpoint. From the figure it is then evident that as long as f’’(x) is continuous and 
nonzero over [x,_;, X;], so that the direction of curvature does not change, the area 
representing the midpoint-rectangle error will always be less than the area repre- 
senting the trapezoid error. 


as at trapezoid error |-3::: 


a rectangle error = 


Xx, y m, x, x 


x; 
Fic. 1. Geometric comparison of the midpoint-rectangle and trapeziod approximations for I f(x)dx. 


Most calculus texts state without proof a theorem equivalent to the following. If f 
has a continuous second derivative on [a, b] then the error e; in approximating 


ff) dx 


using n trapezoids of width h = ° —a)/n satisfies 


a) 
IN 
® 


5 (b- a){_ max 7’(2))). 


axx<b 


The corresponding error bound e, for n rectangles, which is not commonly 
mentioned in calculus texts, is 


IN 


0<ep< 57h%(b~a)| max ll]. 


a<x<b 
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Proofs can be found in most numerical analysis texts, e.g. [1], [2], [3], [6, pp. 
361-400]. While the inequalities here do not support a precise conclusion, the 
denominators in these error bounds certainly hint that the midpoint-rectangle 
approximation is likely to be about twice as accurate as the trapezoid method. 

The following derivation of series expressions for the approximation errors of the 
two methods (stimulated by [4, pp. 80-—84]) is of interest because it is accessible to 
first-year calculus students, is apparently not well known, and gives more explicit 
statements of the approximation errors. 

Suppose f satisfies the hypotheses of Taylor's Theorem over [a, b]. Partition 
[a, b] into n subintervals [x,_,, x;] each of length h. Let m, represent the midpoint 
of [x,_,, x,], and for any x in [x,_,, x;] expand f(x) in a Taylor series about m,. 
On integrating over [x,_,, x,], the even powers in the resulting series cancel, leaving 


fm, )+- (1) 


9231 ”(m,;) + 


Summing © over the n subintervals yields the rectangle approximation and a series 
expression for its error: 


[ 100) &= L sm, )h + 


To derive the comparable result for the trapezoid rule, use the same Taylor series 
to express f(x,;_,) and f(x,;) in terms of f(m,) and its derivatives. Averaging 
f(x,;_,) and f(x,) and then solving for f(m,) produces 


(m,) = Le TOD pm) — Af) =) 


Substituting the right side of (3) into (1) in place of f(m,) and collecting like powers 
of h yields 


5 Es"(m) + 5 E%(m,) + . (2) 


a ae 


. fea) + fe), 2h 4° 
[ f(x) de= Sh ope m,) ~ ae f (Mm) — = 


which gives the following result for the trapezoid rule, compare to (2): 


b “ f(xi-1) + F(x) ; 
[ f(x) & = nn a ~Es"(m 


i=l ( 4) 
n 
a ~E1%(m, ) - 
From the series error terms in (2) and (4) it is again evident that the rectangle 
approximation is of the same order as the trapezoid approximation but is likely to 
be about twice as accurate. More precisely, the error terms show that both methods 
are exact for linear functions, while the midpoint-rectangle approximation has 
exactly half the error of the trapezoid approximation for quadratic and cubic 
polynomials. For functions with nonzero higher derivatives over [a, b], the smaller 
error inherent in the midpoint-rectangle approximation will theoretically become 
evident once h is sufficiently small. 
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There is both puzzle and promise in these results. The puzzle is that the 
midpoint-rectangle method is almost uniformly ignored in introductory calculus 
texts, and the trapezoid rule universally emphasized, although the midpoint rule is 
likely to be more accurate. Moreover, the midpoint-rectangle method is conceptually 
simpler and is closer to the fundamental notion of a Riemann sum. For building 
intuition about the definite integral, probably every student should experiment with 
a simple computer implementation of the method. 

Besides their pedagogical implications, the series expansions for the approxima- 
tion errors in (2) and (4) provide a further benefit. As suggested by FIGURE 1, the 
errors have opposite signs. Taking advantage of that fact, one can easily note that 
doubling (2) and adding it to (4) eliminates the terms involving h? [4, pp. 80-84]. 
Thus on dividing by 3 we obtain an approximation of the definite integral that is 
accurate to an error proportional to h°: 


[2) de = 2 ¥ [la s) + 4flm) + MIA se ELM) 


ai 5! 


= ¥ f(m,) = +» 


ae T! 4 
This substantial improvement is, of course, Simpson’s rule, and equations (2) and 
(4) provide an elegant derivation. (One introductory calculus-with-applications text 
has a nicely intuitive development along these lines without the detailed error terms 
[3, pp. 454-—457].) The same technique done with trapezoid approximations with n 
and 2n trapezoids also yields Simpson’s rule and can be efficiently iterated in 
Romberg’s method [5] to produce much more accurate, higher-order approxima- 
tions. 
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EDITORS’ NOTE: We have received several letters concerning F. Brauer’s article [1], which presented a 
proof of a version of Taylor’s theorem. First, James Smith (San Francisco State Univ.) pointed out that 
the same approach appeared in a 1950 MONTHLY article by C. L. Seebeck [2]. Moreover, Seebeck 
obtained the Lagrange form of the remainder. Sherman Stein (Univ. of Calif., Davis) observed that 
essentially this approach appeared in the second edition of his calculus text [3]. However, the third and 
fourth editions gave Taylor’s theorem with the integral form of the remainder, to reinforce integration by 
parts and the Fundamental Theorem of Calculus, and because the integral form is stronger than the 
Lagrange form. Stein adds, “Although the formulas that appear in the statement and proof of the 
integral version are messier than in the Lagrange version, I believe that the concepts are simpler, more 
natural, and more important.” Finally, Peter Fowler (Calif. State Univ., Hayward) and Rudolf Vyborny 
(Univ. of Queensland, Australia) pointed out that Brauer’s version should have explicitly stated the 
assumption of the continuity of the (n + 1)st derivative of f at x. For more on this assumption see the 
following article by S. D. Chatterji. 
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A Frequent Oversight Concerning the Integrability of Derivatives 


S. D. CHATTERJI 
Dépt. de Math., Ecole Polytechnique Fedérale de Lausanne, CH-1015 Lausanne, Switzerland 


In a recent article in this MONTHLY [1], F. Brauer proposed an elementary 
presentation of the following form of Taylor’s theorem: “Suppose that for some 
integer n > O the functions f, f’,..., {© are continuous on [a, b] and that f("t» 
exists and satisfies | f("*»(t)| < M on (a, b). Then on [a, b], 


f(x) = fla) + f(a)(x — a) + 


- £(a) 


—— (x= a)" +R, (x) 


with 


The proof is based on the fact that if g: [a,b] ~ R is differentiable and g’ is 
bounded on [a, b] then 


['s'(x) de = g(b) - g(a). (1) 


a 
However, the validity of (1) cannot be considered to be an entirely standard 
elementary fact if g’ is not assumed to be continuous or at least Riemann integrable 
on [a, b]. The possibility that g’ need not be Riemann integrable seems to have 
been overlooked. 
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ELEMENTARY PROBLEMS 


E 3282. Proposed by D. M. Milosevic, Pranjani, Yugoslavia. 


Given a triangle ABC, suppose that w,,w,,w, are the lengths of the angle 
bisectors, s is the semiperimeter, r is the radius of the incircle, and R is the radius 
of the circumcircle. Prove that 


wet+wetw?<s?—r(R/2-71), 


with equality if and only if the triangle is equilateral. 


E 3283. Proposed by Orrin Frink*, Pennsylvania State University. 


Every simple closed polygon in the plane has three centroids, namely the centroid 
of its vertex set, the centroid of its boundary, and the centroid of its interior. In 
general all three are distinct. 

(a) In the case of a triangle show that these three centroids coincide if and only if 
the triangle is equilateral. 

(b) Which of the three centroids are affine invariants? 


E 3284. Proposed by Paul Erdos, Hungarian Academy of Sciences, Budapest. 


Determine the minimum number of edges in a triangle-free graph on 2 vertices 
whose complement contains no complete graph on n vertices. 


*Professor Frink died on March 4, 1988. 


762 


1988] PROBLEMS AND SOLUTIONS 763 


E 3285. Proposed by W. K. Nicholson, University of Calgary. 


Let R be a simple ring with unity element, i.e., a ring with unity element having 
no two-sided ideals other than itself and {0}. Let G be a multiplicative subgroup of 
the set of invertible elements of R. 

(a) Under the assumption that either x € G or 1 — x © G for every x € R, show 
that R is a division ring. 

(b) If, in addition to the assumption of (a), we assume that —1 © G, show that 
G = R \ {0}. 


E 3286. Proposed by William C. Waterhouse, Pennsylvania State University. 


Suppose A, B,C, D are n by n matrices and C’, D’ denote the transposes of 
C, D. In a comment on problem 6057 [1987, 1020], M. J. Pelling remarked that the 
condition 


CD’ + DC’ =0 (*) 
implies 
det( 4 4 = det( AD’ + BC’) (* *) 


if D is nonsingular, but gave an example with singular D satisfying (*) in which 
one side of (* *) is 1 and the other side is —1. Prove that in any case (*) implies 


A B 


2 
_ , 1\2 
C By = det( AD’ + BC’)’. 


det( 

E 3287. Proposed by David M. Bloom, Brooklyn College. 

It is easy to see that for each x in (1, e) there is a unique y = g(x) in (e, oo) such 
that x” = y*, or equivalently (log y)/y = (log x)/x. For example, g(2) = 4. Prove 
that if h(x) = x8, then h is a decreasing function on (1, e). 

E 3288. Proposed by John J. Cade, Pikeville College, Pikeville, KY. 

It is well known that if a, m, and n are positive integers with a > 1, then 

(a"™—1,a"—1) =a” — 1, 


where (m,n) is the greatest common divisor of m and n. Determine 
(a™ — 1, a" + 1) and (a” +1, a” + 1). 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Angle Bisectors 


E 2966 [1982, 594]. Proposed by P. J. Giblin, University of Liverpool, U.K. 


A, B, P,, P,, P3 are distinct points in the plane. P;P,A, P;P,;B are proper trian- 
gles, i.e., no two of P,, P,, P,; are collinear with A or with B. The anticlockwise 
angles from AP, to AP,, AP, to AP;, BP, to BP,, BP, to BP, are 9,, 9, >), $9. If 
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a, = AP,, b; = BP, and if the relations 
sin6, a, sin@, a, ~— sin(@,—6,) a, 


sing, b, sing, b, _—sin(¢, — %) b, 
hold, show that the angle AP,B has the same pair of bisectors as one of the angles of 
the triangle P,P,P,;. (Possibly the internal bisector of one angle is the external 
bisector of the other.) 


No complete solution was received. A solution is sketched on pp. 182-183 of the 
paper, “Source genericity of caustics in the plane,” by J. W. Bruce, P. J. Giblin, and 
C. G. Gibson, Quart. J. Math. Oxford (2) 33 (1982) 169-190. 

L. Kuipers (Switzerland) solved the altered problem in which the first of the three 
ratios assumed to be equal is replaced by its reciprocal (but in which the conclusion 
is unchanged). In the above article Bruce, Giblin, and Gibson obtained not only the 
assertion of the problem but also its converse; in addition they proved that in fact 
all three ratios must be +1 (and so respectively equal to their reciprocals). Thus 
Kuipers’ argument is consistent with the assertion of the problem but does not 
establish it as such. 


Matching Distances to Vertices 


E 3135 [1986, 215]. Proposed by Hiiseyin Demir, Middle East Technical Univer- 
sity, Ankara, Turkey. 


For a scalene triangle ABC inscribed in a circle, prove that there is a point D on 
the circle whose distance from the opposite vertex is the sum of its distances from 
the other two vertices, and construct D with ruler and compass. 


Solution I by J. Leech, University of Stirling, Scotland. Suppose BC > AC > AB. 
Perturb BAC into an isosceles triangle BXY by determining X on the ray BA and Y 
on BC such that BX = BY = AC. The circumcircle of BXY meets the circumcircle 
of BAC at the required point D. To prove D has the required property, extend AD 
to a point Z such that DZ = DC. Then triangle DZC is similar to triangle BXY, 
because the angles at D and B are both supplementary to angle ADC. Conse- 
quently, triangle DBY is congruent to triangle ZAC, since AC = BY by construc- 
tion, the angles at A and B are equal and angle BDY = angle BXY = angle DZC. 
Hence DB = ZA = DA + DC. 


Solution II by P. Tzermias (student), University of Patras, Greece. Let a, b,c be 
the lengths of the sides opposite A, B,C, and let x, y,z be the lengths of 
DA, DB, DC. We seek point D such that y= x +z. By Ptolemy’s Theorem, 
ax + cz = by. Substituting for y yields x/z = (c — b)/(b — a). Thus, D lies on 
the “Circle of Apollonius” determined by A and C using the ratio (c — b)/(b — a). 
This circle has a standard construction (see N. Altshiller-Court, College Geometry, 
1952, p. 15). 


Editorial comments. Since c — b and b — a must have the same sign, D must lie 
on the arc cut by the side of intermediate length. Consequently, if ABC is isosceles, 
then D can only be the vertex common to the two equal sides. On the other hand, if 
ABC is equilateral, then D can be any point on the circumference; Leech’s two 
circles then coincide. 
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Several solvers noted that the existence of D follows from the Intermediate Value 
Theorem. If D on the arc opposite B is close to the shortest side of ABC, then its 
distance to B is less than DA + DC, but if D is close to the longest side, then 
DB > DA + DC. 

Other solutions independent of Ptolemy’s Theorem were submitted by J. Dou 
(Spain), L. Kuipers (Switzerland), and by P. L. Hon (Hong Kong). 

E. Morgantini (Italy) submitted a paper entitled “Una Quartica Bicircolare Della 
Geometria Del Triangolo” making reference to this problem. 

In addition to the solvers mentioned above, correct solutions were received from S. Arslanagi¢ 
(Yugoslavia), A. Bager, H. Eves, J. Fukuta (Japan), H. Kappus (W. Germany), O. P. Lossers (Nether- 


lands), J. P. Robertson, J. S. Robertson, V. Schindler (E. Germany), R. A. Simon (Chile), B. A. Troesch, 
M. Vowe (Switzerland), and the proposer. 


Large Discs in Convex Unions 


E 3139 [1986, 216]. Proposed by Andrew Vince, University of Florida, Gainesville. 


Let G,,..., Gy be closed plane convex sets, each of diameter less than or equal 
to 1 and such that U?__G, is convex for all (m,n) with 1 <m<n<N. Prove or 
disprove: the union of the G, can contain no disc of diameter greater than y3. 


Solution by J. Pach, Hungarian Academy of Sciences, Budapest, and C. A. 
Rogers, University College, London. The assertion of the problem is false; discs of 
arbitrarily large diameter are possible. 

In 1983 we gave the following Proposition (cf. [1]): 


PROPOSITION. For every natural number n, there exists a sequence C{”, Ci”, 
..., CM of compact convex sets satisfying: 

(1) diam C™ <1 forl<r<N; 

(2) Us_.C™ is convex forl<r<s<N; 
and 

(3) UX ,C{™ contains a disc of radius n. 


We did not prove this result explicitly but asserted that it could be established 
using ideas similar to those of §2 of our paper. Since §2 is rather complicated, it 
seems worth giving a fairly simple proof of this proposition. 

Consider the four parabolic arcs defined by 


y = p(x), —2n<x<2hn, 

for j = 1,2,3,4, where 

p(x) = 4n? +1 — x’, 

p(x) = 4n? — x?, 

p? (x) = —4n? + x?, 

p(x) = —4n*-14 x’. 
We introduce the sequences 

PL) = (-2n+k, pY(-2n+k)), O<k<4n, 


of points on these arcs, 1 <j < 4. We define a sequence D,, D,,..., Dy,,, of 
convex domains as follows. 
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D, is the trapezium with vertices P?, P, PO, P. For 2 <i < 2n, D, is the 
double trapezium PS? ,, P{),, PO ,, PH ,, P,., PO, Finally D,,,, is the 
trapezium Pf? ,, P{&), PLP, PE,. 

For 2n+2<i<4n+1, D, is the region bounded by the line segments 
PO,» 9; PE.y_2, and P2.4_>,, P&P. 4-2; the arc of the first parabola joining 
Peo, 5, and P£)., ,,, and the arc of the third parabola joining PS,,_,, and 

) 


See the figures showing these sets in the cases n = 1 and n = 2. 


Pf) 


Ps 


The case n = 1. The convex sets D,, D,, D;, D,, D;. The sets D,, D,, D; have been emphasized. The 
vertical scale has been decreased by a factor 2. 


PD 


Pgh 


The case n = 2. The convex sets D,, D,,..., Dy. The sets D,, D,,..., Ds have been emphasized. The 
vertical scale has been reduced by a factor 2. 
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Note that the sides of the trapezia and double trapezia that are not parallel to the 
y-axis are tangents to the second parabola at points of the form P{?,, or tangents to 
the third parabola at points of the form P£?,,. 

It is easy to verify that the union 

Ss 
UD, 
i=r 
is convex for 1 <r <s <4n+ 1. Define further sets D,,,5, Djn43,-- 
by taking 


-> Dansiyn 


Dansyrre 1<Si<4nt+1, 1<r<N, 


to be the set obtained by translating the set D, by distance r in the direction of the 
positive y-axis. It is easy to verify that 


AY 
UD, 
i=r 
is convex for 1 <r<s < (4n + 1)N. Further U@* PD, contains the rectangle 
—2n<x<2n, O<y<N. 


Let C,,1 <i < (4n + IN, be obtained from D, by dividing all the y coordinates 
by N/(4n). Then each partial union 


; . 
Uc, l<r<s<(4n4+1)N, 
i=r 
is convex, and the, union of the (4n + 1)N sets contains the square 
-2n<x<2n, O<y<4n, 


and so contains a circular disc of radius 2n. Further, if ¢ > 0, is given, and N is 
taken to be sufficiently large, each set C, has diameter less than 2(1 + «). The 
required result follows by a change of scale. 


1. J. Pach and C. A. Rogers, Partly convex Peano curves, Bull. London Math. Soc., 15 (1983) 321-328. 


_ Editorial Comment. No solutions were received before the customary deadline. 
However, G. B. Purdy called our attention to the Pach—Rogers paper [1] and 
suggested that we invite Pach and Rogers to prepare a detailed proof of their 
proposition. We are grateful to Purdy for this suggestion and to Pach and Rogers 
for accepting our invitation. 


Another Geometric Inequality 


E 3146* [1986, 299]. Proposed by Walther Janous, Ursulinengymnasium, In- 
nsbruck, Austria. 


Prove or disprove the following statement: If a, b,c are the sides of a triangle 
and s = (a+b+c)/2, then 


2s{¥s-—a+t+ys—b+ys—c}< 3{ [be(s —a) + yca(s — b) + yab(s —c) }. 


Solution by Magda P. Nakassis, Gaithersburg, MD. The statement is correct, and 
the two sides are equal iff a = b =\c, even when degenerate triangles are consid- 
ered. 
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If s = 0 there is nothing to prove. If s > 0, then there are angles u, v, w between 
0 and 7/2 such that cos*u = (s — a)/s,cos*v = (s — b)/s, cos*w = (s — c)/s. 
These satisfy sin?u = a/s, sin’v = b/s, sin’w = c/s, and cos?u + cos*v + cos?w = 
1. Upon division by s*/?, the desired inequality now becomes 


2(cos u + cosv + cosw) < 3(cos usin v sin w + sin ucos v sin w + sin u sin v cos Ww) 


for three angles between 0 and 7/2 whose squared cosines sum to 1. 

We rewrite the right side as 3(cos u cos v cos w — cos(u + v + w)). Because the 
squared cosines sum to 1, we may assume cosw < y1/3 . For a fixed w in this 
range, cos(u + v + w) attains a maximum value f(w), which will help put a lower 
bound on the right side. We compute f(w). Since 0 <u+vu+w < 32/2, for a 
fixed w the maximum of cos(u + v + w) occurs at a local extremum of u + v + w 
or on the boundary. 

For the boundary points, one of u or v reaches 0 or 7/2. If either reaches 0, then 
by the constraint w and the other must equal 7/2, and if either reaches 7/2, then 
the other must equal 7/2 — w. In either case u + v + w = a and cos(u + v + w) is 
minimized. 

Using a Lagrange multiplier for the constraint cos?u + cos’v = 1 — cos*w, an 
interior extremum of u+vu+w must satisfy 1 — 2A cosusinu = 0 = 
1 — 2Acos vsinv, or sin2u = sin2v. With u, v € [0, 7/2], this implies u = v or 
u+v=q/2. If the latter, then w must be 7/2 and u+ v0 + w= 17, which again 
minimizes cos(u + v + w). Hence u =v for the only extremum that may have 
cos(u+ou+w)> —1. 

We now want the value of f(w) = cos(2u + w), where 2cos*u = 1 — cos?w. 
Since cos2u = 2cos?u — 1, we can write this in terms of z = cosw as f(w) = 
—z>—(1 — z7)yl1 + z?. Note that this value is negative, but that when 0 < z < 1, 
it exceeds —1. Hence when z > 0 this is the unique point where the maximum is 
attained. 

Having replaced cos(u + v + w) by f(w), we can substitute x = cosu, y= 
cos v, z = cos w. Now it suffices to prove that the function h = 3(xyz + g(z)) — 
2(x + y + z) 1S non-negative for each fixed z with z< y1/3 , where g(z) = 
—f(w) =z? + (1 — z”)y1 +z”, and we have the constraints x? + y* + z7=1 
and x, y,z > 0. 

Again we use Lagrange multipliers to minimize h over x, y for each fixed value 
of z. An internal extremum must satisfy 3yz — 2 — 2Ax = 0 = 3xz -2-—2)y. 
This yields x = y or 3z = —2A. In the latter case, substitution yields 3z(x + y) = 2. 
However, since x” + y” is fixed, x + y is maximized when x = y, and hence we 
have 3z(x + y) < 3z(2\(1 — z7)/2). This is strictly increasing when 0 <z 
< y1/2, and it reaches 2 at z = y1/3 . Hence when z < y1/3 this case lies in the 
domain only if x = y = z, and we may assume x = y for the internal extremum. 

When z is fixed, at the boundary points one of x or y is 0 and the other is 
¥1 —z?. At such points h reduces to h(z) = 3z? + 3(1 — z7)¥1 4+ 2? — 2z- 
2v1 — z?. 

Note that h(0) = 1 and h(y1/3 y= — 2y2 1/3 , which are positive. In fact, 
h(z) is strictly decreasing and hence always positive in this range. Differentiating 
and solving for roots (by squaring both sides) is rather messy; another way to show 
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positivity of h is via Taylor series expansion (for the square roots) and remainder 
analysis. 


This leaves only the internal extremum x = y = j(1 — z*)/2. Here h(z) = 32° 
+ 3(1 — 2?)(z/2 + V1 + 22) — 22 - 4/(1 — z?)/2. Now h(0) = 3-2/2 and 
h(y1/3 ) = 0. Again h is strictly decreasing for 0 < z < 1/3, which can be shown 
by similar means. 

Note that h = 0 requires x = y= z= y1/3,1¢e, a=b=c. 


Also solved by G. Turnwald (West Germany) and I. A. Sakmar (Turkey), and partially solved by P. 
Khajeh-Khalili. Three incorrect solutions were received. 


A Bernoulli Inequality 
E 3160 [1986, 565]. Proposed by L. I. Nicolaescu, University Al. I. Cuza, Iassy, 
Romania. 
Prove that 
(4n)!(4n — 2)! 
((2n)!}4 


where B, denotes the nth Bernoulli number. 


By, Ban—2 
B,, 


WV 
NS 


< , ON 


Solution I by A. A. Jagers, Technische Hogeschool Twente, Enschede, The 
Netherlands. Since |B,,| = 2(2n)!§(2n)/(27)*”,,where § denotes the Riemann zeta 
function {(2n) = L%_,k~ 2", the given inequality is equivalent to 7*§(4n)§(4n — 2) 
> {(2n)*. This inequality is valid for all n > 1, because {(4n — 2) > §(4n) > 1 and 
§(2n) < §(2) = 97/6 < vr. 


Solution IIT by Robert E. Shafer, Berkeley, CA. Let 
F(n) = 2f(4n)£(4n — 2)/&(2n)*. 


The desired inequality is equivalent to F(n) > 1 for integers n > 2. We prove more 
strongly that F(n + 1) > F(n) for n > 3/2 and F(2) = 360/49 = 7.3469... . The 
latter follows from £(4) = 74/90, §(6) = 7°/945, and §(8) = 7°/9450. Note that 
lim F(n) = 17 = 9.8696... . 


no 


To obtain the desired inequality for F note that for p > 2, n > 3/2 we have 
fi 1 1 1 | 1 1 1 1 1 1 1 
p?" ptt p" 2n+1 pent} p™” 2 


Using the expression for ¢ involving products and primes, we have 
§(4n — 2)/§(2n) = IT(1 — 1/p?")/(1 — 1/p*"~?) 
<IT1/(1 + 1/p"*") = TE(1 — 1/p*"**)/(1 — 1/p*"*?) 
= {(4n + 2)/§(2n + 1), . 
where the products run over all primes. Multiplying by [(4n), we have 
§(4n)§(4n — 2) — §(4n + 2)§(4n) 


¢(2n) < ~ <(Qn+1) for n = 3/2. 
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Applying this for n and n + 1/2 yields 
§(4n)§(4n — 2)/f(2n) < §(4n + 2)§(4n)/§(2n + 1) 
< §(4n + 4)f(4n + 2)/f(2n + 2). 


Finally, noting that ((2n) > ((2n + 2), we obtain the desired inequality F(n) = 
a¢(A4n)k(4n — 2)/(K(2n))* < 72&(4n + 4)k(4n + 2)/(K(2n + 2))* = F(n + 1). 


Editorial comment. The monotonicity of the function F defined by Shafer can be 
obtained more simply by logarithmic differentiation, since 


F(s) (2s) 84s) 84s = 2) 


F(s) > Ss) + (4s) * (4s —2) 


00 8 4 4 
= A = - aT > 0 
» (n) n2s 4s nts—2 


for any real s greater than 1. Here A(n) = log p if n is a power of the prime 
number p and A(n) = 0 otherwise. 
Several solvers noted that the problem is essentially solved by the inequality 


2(2n)! 2(2n)! 
an ~< |Banl < Din ina 
(277) (27)°"(1 — 2°-*") 
in M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions, National 


Bureau of Standards (1964), p. 805 eq. 23.1.15. This yields the desired inequality if 
a?(1 — 2'~2")4 > 1, which is true for n > 2. 


Also solved by N. Franceschine, C. Georghiou (Greece), R.-F. Gloden (Italy), C. Hill, O. P. Lossers 
(The Netherlands), C. Vandermee (The Netherlands), and the proposer. 


Inverse of a Combinatorial Matrix 


E 3161 [1986, 565]. Proposed by Ira Gessel, Brandeis University, Waltham, MA. 


Let A = (4;,); ;59 be the infinite lower triangular matrix defined by a,, 


~ (‘)(,Z,}. Let A~* = (b;;);, > 0- Show that 


» bia! = f((1 + y)x)/f(x), 
j= 


I 


0 


where 


~ x” e e 
f(x)= > —5 = 1h (2Vx ). (dy is one of the modified Bessel functions. ) 
Jao nt! 


Solution by David Callan, University of Bridgeport. Multiply both sides of the 
desired equality by f(x). To prove that the coefficients of x”y/ on both sides are the 


n 


same, we must show L7_, ") b;; = ("). We can write this in matrices as DB = C, 

where B = A7', C= (")In, joo and D = ((") In. iso Multiplying on both sides 
2 

by A, the required equality holds if and only if D = CA, ie, (") = r"_o(")a ji 
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Evaluating the sum using the Vandermonde convolution mr ) (: ) = ("3 , *), we have 


CMU) EGG) 


l = 


TMs 


as desired. 


Editorial comment. This problem was published with an unfortunate misprint, so 
that the matrix element a;, was given as ()( i instead of (s)(,2 ). For this 
i)\es I\i-s 
misprinted problem, 


» bay! = fx)/f(2), 
ay li! 


0 


i, 


as shown by J. M. Cohen, J. B. M. Melissen (The Netherlands), W. A. Newcomb, 
and G. S. Sylvester. 


The intended problem was also solved by The Chico Problem Solving Group and the proposer. 


A Series of Distances 


E 3179 [1986, 812]. Proposed by N. J. Lord, Tonbridge School, England. 


Given a real number x, let f(x) denote the distance from x to the nearest 
rational number with denominator n (not necessarily in its lowest form). For which 
values of x does the series U°_, f(x) converge? 


Solution by L. E. Mattics, University of South Alabama, Mobile, AL. If x is an 
integer the series converges to zero; we shall show that it diverges otherwise. 

If x is irrational, then the numbers nx — |nx| are uniformly distributed in the 
interval (0,1). (Cf. Hardy and Wright, Theory of Numbers, Th. 445.) Consequently, 
if A is the set of positive integers n such that 1/4 < nx — |nx| < 1/2 and A(m) is 
the number of integers n in A with n < m, then lim, _,, A(m)/m = 1/4. Hence 
the series bi, . ,1/(4n) diverges. But if n € A, then f(x) = x — [nx]/n > 1/(4n). 
Thus Uf,(x) diverges. 

Finally, suppose x = r/s, where r and s are relatively prime integers with s > 1, 
and let p be a prime not dividing s. Then f(x) > 1/(ps). Since X, primel /P 
diverges, Uf, (x) also diverges. 


Also solved by K. F. Andersen (Canada), E. H. Grossman, N. Felsinger, M. Gutkowski, O. P. Lossers 
(The Netherlands), W. A. Newcomb, and the proposer. 


A Triangular Inequality 
E 3180 [1986, 812]. Proposed by Murray S. Klamkin, University of Alberta, 
Canada. 
If A, B,C are angles of a triangle, prove that 


A B C A  &B —C 
cos + cos > + cos >1+ sin + sin + sin. 
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Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The 
Netherlands. We consider the function V defined by 


VA, B.C A B C [A B C 
, B,C) = cos— + cos— + cos— — |sin— + sin— + sin—]. 
) = cos 5 + Coss + cos5 sin> + sin> + sin> 


on the compact set S defined by A>0, B>0, C>0, and A+Bt+C=a.V 
attains an absolute maximum and minimum; we will show that the minimum is 1 
and the maximum is 3(/3 — 1)/2. 

Suppose C = 0, so that A + B = a. The identity cos(7/2 — x) = sin x implies 
V(A, B,0) = 1. Similarly V(O, B, C) = V(A,0, C) = 1, so V is identically 1 on the 
boundary of S, and we need only consider interior points. 

For fixed C # 0, we have 


A B A  &B C C 
V(A, B,C) = [cos + cos — [sin + sin | + cos = — sin | 
5 A-B A+B A+B C —C 
= ZCOS 4 (os — Sin 4 | + cos — sin 
A-B TrT—-C -a-C C —C 
= 2cos (cos 4 — sin 4 | + cos 5 sin 7 


Since |x| < 7/4 implies cos x > sin x, the last expression is minimized only when 
A = 0 or B = 0, and it is maximized when A = B. This shows that V is in fact 
minimized on the boundary A - B-: C = 0. As for the maximum, by symmetry V is 
also maximized for a fixed A # 0 when B = C. Hence V attains its maximum only 
at A = B= C = 7/3, where V = 3/2(¥3 — 1). Thus we have 1 < V(A, B,C) < 
3/2(¥3 — 1), with lower equality only at a degenerate triangle and upper equality 
only at an equilateral triangle. 


Editorial Comment. Several readers used Lagrange multipliers to find the ex- 
trema of V. Lagrange’s method seems particularly suitable for studying the extrema 
of such functions of the angles of a triangle. For example, it shows that the cosine 
sum and the sine sum separately satisfy the inequalities 


; A B C 3 
< — +4 — — < -y3, 
cos COs + cos <5 3 


e e B . C 3 
1< sin + sin + sin < >” 
with lower equality only for a degenerate triangle and upper equality only for an 
equilateral triangle. H. Guggenheimer remarks that the difference between the 
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maximum and minimum values of V is surprisingly small when compared to the 
corresponding differences in the inequalities just quoted. 


Solved also by the proposer and 32 others. 


A Convergent Sequence 
E 3197 [1987, 300]. Proposed by L. A. Rubel, University of Illinois, Urbana, IL. 


Suppose dj, a5, a3,... iS a Sequence of real numbers satisfying 0 < a, < 1 and 
O<a,<1 for n=2,3,.... Put S,=a,+a,+-:- +a, and 7,=S,+ S,+ 
- +S). Must U*%_,a,/T,, converge? 


Solution by L. E. Mattics, University of South Alabama. We show more generally 
that La,/T," converges as long as a > 1/2. If La, converges, then the problem is 
trivial, since 1/7, < 1/a, for all n, so henceforth we assume Ya, diverges. 

Since 0 < a, <1 and Lia, diverges, for each integer m > 0 there is a unique 
positive integer n(m) such that S,/,,) > m > Sicmy—1- The sequence n(m) is strictly 
increasing, so 


T 


n(m) 


> S 


n 


=S,+---4+5S 


n(m) 2 


ay + -+8§ 


n(m) 


> m(m + 1)/2. 


The sequence T, is also strictly increasing, so 


(S n(m+1)-1 — Sicm)—1) 
ya@-y y 4 <d nn ne 
n=n(2) T, m=2 k=n(m) 7 < m= Tm) 


Because the parenthesized quantity cannot exceed 2, this sum is bounded by 
2% _.[(m? + m)/2]~%, which converges for a > 1/2. 


Editorial Comment. Grahame Bennett remarked that setting a, = 1 for all n 
proves that the result above is best possible, by providing a divergent example when 
a< 5. 


Also solved by G. Bennett, E. Hertz, W. B. Johnson, O. P. Lossers (The Netherlands), A. Riese and 
G. Fricke, and the proposer. 


ADVANCED PROBLEMS 


6579. Proposed by D. K. Cohoon, Temple University, Philadelphia, PA. 


Suppose we define a curl operation on the infinitely differentiable functions from 
R' to C’ by the rule 


curl( 7’) = x {[ Ofiss — Oia | n | Ofise | A | Ofins vt) 


OX 4 OXi43 OXi42 OX 46 OXi44 OXi45 
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where e; is the unit vector in the positive direction on the ith coordinate axis, 
f=f (x) = fie, + he, + es + he, + fees + es + fe, isa C® function from R’ 
to C’, and, for all positive integers j in {1,2,3,4,5,6,7}, x,,,7= x, and f,,,= fy. 
Show that every infinitely differentiable function from R’ to C’ is a curl plus a 
gradient. 


6580. Proposed by P. A. Batnik, University of Illinois at Urbana-Champaign. 


Prove that if n is an odd positive integer, there exist integers x,, x5, x3, x, such 
that 


n=x?4+x24x24 x3, x, > [Vn] -1. 


6581. Proposed by Donald E. Knuth and Ilan Vardi, Stanford University. 
Stirling’s approximation leads to the following expansion 
(2") = 4 b+ ot a ee +o[-}. 
n Van 8n 128n? 1024n? 32768n4 n> 
Prove that if this expansion is continued indefinitely, the coefficient of n~* will be 


2'~“km,, where m, is an integer. What is the exact power of 2 that divides the 
denominator of this coefficient? 


6582. Proposed by N. J. Fine, Deerfield Beach, FL. 
Prove that 


SOLUTIONS OF ADVANCED PROBLEMS 
Weak Contractions 


6532 [1986, 822]. Proposed by Victor Pambuccian, Bucharest, Romania. 


Prove or disprove the following “converse” of Edelstein’s theorem (see this 
MONTHLY, Problem S 8 [1979, 222; 1980, 487]): 

Let X bea set of power less than or equal to 2%°, and f: X — X a map such that 
each f” (n= 1,2,...) has a unique fixed point. Here f” means fofo-:- of 
(n times). Then there exists a metric d on X such that (X, d) is a compact metric 
Space and 

d( f(x), f(y)) <d(x,y) Wx,y EX, x#y. 


Note that a corresponding “converse” holds for Banach’s fixed point theorem 
(“contraction principle”) (see C. Bessaga, Collog. Math., 7 (1959) 41-43). 


Solution by the proposer. The “converse” is false. Let X = N, the set of positive 
integers, and define an integer valued function f by 


1 =] 
f(my= {Fy m>?. 
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Each iterate f”, n = 1,2, ..., has x = 1 as its unique fixed point. Any metric d on 
X that satisfies 


d( f(x), f(y)) < d(x, y), forall x # y, 
must clearly satisfy 


d(1,k)<d(2,k+1)<d(3,k+2)< ---, allk>2, 
and also 
d(1,2) < d(1,3) < d(1,4) < ---. 
Therefore, 
di,k)<d(1+i,k+i), alk >2,i>1 
and 


d(1,2)<d(1,k), allk>3. 
This implies that 
d(1,2) < d(i, j) 
for all i #7 such that {i, 7} # {1,2}. Let r = d(1,2). Then 
B(x) = {y eX: d(x, y)<r}={x} forallx eX. 


In other words, the “open ball” B,(x) must consist solely of its center point x for 
any such metric d. Hence (X,d) is a discrete topological space and so is not 
compact. 


Editorial Comment. Edelstein’s theorem asserts that if (X, d) is a compact metric 
space and f: X — X is a mapping such that 


d( f(x), f(y)) < d(x, y), allx,yeEX, x#y, 


then f has a unique fixed point. (Cf. M. Edelstein, J. London Math Soc., 37 (1962) 
74-79 or Victor Bryant, Metric Spaces, Cambridge, 1985, p. 69.) Since a compact 
metric space is a continuous image of the Cantor set, a converse would presumably 
involve a set of cardinality less than or equal to that of the continuum. Now 
C. Bessaga (Collog. Math. 7 (1959), 41-43) has proved a converse of the Banach 
fixed point theorem for complete metric spaces whose formulation bears the same 
relation to Banach’s theorem that the assertion of the problem bears to Edelstein’s. 
From this perspective the counterexample seems a bit surprising. Thomas Jager 
points out that we can obtain a “converse” for X finite. In this case there is an n 
such that 


XD f(X)D ++) DF" XK) = {p} 
where p is the fixed point. For x in X and x # p, let k(x) be that integer for which 
x E fk(X) — for yx), 
and set k( p) = n. Then the metric d defined by 
d(x, y) = 2n— k(x) — k(y) 


for x # y and d(x, x) = 0 makes X into a compact metric space on which f is a 
contraction. 
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Thomas Jager also constructed a countable counterexample. Ludvik Janos pro- 
vided an uncountable but quite simple counterexample with X the set of all real 
numbers and f(x) =x/2. The idea is to let x* maximize d(0, x), and then 
establish the contradiction 


d(0,2x*) < d(0, x*) = d(0, f(2x*)) < d(0,2x*). 
Borel Images 


6534 [1987, 81]. Proposed by F. S. Cater, Portland State University, Portland, 
Oregon. 


Let f and g be real-valued functions on the real line, R, such that if B is any 
Borel subset of R, then f(B) and g(B) are also Borel sets. For each number y, let 
f—'(y) be at most a finite set. 

(a) Prove that there exist Borel sets A,, A,, B,, B, such that 


A,UVA,= B,UB,=R, A, NA, = BLN BL, = @, 
f(A) =B, = g( By) = Ad. 
(b)* Is the hypothesis on f~! essential? 
Solution of (a) by the proposer. The problem requires 
f(R\ g(S)=RNS, 
where S = B,. Our strategy is to first create a sequence of sets S, for which 
f(R \ g(S,)) =RN Snti 
To do this, set S, = R \ f(R) and 
Sap =RNf(RN2(S,)), on 
Then take B, = S where 


WV 
o 


S= US... 
n=0 


Now we provide the details. Observe that Sp is a Borel set, and an easy induction on 
n shows that S,, is a Borel set for all n. Thus S is a Borel set and so is g(S). 
Now 


(R\ S,) N Sy = f(RN 8(Sp)) N(R Nf(R)) = 
so Sy C S|. It is clear that 
1=R\f(R \ g(Sp)) CRN f(RN g(S;,)) = S, 
and by induction we see that 
ni R \ f(R \ g(S,_1)) CR \ f(R \ g(S,)) = S41 
for alln > 1. Thus S,C S,C --- CS,C ---. But S =U,S,. So 
f(R \ g(S)) =f(R \ U,8(S,)) = fIN(R S 8(S,))I. (1) 
But for any y, f-1(y) is at most a finite set. It follows from this and g(S,) € g(S,.1) 
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that 
FIN \ 8(S,))] =A, (RS 8(S,)). (2) 
From (1) and (2) we obtain 
f(RN g(S)) =, f(R S 8(S,)). (3) 


But R\ S,,, =f(R \ g(S,)) and it follows that 
A, (RN 8(S,)) = OCR SN Sit) = R\U Spar = RNS, 
and 
N, (RN 8(S,)) =RNS. (4) 
From (3) and (4) we obtain 
f(RN g(S))=RNS. (5) 


Put A, = R \ g(S), A, = g(S), B, = R \ S, B, = S. The conclusion, then, fol- 
lows from (5). 
No solutions to (b) were received. The editor thanks Michael J. Dixon for some helpful comments. 


Absolute Integrability 


6535 [1987, 81]. Proposed by Brockway McMillan, Sedgwick, Maine. 


Let D=[0,00) X[0,1) and f= f(x, y) be a real-valued function absolutely 
integrable on bounded measurable subsets of D. Assume that 


lim IIL y) dxdy = 0 


k—>0o 


for every pair of ascending sequences A, CA, C -::, B, C B,C --: of bounded 
measurable sets such that 


UA, = [0,00) and UB, = [0,1). 


Must f be absolutely integrable over D? 


Solution by M. J. Pelling, University College, London, England. No. For n = 
1,2,... define 


. 2k 2k +1 
1/n ifn—-1l<x<nand > <y< an? 
_ _ O0<k <2” '-1, 
f= f(x, y) | 2k +1 2k +2 
—1/n ifn-1<x<nand —{— <y< an? 


0<k <2" !-1. 


Then 


[J \fideay = 0, 
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yet 
ff fdxdy>0 ask > oo. 
A,X B, 


The first statement is immediate, since the double integral of the modulus over D is 


[o @) [o @) 1 
dx dy = —, 
d IDs meioal” ” au n 


n=l 


For the second, we write each y in [0, 1) as a binary “decimal” y = 0.c,c,c; --: 
(use the terminating expansion in the ambiguous cases) and set 


1 c,=90 


env) = {4 


Then f(x, y) = «,,(y)/m for m—-1<x<™m and 


Cn = 1. 


n " En(y) 
= x, y)dx = ; 
g,(y) i f(x, y) XL - 
By orthogonality 
1 1 1 


J (Bru) = 80(9)) y- Cap msrp ve +O <2 


so by the Schwarz inequality 


[18n+09) — g,(y)|dy < s 


= 
Thus { g,} is a Cauchy sequence in the L, norm, and by completeness we obtain the 
existence of a real-valued L, function g on [0,1] such that 


1 
Jan») — g(y)|dy > 0 asn— o. 


We are now ready to study the restricted double integrals. For any bounded 


(measurable) set A,, 
1 
dx dy = dy | dx = 0, 
ID coat ” 7 y 


since the inner integral is always zero. Hence 


Dealt > Waal 


where B; is the complement in [0, 1] of B,. Therefore it suffices to show that for any 
e > 0 there is a 6 > O such that for all bounded A C [ 0, 00) and all S C [0,1] with 
m(S) < 6 we have 


< €. 


f f _ fae 


Assume that A and S are sets as above. Let 
I.,=m(AN[n-1,n]). 
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1/2 


< [ Ema) | r [J eat) | 


m=1 


If we let x, be the characteristic function of the set S, then the square of the second 
factor above is 


L (fen ox500) ‘y\ 


and, by Bessel’s inequality, this is at most 


[xs(v) & = m(S). 


Thus, whenever m(S) is small, so is 


/ i] fdx dy. 
AXS 
This completes the proof. 


Editorial Comment. Pelling’s solution is, of course, an application of Rademacher 
functions. For other nice things that can be done with them, see M. Kac, Statistical 
Independence in Probability, Analysis, and Number Theory, Math. Assn. of Amer. 
(Carus Monograph Number Twelve), 1959. The proposer’s original statement of the 
problem involved n-fold integrals, and he expected the problem to be answered with 
a counterexample. As for its origin, he informs us that it “is abstracted from work 
on statistical mechanics, where one often meets infinite series of which the nth term 
is [related to the sort of n-fold] multiple integral appearing in the statement of the 
problem.” 


Automorphisms of the Symmetric Group on Six Symbols 


6538 [1987, 195]. Proposed by Joseph Rotman, University of Illinois, Champaign- 
Urbana. 
Let ¢ € Aut(S,) have the following values on transpositions: 


(12) = (12)(36)(45) 

(13) = (16)(24)(35) 

(14) = (13)(25)(46) 

(15) = (15)(26)(34) 
Find $(16) explicitly. 
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Combined solution. Since is an automorphism, it must preserve conjugacy 
classes, and so $(16) is a product of three disjoint transpositions. If ¢(16) shares a 
transposition with (1) for some i with 2 <i <5, then it is easy to check that 
(16) commutes with ¢(1i). This is a contradiction since (16) and (17) do not 
commute. Hence (16) = (14)(23)(56). 


Editorial remark. In G. J. Janusz and J. J. Rotman, Outer Automorphisms of S,, 
this MONTHLY, 89 (1982) 407-410, the authors constructed an outer automorphism 
@ and computed the values ¢(1i) for 2 <i <5 (as in this problem) with no 
difficulty. However, in light of the solution to this problem, their computation of 
(16) is overcomplicated. 


Also solved by F. Rudolf Beyl, S. H. Cullinane, John Dalbec (student) and S. F. Barger (jointly), 
Jesis Ferrer (Spain), Steven M. Gagola, Jr., the late I. N. Herstein, A. A. Jagers (The Netherlands), Keith 
A. Kearnes, O. P. Lossers (The Netherlands), David E. Manes, Ondrej MatouS (Czechoslovakia), John 
Henry Steelman, Joseph Verret, William P. Wardlaw, and the proposer. 
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A History of Algebra. By B. L. van der Waerden, Springer-Verlag, 1985. iii +271 pp. 


DAvID J. WINTER 
Department of Mathematics, University of Michigan, Ann Arbor, MI 48107 


In A History of Algebra, Bartel Leenert van der Waerden covers the period from 
Muhammad ben Musa al-Khwarizmi to Emmy Noether, that is, the period from 
about the beginning of the 9th century to the first third or so of the 20th century. 

A History of Algebra is written by a mathematician for mathematicians. In it, van 
der Waerden emphasizes the development of the mathematics and the interrelation- 
ships among ideas coming from different mathematicians. He discusses what was or 
may have been the motivation for research in a particular direction, as well as other 
factors and influences. He shows what mathematics was discovered and what the 
work really was about by describing the mathematics and showing how it was done 
—sometimes giving the crucial ideas, sometimes sketching a proof or outlining a 
manuscript, sometimes including proofs themselves. 

The style is at the same time informal and precise. Although substantial mathe- 
matical prerequisites constrain the potential readership, A History of Algebra should 
be of interest to a fairly broad spectrum of readers from the mathematical commun- 
ity. 

For the working algebraist or the student of algebra, A History of Algebra should 
be particularly useful. Van der Waerden not only places very important parts of 
algebra in historical perspective, but also gives an excellent expository discussion of 
some of the mathematics itself. He covers many important topics at length and, 
throughout the book, gives sources for material covered elsewhere. Furthermore, he 
treats much material which algebra students normally do not see in their graduate 
training, material which is important in understanding how and why the mathe- 
matics was created but is not needed in the modern logical development of it. At the 
same time, such material can provide deep insight into the modern theory and thus 
both render the theory easier to understand and clarify its significance and potential 
future. 

One notable asset of A History of Algebra is the fact that the subject matter is not 
at all confined to algebra. The author is concerned not only with the algebra, but 
also with the mathematicians who discovered it and with their other related work, 
with the surrounding circumstances and with the relations among their various 
discoveries. It was therefore practical and necessary to include a wide variety of 
interesting topics from geometry, topology, analysis, physics, astronomy and other 
subjects, which are well integrated into the discussion. 

Van der Waerden’s choice of material, and his balance between the emphasis on 
history and emphasis on the mathematics, is reasonable. At the same time, algebra is 
a field about which much more can be said than can be put in any one book. 
Accordingly, there is a need for other books covering other important topics and 
other aspects of its history. 


781 
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A History of Algebra comprises three parts, divided according to subject. Part I: 
Algebraic Equations occupies half of the book. Part II: Group Theory and Part III: 
Algebras share the second half. Of course, this division is ignored when, for 
instance, groups are used to study equations. So, considerable attention is given in 
Part I to the group theory of Cauchy, Galois, and Jordan, that is, to group theory as 
it was before Kronecker, Cayley, and others developed the abstract notion group. 
For Cauchy, Galois, and Jordan, a finite group was a finite set of substitutions 
(permutations) closed under multiplication, and group theory was a powerful tool 
for studying equations and, for Jordan, transcendental functions. 

In an attempt to describe what is done in A History of Algebra, we look briefly at 
each of the three parts and sample some of the material. 

In Part I, van der Waerden traces the development of the theory of algebraic 
equations from Muhammad ben Musa al-Khwarizmi, in the first half of the 9th 
century, to Evariste Galois (1811-1832) and Camille Jordan (1838-1922). 

Muhammad ben Musa al-Khwarizmi worked with equations such as “A square, 
which is equal to forty things minus four squares,” which we represent in present 
day notation by x? = 40x — 4x’, and with algebraic operations such as al-jabr or 
al-mugabala (adding or subtracting equals to or from equals). According to the 
biographer Haji Khalfa, al-Khwarizmi was the first Islamic author to write on 
solving equations by performing such algebraic operations. 

Al-Khwarizmi was a member of the House of Wisdom in Bagdad, under the 
Caliph al-Mamum, who reigned in the period A.D. 813 to A.D. 833. His treatise 
Algebra, dedicated to the Caliph, energetically ignored Euclid’s Elements, even 
though it had been translated into Arabic by his colleague al-Hajjaj ibn Yusuf ibn 
Mater. Solomon Gandz, the editor of the Mishnat ha-Middot, an ancient Hebrew 
treatise on mensuration, perhaps written by Rabbi Nehemiah about A.D. 150 and 
being, perhaps, the primary source used by al-Khwarizmi, writes: 


On the contrary, in the preface to his Algebra al-Khwarizmi distinctly 
emphasizes his purpose of writing a popular treatise that, in contradiction to 
Greek theoretical mathematics, will serve the practical ends and needs of the 
people in their affairs of inheritance and legacies, in their lawsuits, in trade and 
commerce, in the surveying of lands and in the digging of canals. Hence, 
al-Khwarizmi appears to us not as a pupil of the Greeks but, quite to the 
contrary, as the antagonist of al-Hajjaj and the Greek school, as the 
representative of the native popular sciences. 


Al-Khwarizmi explains how all linear and quadratic equations can be reduced to 
one of the six forms 


ax* = bx, ax” = b, ax = b, ax* + bx =, 
ax* + c= bx, ax* = bx + ¢. 


He then gives rules for solving each kind, and illustrates them by worked examples. 
Van der Waerden then discusses the writings of al-Khwarizmi on computing 
areas, on the Jewish calendar, on legacies, on geography, on Hindu numerals and on 
astronomy. Finally, he considers the question of what sources may have been used 
by al-Khwarizmi. 
Van der Waerden discusses two other early Muslim mathematicians. One of 
these, Omar Khayyam, best known for his 600 or so poems in the Rubaiyat, gave 
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systematic methods for solving equations. For linear and quadratic equations, he 
followed the geometrical methods explained in Euclid’s Elements. He then showed 
how cubic equations can be solved by means of intersections of conics. Van der 
Waerden develops some of the surrounding circumstances and discusses in detail 
the methods for solving cubic equations. 

The author then goes on to discuss important work of some early Italian 
mathematicians. He starts with the many and varied contributions from 1200 to 
1225 of Leonardo da Pisa (also known as filio Bonnacci or Fibonacci), who was the 
son of Bonnacci, a secretary of the republic of Pisa. Later on, he gives the 
interesting history of the discovery of algebraic methods for solving cubic bi- 
quadratic equations discovered by Scipione del Ferro, Niccolo Tartaglia, Gerolamo 
Cardano, Lodovico Ferrari, and Rafael Bombelli. 

With improvements in notation by Francois Vieté (1540-1603) over the cumber- 
some notation used by early mathematicians, a decimal system created in 1585 by 
Simon Stevin, the restoration and improvement of the coordinate methods and 
theory of conic sections of Apollonius by Pierre de Fermat (1601-1665) and the 
coordinate systems of René Descartes (1596-1650), the stage was set for the more 
sophisticated algebra and theory of equations that was soon to follow. 

Van der Waerden traces these developments giving interesting details, such as 
Descartes’ solution, using his coordinate systems, of the problem of Apollonius of 
determining the “locus of three or four lines.” 

After devoting a chapter to Edward Waring, Alexandre-Theophile Vandermonde, 
Joseph Louis Lagrange, Gianfransesco Malfatti, Paolo Ruffini, Augustin Louis 
Cauchy, and Niels Henrik Abel, he discusses the work of Carl Friedrich Gauss on 
the cyclotomic equation x”-—1= 0 and outlines three of his proofs of the 
“Fundamental Theorem of Algebra.” 

The author concludes Part I with two nice chapters on Evariste Galois and 
Camille Jordan. 

Abel had already shown that the equation of the fifth degree cannot be solved by 
radicals. Galois, during the year before his tragic death in a duel on May 30, 1832 at 
the age of 20, worked out his famous correspondence between groups and fields and 
used it to establish criteria for the solvability of an equation by radicals in terms of 
groups. The night before the duel, he explained the fundamental ideas in a letter to 
Auguste Chevalier, together with a comment and a request: 


Je me suis souvent hasardé dans ma vie a avancer des propositions dont je n’ étais 
pas sir; mais tous ce que j’ai écrit la est depuis bientot un an dans ma téte, et il 
est trop de mon intéret de ne pas me tromper pour qu’on me soupconne d’ énoncer 
des théorémes dont je n’aurais pas la démonstration complete. 

Tu prieras publiquement Jacobi et Gauss de donner leur avis, non sur la veérité, 
mais sur l’importance des théoremes. 


Galois also introduced the important concept of primitive groups of substitutions 
and showed that solvable primitive groups are p groups. His discussion for those of 
order p* ends with the words “C’est ce que je vais rechercher.” 

It was Jordan (1838-1922) who later carried out this research, in part of his 667 
page Traité des substitutions et des équations algébriques. Among his many important 
contributions, Jordan completed and refined Galois’ theory of equations and 
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applied it to the study of transcendental functions. He also extended Galois’ theory 
of primitive substitution groups and classified the solvable ones. Using this classifi- 
cation, he went on to construct all solvable transitive groups of substitutions on d 
letters for any d. This he did by finding maximal solvable irreducible subgroups of 
certain linear groups, using recursive methods. 

In Part II, Van der Waerden concentrates on the theory of groups, noting that 


After the appearance of Jordan’s Traité in 1870, a fundamental change of 
character of the theory of groups took place. Before 1870, only two kinds of 
groups were considered, namely groups of substitutions (or permutations) and 
groups of geometrical transformations. After 1870, the abstract notion group 
was developed in several steps notably by Kronecker (1870), Cayley (1878), von 
Dyck (1882) and Weber (1882). 


He divides his discussion into four parts: Groups of Substitutions, Groups of 
Transformations, Abstract Groups, The Structure of Finite Groups. 

After briefly discussing early theorems on the symmetric group, two papers by E. 
Mathieu on multiply transitive groups of substitutions and the structure theorems of 
the Norwegian mathematician M. L. Sylow, van der Waerden discusses groups of 
transformations. Whereas the work of Euler, Rodrigues and Jordan on closed 
groups of rigid motions of 3-space was discussed in Part I, the author now takes up 
the work of Arthur Cayley and Felix Klein on non-Euclidean geometry, Klein’s 
classification of groups of fractional linear transformations and Sophus Lie’s work 
on continuous groups. 

Cayley develops a theory of distances having special cases leading to the 
Euclidean plane and to the elliptic plane. Klein then builds a model of the hyperbolic 
geometry of Lobatchewsky and Bolyai. In his second paper on this work, in 1873, 
Klein introduces the group generated by Euclidean displacements, similarity trans- 
formations and reflections. His fundamental idea is that each method of geometry is 
characterized by a transformation group. 

Much of the remainder of the theory of finite groups of the nineteenth century 
was due to Otto Hélder. Whereas Jordan knew that the indices in a composition 
series for a finite group are unique up to their order, Hélder defined the concept 
factor group and showed that the composition factors of a composition series are 
unique up to their order. 

One of the problems considered by Holder was concerned with determining the 
structure of a group G from that of a normal subgroup H and the quotient group 
G/H. Later on, methods for solving this problem were developed by Otto Schreier. 

In the last chapter of Part II, van der Waerden discusses the work of Sophus Lie, 
Friedrich Engel, Wilhelm Killing, and Elie Cartan culminating‘in the rich structure 
theory and classification theory of Lie groups and Lie algebras. 

In Part III, van der Waerden discusses the discovery of algebras, beginning with 
the complex numbers. Complex numbers were, perhaps, first used by Cardano, who 
experienced “mental torture” in using them to express a number such as 40 as a 
product of numbers such as 5 + y—15 and 5 — y—15. Twenty-seven years after 
the publication of Cardano’s Ars Magna (1545), they were used by Rafael Bombelli, 
who gave rules for calculating with them. 
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Van der Waerden then discusses Hamilton’s discovery of the quaternions (and 
his long and fruitless search earlier for triplets) and the discovery by John Graves in 
1843 of the octonions (and their rediscovery by Arthur Cayley in 1845). In 1853, 
Hamilton introduced the biquaternions 


at+bi -ct+di 
c+ di a— bil 


Twenty years later, William Kingdom Clifford introduced two more kinds of 
biquaternions, one of which he (and, later, Eduard Study) used to study the rigid 
motions of Euclidean space. 

In 1854, Cayley introduced the notion of group algebra. In 1862, Hermann 
Grassman introduced the Grassmann algebra. In 1878, Clifford introduced a 
variation of it, the first Clifford algebra, of 2” dimensions, which is the quaternion 
algebra if n is 2. He also introduced an important subalgebra of it, the second 
Clifford algebra, which R. Lifschitz used six years later to represent rotations in n 
dimensions. 

Van der Waerden next discusses the development of the structure theory of 
algebras, starting with the work of Benjamin Peirce, who discovered left, right and 
two-sided decompositions which are important for the later theory of Wedderburn. 

Later on, he discusses Theodor Molien (1861-1941), who showed that every 
finite dimensional simple algebra over C is a complete matrix algebra. Cartan also 
proved this theorem and showed furthermore that every finite dimensional algebra 
over C is a direct sum of a simple or semisimple subalgebra and a nilpotent ideal. 

Van der Waerden then briefly discusses the general theory of J. H. Maclagan 
Wedderburn, which generalizes the above results of Cartan on finite dimensional 
algebras over C to finite dimensional algebras over any field. He then briefly 
discusses work by Emil Artin and work by Emmy Noether. 

He then discusses the structure theory of Nathan Jacobson for rings without 
finiteness assumptions. 

Since the finite dimensional simple algebras are full matrix algebras over finite 
dimensional division algebras, to find them amounts to finding all finite dimen- 
sional division algebras D over fields F. Van der Waerden discusses how this is 
done for the following fields F: 

the field R 

finite fields 

p-adic fields 

algebraic number fields. 

If F is the real field, D is R or C or the quaternions, as proved by George 
Frobenius in 1878. If F is a finite field, then D is F, as proved by Wedderburn in 
1909. For F a p-adic field, the structure of D was determined by Helmut Hasse in 
work completed in 1933. Using this and a local-global theorem for full matrix rings, 
Hasse, Brauer and Noether determine the structure of normal finite dimensional 
division algebras over algebraic number fields. 

The remaining three chapters of the book are devoted to Group Characters, 
Representations of Finite Groups and Algebras and Representations of Lie Groups 
and Lie Algebras. 
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A First Course in Coding Theory. By Raymond Hill. Clarendon Press, Oxford, 1986, 
xii + 251 pp. 


RICHARD HAMMING 
Naval Postgraduate School, Monterey, CA 93940 


It is often said (and the author of the present book says it once again) that coding 
theory arose from information theory. On the other hand, the person generally 
accepted as the founder of information theory is Claude Shannon, and Shannon 
points out in his seminal first paper on the subject that his initial results in coding 
theory were found by a method due to the reviewer, who thought he was working in 
coding theory! There were, in fact, two people working in the same place (Bell 
Telephone Laboratories), at about the same time, even sharing an office for awhile, 
who were developing two separate theories. (This of course raises the question of 
whether it is the situation that brings forth the creators or the creators that make the 
situation. One is reminded of Pasteur’s remark that “luck favors the prepared 
mind,” which admits both that there is an element of luck and that it is the 
individual who finally accomplishes the task.) 

What was the motivation? Information theory arose from the desire to under- 
stand what the telephone system was doing in a fundamental sense. Coding theory, 
on the other hand, arose from the perceived need for reliable computing from 
unreliable parts, plus the need to signal through high levels of noise. Coding theory, 
therefore, arose in response to an important practical need, and the subject is still 
partially driven by practical considerations. This does not, however, explain the 
continued separation of the two subjects. That separation has persisted largely 
because the mathematics in the two fields seems to be somewhat different. Coding 
theory uses a great deal of finite algebra and combinatorics: Information theory 
tends to have much more classical analysis with such things as entropy functions 
and probability distributions. There are, of course, a few people who work in both 
fields, but the separation is quite noticeable. 

While coding theory arose in response to practical needs, the typical treatment, as 
in this book, presents coding theory as another branch of mathematics. Usually, 
only slight attention is given to practical applications (other than claims of useful- 
ness). Engineering realities are seldom discussed, and this book is no exception in 
this regard. 

Considered as a branch of mathematics, just how hard is coding theory? Many 
authors of books on coding theory state that for their book one needs only simple 
high school algebra; the reader does not even need calculus. This is true in a formal 
sense, but it requires a mathematical sophistication that is not typically achieved 
before the calculus course. Hill writes, ““The aim of this book is to provide an 
elementary treatment of the theory of error-correcting codes, assuming no more 
than high school mathematics and the ability to carry out matrix arithmetic... The 
first eight chapters comprise an introductory course which I have taught as part of 
second year undergraduate courses in discrete mathematics and in algebra. [This is 
less than half the book.] I have also used the whole as a master’s course taken by 
students whose first degree is not necessarily in mathematics.” 

In the book, the author occasionally refers to the classic book by MacWilliams 
and Sloane (The Theory of Error-Correcting Codes, North-Holland, 1977) for some 
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of the harder proofs. Even so, his book is not easy reading for the person who is not 
mathematically sophisticated. It is well known that the Elementary Problems in the 
Monthly are often much harder than the Advanced Problems: combinatorial 
arguments are often very much harder than even involved analytical proofs. That 
the teacher, having mastered the field of finite algebra and the corresponding ways 
of thinking, finds the material easy does not mean that the student, meeting it for 
the first time, finds it easy. I can only wonder at the difficulty of covering the first 
one hundred pages of the book in one term. While one may cover hundreds of pages 
in the typical calculus text during the first term, such texts have voluminous sections 
of worked examples and carefully honed textual material. The job here cannot be 
easy! The author does give numerous examples to illustrate the material, and he 
does proceed carefully. He also gives background material such as the necessary 
parts of the theory of finite fields. Yet, with all this the subject material is not 
simple to grasp for the first time. 

Given that the author views coding theory as a branch of mathematics, the book 
is excellent. Given that the field has many practical applications and is driven by 
them, there are many acknowledged omissions. (Huffman coding, for example, is 
only mentioned; the book is exclusively on block-coding.) 

It is hard, however, to write a book covering coding theory that is easy to read 
and at the same time covers the more important codes; the field is too vast and 
difficult. From the mere note in Shannon’s first paper, Golay understood a lot and 
wrote a classic one page paper that began the algebraic part of coding theory. It was 
a very difficult paper to read, and this tradition has persisted to this day. The field is 
difficult, but it is fair to say that the author of this book has made some further 
progress in simplifying the presentation of much of the material. For the calculus, a 
number of generations of teachers were needed to make the material simple enough 
for the average undergraduate to understand. In time, we will probably simplify the 
presentation of coding theory similarly. 

As a branch of mathematics, coding theory has a promising future. There are 
many unsolved problems, and this feature is the mark of an important field of 
research. As a branch of practical engineering, it will receive many new stimulating 
problems in the future. The combination makes it a “hot field” for research; this 
book is one avenue of approach for the beginner. 
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General, S$**(15-16), P, L**. 
the Unezpected. Ivar Ekeland. U of Chicago 
Pr, 1988, xiii + 146 pp, $19.95. (ISBN: 0-226- 
19989-4] A stimulating, selective account of a sin- 
gle mathematical idea—dynamical systems—traced 
from Kepler through Poincare to Arnold, Smale, 
and Thom. Using only words and diagrams, the 
author explains how deterministic mechanics can 
lead to random motion, and relates these two grand 
metaphors of science to ancient world views reflected 
by Homer. Translated by the author from an award- 


winning popular French monograph addressed to lay- 
men. LAS 


General, $**(13-16), P, L**. The Mathematical 
Tourist: Snapshots of Modern Mathematics. Ivars 
Peterson. WH Freeman, 1988, xv + 240 pp, $17.95. 
[ISBN: 0-7167-1953-3] A “Baedeker’s Guide” to the 
sights and insights of modern mathematics adapted 
from reports written by the author for Science News 
during the past six years. Landmarks like primes, 
manifolds, fractals, knots, tiles, and cellular au- 
tomata are linked by vivid, accessible exposition that 
conveys the ecstasy (without the agony) of many re- 
cent discoveries. A superb example of popular math- 
ematical writing. LAS 


Elementary, T(13: 1). Algebra for College Stu- 
dents. Margaret L. Lial, Charles D. Miller. Scott 
Foresman, 1988, 714 pp, $24.76. (ISBN: 0-673-18866- 
3] A substantial review of intermediate algebra pre- 
cedes traditional college algebra topics; focuses on 
functions and problem solving; uses two colors even 
in equations. Available supplements include video- 
tapes and audiotapes. JNC 


Mathematics Appreciation, §(13-14), L*. Dis- 
covering Mathematics: The Art of Investigation. 
A. Gardiner. Oxford U Pr, 1987, xiv + 206 pp, 
$19.95 (P); $35. [ISBN: 0-19-853265-2; 0-19-853282- 
2] Unique book designed to give students experience 
in mathematical discovery. Four short and two ex- 


Mathematics and 


tended investigations consist of a structured sequence 
of exercises interspersed with text. Topics from num- 
ber theory, two-person games, geometry. Includes 
solutions for exercises and further problems. Appro- 
priate for good high school students. KS 


Finite Mathematics, T(13: 1). Finite Mathemat- 
ics and Its Applications. Stanley J. Farlow, Gary 
M. Haggard. Random House, 1988, xi + 615 pp, 
$24. [ISBN: 0-394-35161-4] Covers linear equations, 
matrices, linear programming, elementary counting 
techniques, statistics, and graph theory. Applica- 
tions are abundant (mathematics of finance, prob- 
ability, etc.), and historical notes are interspersed 
throughout. Prerequisite is two years of high school 
algebra. LC 


Education, P, L*. Educating Scientists and Engi- 
neers, Grade School to Grad School. Dir: John H. 
Gibbons. Office of Technology Assessment (US Gov- 
ernment Printing Office, Washington, DC 20402), 
1988, vii + 128 pp, $6 (P). A detailed analysis of 
patterns and policy issues pertaining to the flow of 
prospective science and engineering students through 
the educational pipeline. Major conclusion: demo- 
graphic facts alone do not determine either the size 
or quality of the future science and engineering talent 
pool since there are proven ways to increase student 
entry into this pool, even as late as the sophomore 
year in college. Mathematics is generally aggregated 
with other sciences, so separate inferences about the 
mathematical pipeline are not easy to draw. LAS 


Education, P*, L*. The Mathematics Report 
Card: Are We Measuring Up? John A. Dossey, 
et al. Educational Testing Service, 1988, 143 pp, 
(P). [ISBN: 0-88685-072-X] Summary and analysis 
of the National Assessment of Educational Progress 
(NAEP), a survey of mathematical knowledge of 9, 
13, and 17 year olds conducted every four years. Ma- 
jor findings: Blacks and Hispanics have made ap- 
preciable gains, and average performance improved 
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slightly. However, most gains are in lower-order 
skills: only 6.4% of the 17 year olds could solve simple 
multi-step problems. Most students perceive mathe- 
matics as composed mainly of memorizing rules, and 
expect to have little use for mathematics in their 
lives. LAS 


Education, P, L. The Teacher of Mathematics: Is- 
sues for Today and Tomorrow. Mathematical Sci- 
ences Education Board, (National Research Coun- 
cil, 2101 Constitution Avenue NW, Washington, DC 
20418), 1987, v + 120 pp, (P). Papers presented 
at an October 1987 conference sponsored at UCLA 
by the Mathematical Sciences Education Board set- 
ting forth several related visions of the mathemat- 
ics teacher of tomorrow. Includes major papers by 
Thomas Cooney on teaching practice, and by Albert 
Shanker on visions of new schools. LAS 


Education, P*, L*. A Conspiracy of Good Inten- 
tions: America’s Teztbook Fiasco. Harriet Tyson- 
Bernstein. Council for Basic Education (725 15th 
St. NW, Washington, DC 20005), 1988, xii + 113 
pp, (P). A scathing indictment of how public pres- 
sure to increase test scores and official regulation of 
the content of textbooks has led unintentionally to 
“dumbing down” school texts in which much is men- 
tioned but little is taught. Concludes with a series 
of practical recommendations for how parents, policy 
makers, and educators can combat this tendency of 
bad texts to drive out good ones. LAS 


Combinatorics, P. Combinatorial Design Theory. 
Ed: Charles J. Colbourn, Rudolf Mathon. Annals 
of Disc. Math., V. 149. North-Holland (US Distr: 
Elsevier Science), 1987, xii + 470 pp, Dfl. 170.00 
(P). [ISBN: 0-444-70328-4] A collection of forty-one 
research papers ranging over the field: Latin squares, 
projective planes, Steiner systems, block designs, 
packings, coverings, graphs, etc., including attention 
to applications, computational tools, and construc- 
tion problems. SS 


Algebra, L. Reviews in Ring Theory 1980-84. 
AMS, 1986, xiii + 685 pp, $88 (P). [ISBN: 0-8218- 
0097-3] All reviews appearing in Mathematical Re- 
views from 1980 through 1984 with primary or sec- 
ondary classification 16 (Associate Rings and Alge- 
bras). Subsections arranged by their MR numbers. 
Includes author index. LCL 


Real Analysis, T(16-17), L. Measure and Inte- 
gral: Volume 1. John L. Kelley, T.P. Srinivasan. 
Grad. Textsin Math., V. 116. Springer-Verlag, 1988, 
x + 150 pp, $39.80. [ISBN: 0-387-96633-1] Intended 
for use as a text for students with no previous knowl- 
edge of measure theory or Lebesgue integration, but 
augmented at the end of each chapter by informal 
discussions (in their totality comprising one-third of 
the book) of extensions of the ideas into more sophis- 
ticated mathematics. Intended as the first of a two- 
volume set, the second to consist of exercises, prob- 
lems, and additional supplementary material. AWR 
Differential Equations, T(17: 2). Asymp- 
totic Ezpansions for Ordinary Differential Equa- 
tions. Wolfgang Wasow. Dover, 1987, ix + 374 
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pp, $8.95 (P). (ISBN: 0-486-65456-7] An unabridged 
Dover republication of the edition published by 
Robert E. Krieger in 1976 (1965 Interscience edi- 
tion, TR, January 1967; 1976 edition TR, August- 
September 1977). PS 


Partial Differential Equations, T*(16-17). So- 
luttons of Partial Differential Equations. Dean G. 
Duffy. TAB Books, 1986, x + 542 pp, $25.95. [ISBN: 
0-8306-0412-X] A very nice book for a senior level 
course in partial differential equations. Starts with 
Sturm-Liouville problem, then covers Fourier series 
and integrals, Laplace transforms, convolution, and 
then applies these to first-order equations. Does lots 
of practical equations, and is method-oriented. MZ 


Partial Differential Equations, P. Computation 
of Singular Solutions in Elliptic Problems and Elas- 
ticity. D. Leguillon, E. Sanchez-Palencia. Wiley, 
1987, 200 pp, $42.95 (P). [ISBN: 0-471-91757-5] 
Presents general theory plus two new methods for 
computing the eigenvalues and eigenvectors of sin- 
gularities. Concerns mostly homogeneous equations 
and boundary conditions. Many examples. Applica- 
tions to engineering include boundary layers in com- 
posite plates and slightly perturbed corners. Dis- 
cusses open problems. DFA 


Operator Theory, P. Proceedings of the Confer- 
ence Ergodic Theory and Related Topics IJ. Ed: 
Horst Michel. Teubner-Texte zur Math., B. 94. BG 
Teubner, 1987, 216 pp, 22,50M (P). (ISBN: 3-322- 
00434-1] Report of a conference held in 1986, this 
being a follow-up to a 1981 conference on the same 
theme. AWR 


Functional Analysis, T*(18: 2), L. Introduction 
to Functional Analysis, Second Edition. Angus E. 
Taylor, David C. Lay. Robert E Krieger, 1986, xi 
+ 467 pp, $42.50. [ISBN: 0-89874-951-4] A very 
readable text on the basics of functional analysis. It 
starts out by studying n-dimensional spaces, Zorn’s 
lemma, and Hamel bases; it then treats normed lin- 
ear spaces from the perspective of topological lin- 
ear spaces, with Hilbert spaces and Banach spaces 
as special cases. In the second half it covers spec- 
tral analysis of linear operators in general and over 
Hilbert space. (1980 Second Edition, TR, December 
1980.) MZ 


Functional Analysis, T(16-17: 1, 2), L*. Dis- 
tribution Theory and Transform Analysis: An Intro- 
duction to Generalized Functions, with Applications. 
A.H. Zemanian. Dover, 1987, xii + 371 pp, $8.95 
(P). (ISBN: 0-486-65479-6] Republication of the 1965 
edition in McGraw-Hill International Series in Pure 
and Applied Mathematics. Introduction to distri- 
bution theory and generalized Fourier and Laplace 
transformations. Applications to integro-differential 
equations and difference equations. Prerequisite is 
advanced calculus including familiarity with the in- 
terchange of limiting processes. Numerous examples, 
problems, and references (to 1965). JK 

Algebraic Geometry, P. Algebraic Geometry, 
Bowdoin 1985. Ed: Spencer J. Bloch, et al. Proc. 
of Symp. in Pure Math., V. 46. AMS, 1985, $119 
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set (ISBN: 0-8218-1481-8]. Part 1, vi + 481 pp; Part 
2, vi + 513 pp. Fifty-one articles reporting recent 
progress on the basic questions of algebraic geome- 
try, many by leaders in the field. The three-week 
conference ranged from the algebraic (classification 
theorems, group actions, and algebraic cycles) to the 
geometric (vector bundles and Hodge theory). GG 


Geometry, $(16-17), P, L. Theory of Convez Bod- 
tes. T. Bonnesen, W. Fenchel. BCS Associates, 1987, 
ix + 172 pp, $23 (P). [ISBN: 0-914351-02-8] This 
classic book is always cited as a source of basic re- 
sults relating to convex bodies, but up to this time 
it has not been available in English. It would be an 
ideal addition to any library wanting an accessible, 
basic source on this topic. A classic! AWR 


Geometry, T(18: 2), S, P. Kleinian Groups. 
Bernard Maskit. Grund. der math. Wissenschaften, 
B. 287. Springer-Verlag, 1988, xiii + 326 pp, $77.50. 
[ISBN: 0-387-17746-9] Intended for a one-year text at 
the graduate level, this is an essentially self-contained 
introduction to Kleinian groups as the theory has 
grown out of the initial work of Ahlfors and Bers. 
Topics include discontinuous groups, isometries, ge- 
ometric and geometrically-finite groups, B-groups, 
and function groups. Exercises, bibliography. JS 


Differential Topology, P**. Collected Papers, 
Marston Morse, V. 1-6. Marston Morse. World Sci- 
entific, 1987, $322 set. [ISBN: 9971-978-94-6]  V. 
1, xlix + 532 pp; V. 2, vi + 598 pp; V. 9, vi + 639 
pp; V. 4, vi + 602 pp; V. 5, vi + 578 pp; V. 6, vi 
+ 559 pp. Seven volume collected works of Marston 
Morse. Contains over 50 papers concerning Morse 
theory. Morse theory is a study of the relationship 
between the homology of a manifold and the criti- 
cal points of its smooth functions. These volumes 
trace the development of Morse theory and its ap- 
plication to existence theorems for closed geodesics 
and to calculus of variations in general. Other top- 
ics covered include dynamics, minimal surface, topo- 
logical methods in a single complex variable, inte- 
gral representations, pseudoharmonic functions, and 
differential topology in papers written with collab- 
orators such as G.A. Hedlund, C.B. Tompkins, M. 
Heins, W. Trunsue, J. Jenkins, W. Hubsch, and S.S. 
Cairns. AM 


Optimization, P. Design and Analysis of Algo- 
rithms for Stochastic Integer Programming.  L. 
Stougie. CWI Tract, V. 37. Math Centrum, 1987, 
89 pp, Dfi. 14.10 (P). [ISBN: 90-6196-319-2] Treats 
hierarchical scheduling, vehicle, and location prob- 
lems; stochastic integer programming by dynamic 
programming. The style is terse, the notation at 
times forbidding. AWR 


Optimization, S(16-17), P. Classical Principles 
and Optimization Problems. B.S. Razumikhin. 
Math. & Its Applic. D Reidel (US Distr: Kluwer 
Academic), 1987, xiv + 513 pp, $129. [ISBN: 90- 
277-2605-1] Nature frequently comes to an equilib- 
rium position that is also an optimal solution to some 
problem. The underlying theme of this book is to 
develop optimizing algorithms by trying to imitate 
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nature’s process of finding equilibria, an approach 
largely untried, which has yielded some promising 
results. Results in this book deal with the numer- 
ical solution of constrained optimization problems 
in classic mechanics and thermodynamics. Note the 
price! AWR 


Dynamical Systems, P. Dynamics of Infinite Di- 
mensional Systems. Ed: Shui-Nee Chow, Jack K. 
Hale. NATO ASI Series F, V. 37. Springer-Verlag, 
1987, ix + 514 pp, $85. [ISBN: 0-387-18374-4] A rep- 
resentative sample of the presentations given at the 
1986 NATO Advanced Study Institute on Dynam- 
ics of Infinite Dimensional Systems. The conference 
was an attempt to transmit ideas among investiga- 
tors who are working with partial differential equa- 
tions and functional differential equations as dynami- 
cal systems on function spaces. Includes forty papers 
from diverse areas (classical differential equations to 
application for infectious disease modelling). LB-E 


Control Theory, T(16-17: 1). Functional Anal- 
ysts and Control Theory: Linear Systems. Stefan 
Rolewicz. Math. & Its Applic. D Reidel (US Distr: 
Kluwer Academic), 1986, xv + 524 pp, $118. (ISBN: 
90-277-2186-6] Functional analysis is discussed in a 
way which allows non-specialists to understand the 
applications which follow. The short course in func- 
tional analysis is followed by a development of the 
theory of general linear systems, application of that 
theory to ordinary and partial differential equations, 
and fundamentals of differential equations in Banach 
space. Examples come from physics and engineer- 
ing. LB-E 


Probability, T(18), P. Stochastic Modelling and 
Analysis: A Computational Approach. Henk C. Ti- 
jms. Ser. in Prob. & Math. Stat. Wiley, 1986, xii 
+ 418 pp, $36.95. [ISBN: 0-471-90911-4] The main 
concern of this text is the application of stochastic 
models to practical situations. Some of the applica- 
tions are renewal processes to inventory /production, 
Markov chains and operations research, and applica- 
tions to queueing theory. MZ 


Probability, S$(13). Probability Theory (First 
Steps). E.S. Wentzel. Transl: N. Deineko. MIR (US 
Distr: Imported Pub), 1986, 87 pp, $1.95 (P). (ISBN: 
0-8285-3196-X] Introduction to probability for non- 
mathematics majors. Includes probability as fre- 
quency, basic rules of probability, and random vari- 
ables. Examples indicate wide applications. Brief, 
and easy to read. MS 


Stochastic Processes, T(18: 1), P*. Introduc- 
tion to Stochastic Differential Equations. Thomas 
C. Gard. Pure & Appl. Math., V. 114. Marcel 
Dekker, 1988, xi + 234 pp, $65. (ISBN: 0-8247-7776- 
X] Basic account, requiring only introductory courses 
in real analysis, ordinary differential equations, and 
some knowledge of continuous time stochastic pro- 
cesses. Contains review of needed probability theory. 
Emphasis on role of Ito’s formula in stochastic anal- 
ysis. Applications, including population dynamics 
and sample path approximations. Exercises; refer- 
ence list. JK 
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Elementary Statistics, T(13: 1). Statistics and 
Data Analysis, An Introduction. Andrew F. Siegel. 
Wiley, 1988, xxv + 518 pp, $33. (ISBN: 0-471- 
87659-3] Introductory text with a multitude of ex- 
amples. Objectives clearly defined in each section; 
end-of-chapter summary, questions, and problems. 
Exploratory data analysis incorporated throughout. 
Real data sets used illustrating a wide range of ap- 
plications. MS 


Statistics, P. Statistical Decision Theory and Re- 
lated Topics IV. Ed: Shanti S. Gupta, James O. 
Berger. Springer-Verlag, 1988, $32 each. Volume 
1, xii + 418 pp (ISBN: 0-387-96661-7]; Volume 2, xvi 
+ 399 pp. [ISBN: 0-387-96662-5] 65 invited papers 
and discussion from a symposium and workshop held 
at Purdue University in June 1986. First volume de- 
voted primarily to conditioning and Bayesian ideas; 
second volume to aspects of decision theory. LAS 


Statistics, T(13: 1, 2), P. Statistical Methods for 
Business and Economics, Third Edition. Roger C. 
Pfaffenberger, James H. Patterson. Richard D Irwin, 
1987, xiv + 1246 pp, $38.95. (ISBN: 0-256-03664-0] 
Material presented in non-mathematically rigorous 
fashion. Extensive use of examples. An increased 
number of problems. Minitab used throughout. SAS 
also used for regression. Knowledge of calculus is 
limited to advanced sections. (Second Edition, TR, 
February 1982.) MS 


Statistics, P. The Teaching of Practical Statistics. 
C.W. Anderson, R.M. Loynes. Ser. in Prob. & 
Math. Stat. Wiley, 1987, xi + 199 pp, $52.95. 
[ISBN: 0-471-91572-6] Discussion of abilities for the 
ideal statistician; aims of statistics teaching; ways 
of teaching statistical practice. Describes use of se- 
quence of projects tailored to different backgrounds 
and needs. Wonderful coilection ef project sugges- 
tions. MS 


Statistics, S, P. Lecture Notes in Statistics-43: Ma- 
jorization and the Lorenz Order: A Brief Introduc- 
tion. Barry C. Arnold. Springer-Verlag, 1987, vi + 
122 pp, $16.30 (P). (ISBN: 0-387-96592-0] Let R+ 
be the positive orthant of Ry, and let (Zin, Zan, 
++) Zain] be the coordinates of x € Ri arranged 
in decreasing order. Hardy, Littlewood, and Polya 
(as well as others) have discussed a partial order- 
ing on Ri defined by writing x > y if and only 
if Yea Lin > yore Yin k = 1,2,...,n —1 and 
Diei Zin = Dei Yin. This ordering is the ma- 
jorization referred to in the title, and if interpreted in 
terms of income distribution in a population of n in- 
dividuals, Robin Hood type redistributions of wealth 
can be defined. These lead to the concept of Schur 
convexity, and our author is on his way to a nice in- 
troduction to a class of inequalities that is treated on 
a much more advanced level by Marshall and Olkin 
in Inequalities: Theory of Majorization and Its Ap- 
plications (TR, May 1980). AWR 


Statistics, T(14-15: 1, 2), P. Modern Mathemati- 
cal Statistics. Edward J. Dudewicz, Satya N. Mishra. 
Ser. in Prob. & Math. Stat. Wiley, 1988, xix + 838 
pp, $44.80. (ISBN: 0-471-81472-5] Could be used as 
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a probability text (Chapters 1-6), or as a probability 
and mathematics statistics text. Integrates modern 
and classical statistics results. Variety of examples. 
Fortran programs included; SAS emphasized. Hon- 
ors problems included. MS 


Statistics, P. Multivariate Statistical Modeling and 
Data Analysis. Ed: H. Bozdogan, A.K. Gupta. The- 
ory & Decision Library, Ser. B. D Reidel (US Distr: 
Kluwer Academic), 1986, ix + 189 pp, $54. (ISBN: 
90-277-2592-6] Proceedings of an advanced sympo- 
sium held in Virginia in May, 1986. FLW 


Programming, T(13). Using Turbo Basic. Fred- 
erick E. Mosher, David I. Schneider. Osborne 
McGraw-Hill, 1988, xix + 457 pp, $19.95 (P). (ISBN: 
0-07-881282-8] An introduction to programming 
using Turbo Basic, this book contains introductory 
material for beginning programmers and more in- 
depth sections on the Turbo Basic environment for 
experience. Rudimentary understanding of Basic 
is assumed. Many example programs are included. 
Other topics include graphics, sound, and mathemat- 
ical, scientific, and business applications. Example 
output is provided in the graphics section. LB-E 


Languages, S(16-17), P, L. An APL Compiler. 
Timothy Budd. Springer-Verlag, 1988, xi + 156 pp, 
$22.50 (P). (ISBN: 0-387-96643-9] Report of a project 
to develop a compiler for APL, which heretofore had 
been thought impossible due to weak variable typ- 
ing and other very high-level features. The result, a 
prototype compiler written in C for UNIX machines 
(and available from the author), is far from produc- 
tion quality, but is sufficient to resolve most algorith- 
mic and efficiency issues. LAS 


Computer Systems, P. Distributed Computing: 
Structure and Complezity. H.L. Bodlaender. CWI 
Tract, V. 43. Math Centrum, 1987, vi + 294 pp, 
Dfl. 45.40 (P). (ISBN: 90-6196-327-3] Presentation 
of original research in some theoretical distributed 
computing topics. Most of the text is devoted to 
characterization and analysis of emulations, a notion 
of simulation for networks and processors, includ- 
ing complexity of finding uniform emulations. Also: 
algorithms for decentralized extrema-finding, proto- 
cols for distribution of records in rings of processors; 
deadlock-free packet switching networks. RB 


Computer Systems, C, P**, L*. Mathemat- 
ica: A System for Doing Mathematics by Computer. 
Stephen Wolfram. Addison-Wesley, 1988, xviii + 749 
pp, $29.95 (P); $44.25. [ISBN: 0-201-19330-2; 0-201- 
19334-5] The “definitive” user guide to the latest 
symbolic computer package named—at Steve Jobs 
suggestion—“Mathematica.” Written by MacArthur 
Fellow Wolfram, who earlier designed the package 
SMP, Mathematica is designed in two parts—a ker- 
nel which runs the same on any (sufficiently pow- 
erful) computer, and a front end which uses special 
features (windows, graphics) of particular hardware. 
Intended to “liberate the prose of mathematics from 
its grammar,” Mathematica comes with built-in ca- 
pability to do most of calculus and linear algebra, 
but not much more (e.g., not statistics or differential 
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equations). Well linked to other computer systems 
(e.g., UNIX, TeX, Fortran, Postscript) Mathematica 
currently runs on Macintosh as well as on most UNIX 
systems. This guide provides an extensive example- 
based survey of Mathematica behavior, together with 
a full reference guide and a thorough explanation of 
system design (including how to extend the system 
through external Mathematica macro packages). LAS 


Computer Systems, S(17-18), P. Lecture Notes 
in Computer Science-284: Embedded Systems. Ed: 
A. Kindig, R.E. Buhrer, J. Dahler. Springer-Verlag, 
1987, v + 207 pp, $21.80 (P). [ISBN: 0-387-18581-X] 
These lectures are an attempt to bridge the gap be- 
tween newer research results in system architecture 
and its description on one hand, and actual applica- 
tion of those results in developing real time/process 
control systems on the other. The architecture lec- 
tures are not included; instead the design aspect of 
embedded systems is addressed. The text is appro- 
priate for students with previous experience in the 
area. LB-E 


Computer Graphics, C. Advanced Graphics in 
C: Programming and Techniques. Nelson Johnson. 
Osborne McGraw-Hill, 1987, xvi + 670 pp, $22.95 
(P). (ISBN: 0-07-881257-7] A thorough coverage of 
the essential elements needed to do graphics in C. 
Contains techniques and algorithms for graphics pro- 
gramming, graphics editing, hard-copy creation, user 
access, and communications. Focuses on IBM EGA 
graphics standard. Johnson develops a fairly com- 
plete set of graphic functions from the ground up— 
explaining algorithms and possible alternatives as he 
goes. Additional functions include support for se- 
rial and parallel I/O, pop-up menus, and graphics 
text. Appendix contains the complete C codes for a 
graphics system GRAPHIQ (also available on disk). 
Step-by-step approach also gives a valuable lesson in 
software system development—modularity. PS 


Theory of Computation, P. Algorithmic Informa- 
tion Theory. Gregory J. Chaitin. Tracts in Theoret. 
Camput. Sci., V. 1. Cambridge U Pr, 1987, xi + 
175 pp, $29.95. [ISBN: 0-521-34306-2] This entire 
monograph of 175 pages is directed at the proof of a 
single important theorem in computer science—the 
Godel Incompleteness Theorem. The approach taken 
to prove this theorem is quite different from that of 
virtually all other mathematicians. The author uses 
an information-theoretic approach based on the size 
of computer programs. GMS 


Theory of Computation, P. Lecture Notes in 
Computer Science-280: Mathematical Models for the 
Semantics of Parallelism. Ed: Marisa Venturini 
Zilli. Springer-Verlag, 1987, v + 231 pp, $20.60 
(P). (ISBN: 0-387-18419-8] Papers presented at the 
Conference on the Semantics of Parallelism held in 
Rome, Italy in September 1986. The eight papers 
discuss advanced research work on parallel process- 
ing on distributed systems and on guaranteeing in- 
tegrity, security, and consistency of multiple proces- 
sor systems. GMS 


Theory of Computation, P, L*. Open Problems 
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in Communication and Computation. Ed: Thomas 
M. Cover, B. Gopinath. Springer-Verlag, 1987, viii 
+ 236 pp, $25. (ISBN: 0-387-96621-8] Contribu- 
tions of over 50 open problems in the theory of com- 
munication and computation, with a few solutions 
discovered by participants, from summer seminars 
in 1984-86 at Bell Communications Research and 
Stanford University. Highlight is FRACTAN, J.H. 
Conway’s playful programming syntax for fractions 
which, started with short lists of simple fractions, 
can generate (a) all primes, (b) the digits of x, or (c) 
any Turing machine. LAS 


Theory of Computation, T(17-18: 2), P. Theo- 
ries of Computational Complezity. Cristian Calude. 
Annals of Disc. Math., V. 35. North-Holland (US 
Distr: Elsevier Science), 1988, xii + 487 pp, $97.50. 
{ISBN: 0-444-70356-X] A comprehensive treatment of 
the field of computational complexity. Divided into 
five parts, each of which address one of the five ma- 
jor divisions of the field—recursive function theory, 
Blum’s complexity theory, Kolmogorov’s complexity 
theory, LOF theory, and programming hierarchies. It 
treats the subject in a highly advanced form and is 
appropriate only for professionals in this specialized 


field. GMS 


Theory of Computation, P. Lecture Notes in 
Computer Sctence-287: Foundations of Software 
Technology and Theoretical Computer Science. Ed: 
Kesav V. Nori. Springer-Verlag, 1987, ix + 540 pp, 
$40 (P). (ISBN: 0-387-18625-5] A collection of ap- 
proximately 30 papers and invited talks presented 
at the 7th International Conference on Theoretical 
Computer Science held at Pune, India in December 
1987. The papers address advanced research issues in 
automata theory, algorithm analysis, logic program- 
ming, parallel processing, and software theory. GMS 


Computer Science, T(18), P. Parallel Complezity 
Theory. Ian Parberry. Wiley, 1987, 200 pp, $22.95 
(P). [ISBN: 0-273-08783-5] Foundations of parallel 
complexity theory for advanced graduate students or 
researchers in theoretical computer science. Assumes 
familiarity with automata theory, formal languages, 
complexity and analysis of algorithms. LC 


Computer Science, P. Lecture Notes in Computer 
Science-278: Fundamentals of Computation Theory. 
Ed: L. Budach, R.G. Bukharajev, O.B. Lupanov. 
Springer-Verlag, 1987, xiv + 505 pp, $40 (P). (ISBN: 
0-387-18740-5] 110 papers selected from 195 pre- 
sented at a conference held in Kazan, USSR, June 
1987. Sections at the conference were on mathemat- 
ical models of computation, synthesis and complexity 
of control systems, probabilistic computations, the- 
oretical problems of programming, and problems of 
computed-assisted deductions. RWN 


Computer Science, P. Lecture Notes in Computer 
Sctence-290: Co-oP. Tung X. Bui. Springer-Verlag, 
1987, xiii + 250 pp, $24.50 (P). [ISBN: 0-387-18753- 
7] Study of decision support system (DSS) meth- 
ods for multiple user domains which use computer- 
assisted group interaction, with the goals of a coop- 
erative decision setting and distributed and demo- 


1988] 


cratic decision making. The Co-oP system, based 
on a process-oriented group decision methodology, is 
discussed. RM 


Computer Science, P. Lecture Notes in Computer 
Sctence-279: Graph Reduction. Ed: Joseph H. Fasel, 
Robert M. Keller. Springer-Verlag, 1987, xvi + 450 
pp, $34.60 (P). (ISBN: 0-387-18420-1] 1986 workshop 
on graph reduction (an elegant model for evaluation 
of functional programming languages) in which com- 
putation is viewed as a succession of transformations 
between graphs. Papers include discussion of models, 
architectures, resource control issues, arrays. RM 


Applications, P, L. Mathematics as a Service Sub- 
ject. Ed: A.G. Howson, et al. ICMI Study Series. 
Cambridge U Pr, 1988, iv + 92 pp, $39.50; $14.95 
(P). [ISBN: 0-521-35395-5; 0-521-35703-9] Editorial 
overview and eight lead papers derived from an April 
1987 international symposium in Udine, Italy, orga- 
nized by the International Commission on Mathe- 
matical Instruction (ICMI). Major emphasis is on 
teaching university mathematics to engineering stu- 
dents and on the importance of such topics as geome- 
try and discrete mathematics. Opening survey paper 
by the editors sketches a comprehensive agenda of 
issues; a companion volume of selected papers pre- 
sented at the symposium will be published in the 
Springer CISM Series. LAS 

Applications (Biological Science), T(15-16: 1, 
2), L. Mathematical Models in Biology. Leah 
Edelstein-Keshet. Math. Ser. Random House, 1988, 
xvi + 586 pp, $32. (ISBN: 0-394-35507-5] Nicely inte- 
grated treatment of discrete, continuous, and multi- 
variable models in biology, including phase plane or- 
dinary differential equation methods, linear stability, 
bifurcations, partial differential equations applied to 
population, cellular, molecular biology, physiology, 
etc. Biological models, with mathematical tools de- 
veloped in tandem, where mathematics serves as uni- 
fying concept via themes of steady states, stability, 
parameter variations. Minimal prerequisites, very 
nice examples, projects, references. RM 


Applications (Electrical Engineering), T(16- 
17: 2). Digital Transmission Systems and Net- 
works. Michael J. Miller, Syed V. Ahamed. Elect. 
Eng., Commun., & Signal Proc., V. 9 & 10. Com- 
puter Science Pr, 1987, $36.95 each. Volume I: Prin- 
ciples, xix + 274 pp, (ISBN: 0-88175-094-8]; Volume 
II: Applications, xviii + 366 pp. (ISBN: 0-88175- 
176-6] Two-volume text which treats the engineering 
aspects of integrated digital systems for telephone, 
TV, data, and other services. Volume 1 discusses 
evolution of ideas and principles; Volume 2 discusses 
implementation. RM 

Applications (Engineering), T(17: 1, 2), S, 
P, L. Vibrations in Mechanical Systems: Analyt- 
ical Methods and Applications. Maurice Roseau. 
Springer-Verlag, 1987, xiv + 515 pp, $69. (ISBN: 0- 
387-17950-X] Translated from the 1984 French edi- 
tion. On free or induced vibrations in discrete or 
continuous mechanical structures. Linear, nonlinear, 
asymptotic theories via analysis of ordinary and par- 
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tial differential equations. Some previously unpub- 
lished results. Superb selection of examples of ap- 
plications. For graduate students and researchers in 
mechanics and mechanical engineering. Many refer- 
ences. No problems. JK 


Applications (Engineering), P*. Structural Re- 
habthty: Analysis and Prediction. R.E. Melchers. 
Civil Engin. Halsted Pr, 1987, 400 pp, $89.95. 
(ISBN: 0-470-20873-2] Probability applied to struc- 
tural reliability. Practical aspects and underlying 
theory. Unified discussion of structural systems reli- 
ability, including review of influence of time. Simula- 
tion methods, modern second-moment and transfor- 
mation methods. Relationship of human error and 
human intervention to failure probabilities. Appen- 
dices with summary of probability theory and appro- 
priate tables. Numerous references. JK 


Applications (Engineering), T(16-17: 1, 2), 
L. Engineering Analysis: A Vector Space Approach. 
Robert J. Schilling, Hua Lee. Wiley, 1988, xiv + 
337 pp, $44.95. [ISBN: 0-471-82760-6] Aimed at se- 
niors or first year graduate students in engineering, 
and assuming only a traditional background in cal- 
culus, the emphasis is on linear spaces and operators. 
The first seven chapters are an introduction to linear 
algebra. Chapters 8 and 9 treat differential and dif- 
ference systems, and the final chapter deals with or- 
thogonal transforms, including Fourier, Hankel and 
Hilbert transforms with applications. Exercises, ref- 
erences, index. JS 


Applications (Fluid Mechanics), P*. The 
Mathematical Theory of Turbulence, Second Edi- 
tion. M.M. Stanigi¢. Universitext. Springer-Verlag, 
1988, xviii + 501 pp, $45 (P). (ISBN: 0-387-96685-4] 
New fourth chapter on the dispersion of tempera- 
ture in magnetohydrodynamic turbulence from the 
Lagrange-Euler point of view. New Chapter V on 
contemporary developments in theoretical and ex- 
perimental turbulence. References updated to 1987. 
(First Edition, TR, October 1985.) JK 


Reviewers 


RJA: Richard J. Allen, St. Olaf; DFA: David F. Appleyard, 
Carleton; RB: Richard Brown, Carleton; LB-E: Lisa Brown- 
Ewing, Macalester; JNC: Judith N. Cederberg, St. Olaf; LC: 
Laura Chihara, St. Olaf; BC: Barry Cipra, St. Olaf; CEC: 
Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Car- 
leton; RSF: Robert S. Fisk, St. Olaf; SG: Steven Galovich, 
Carleton; GG: George Gilbert, St. Olaf; BH: Bruce Hanson, 
St. Olaf; RH: Rhonda Hatcher, St. Olaf; RK: Roger Kirch- 
ner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: Joseph 
Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; AM: 
Alan Magnuson, St. Olaf; SM: Steve McKelvey, St. Olaf; RM: 
Richard Molnar, Macalester; RWN: Richard W. Nau, Car- 
leton; AO: Arnold Ostebee, St. Olaf; MR: Matthew Richey, 
St. Olaf; AWR: A. Wayne Roberts, Macalester; GMS: G. 
Michael Schneider, Macalester; PS: Paul Schoening, St. Olaf; 
JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; 
KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; 
MS: Myriam Steinback, Macalester; MU: Milton Ulmer, Car- 
leton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wal- 
lace, St. Olaf; LW: Lynne Walling, St. Olaf; FLW: Frank L. 
Wolf, Carleton; MZ: Michael Zielinski, St. Olaf. 
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your classroom. It all adds up: respected authors, 


meticulous text development, innovative software, and 


conscientious service for you and your students. 
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Fourth Edition 
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Brief Calculus with 


Applications 
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College Algebra 
Fifth Edition 
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Second Edition 
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Intermediate Algebra 
Second Edition 
Available Now 


Shenk 
Calculus and Analytic 
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Fourth Edition 
In single- and split-volume editions. 


Cullen 
inear Algebra with 
pplications 
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Johnson/Steffensen 
College Algebra and Trigo- 
nometry with Applications 


Johnson/Steffensen 
College Algebra with 
Applications 


Johnson/Steffensen 
Trigonometry with 
Applications 


Schiller/Wurster 
College Algebra and 


Trigonometry 
Basics Through Precalculus 


Schiller/Wurster 
College Algebra 


RECENTLY PUBLISHED... 


Lial/Miller 
Beginning Algebra 
Fifth Edition 


Lial/Miller 


Intermediate Algebra 
Fifth Edition 


Lial/Miller 
Algebra for College Students 


Williams 


Basic Mathematics 
Fundamentals and Algebra 


Second Edition 


Steffensen/Johnson 
Essential Mathematics for 


College Students 
Prealgebra, Algebra, and Geometry 


Peter/Welch 
Algebra for College Students 


For further information please write 
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KING ON NEW DIMENSIONS 


With Springer-Verlag 
publications... the 
point is made clear! 


Geometry: A High School Course 
Second Edition 


S. Lang and G. Murrow 


This updated text is intended for the 
geometry course in secondary schools. Classi- 
cal material is treated as Euclid did, but the 
book profits from some new ideas which have 
developed since Euclid’s time, like the use of 
coordinates. These are introduced in parallel 
with Euclid’s system to allow for practical 
computations not otherwise possible, and to 
maintain algebraic skills in a natural way dur- 
ing the geometry year. Many concrete realiza- 
tions of geometry in daily life and in scientific 
life are presented. Proofs are given for the 
formulas for the length, area, or volume of 
standard figures, like the area of a disc, 
circumference of a circle, and volume of a 
sphere or a cone, much as Euclid or Archi- 
medes would have derived them. These proofs 
develop spatial insights while providing a 
clear view of the mathematics. At the end 
of the book, material on transformations is 
included both for its own sake and to show 
how transformations relate to the Euclidean 
system. Students learn this system better by 
seeing it applied in a different context. There 
are many exercises, some routine and others 
more complex. Experiments and construc- 
tions, which are especially suited for in-class 
activity, working in groups, and open-ended 
discussions, are included. 
1988/app. 416 pp./577 illus./hardcover/ $39.00 
ISBN 0-387-96654-4 


A First Course in Real Analysis 
Second Printing 


M.H. Protter and C.B. Morrey 


This text is designed for the junior or 
senior student taking a beginning course in 
analysis. Starting with an optional treatment 
of the axioms for the real number system, 
including mathematical induction, the authors 
provide a solid discussion of limits and conti- 
nuity, as well as the basic theory of differen- 
tiation and integration. Features of the text 
include: ¢ a discussion of both the Darboux 
and Riemann integral in R" ¢ metric spaces, 
mappings, compact sets, and the Heine-Borel 
Theorem ¢ the Riemann-Stieltjes integral and 
functions of bounded variation ¢ the implicit 
function theorem for a single equation and for 
systems ¢ functions on metric spaces, Arzela’s 
Theorem and the Stone-Weierstrass Theorem. 
1987/507 pp./hardcover/$39.80 
ISBN 0-387-90215-5 
Undergraduate Texts in Mathematics 


Intermediate Calculus 
Second Edition 


M.H. Protter and C.B. Morrey, 


This text is designed for two classes of 
students: those who have completed a one- 
year course in the calculus of one variable and 
desire a third semester consisting of the cus- 
tomary calculus of several variables, and those 
who have completed the usual three semester 
calculus course and want a fourth semester 
containing the basic elements of advanced cal- 
culus. The relative independence of the chap- 
ters provides great flexibility in how the text is 
used. Features of the book include: ¢ all the 
customary topics in elementary calculus of 
several variables ¢ thorough treatment of vec- 
tor field theory, including Green’s Theorem, 
Stokes’ Theorem, and the Divergence Theorem 
¢ a chapter on Fourier series with an elemen- 
tary proof of the basic convergence theorem ¢ a 
discussion of the implicit function theorem 
and the inverse function theorem ¢ when 
differentiation under the integral sign is 
permissible ¢ an elementary treatment of 
uniform convergence and the convergence 
of double series. 


1986/650 pp./266 illus. /hardcover/$44.00 
ISBN 0-387-96058-9 
Undergraduate Texts in Mathematics 


Basic Mathematics 
S. Lang ~ 


This text in basic mathematics can be 
used in multiple ways: for a one-year course 
in high school during junior and senior years; 
for a complementary reference in earlier high 
school grades; for a one-semester course at the 
college level; and to review or to get a firm 
foundation in the basic mathematics necessary 
for calculus, linear algebra or other subjects. 
Topics, such as analytic and vector geometry, 
are presented as natural continuations of both 
algebra and geometry, thus providing a proper 
balance and progression between interrelated 
topics. The emphasis is on presenting, in an 
easy and unthreatening style, the basic con- 
cepts and definitions of mathematics and their 
simpler computational aspects, and then mov- 
ing on to more advanced topics. This volume 
deals with mathematics on both the manipula- 
tive and theoretical level, so a reader can find 
pleasure in the understanding of a proof, as 
well as in the knowledge of how to use the 
results in concrete circumstances. 


1988/app. 496 pp./224 illus./softcover/$29.80 
ISBN 0-387-96787-7 


Ordering Information: To order write to 
Springer-Verlag New York, Inc. Attn: B. Bay, 
S935, 175 Fifth Avenue, New York, NY 10010. 
Checks and money orders (plus $2.50 for ship- 
ping) are acceptable forms of payment. NY, NJ 
and CA residents, please add state sales tax. 
To order by credit card, call TOLL FREE 
1-800-SPRINGER (in NJ, 201-348-4033) and 
refer to Number S935. 


® Springer-Verlag New York Berlin Heidelberg Vienna London Paris Tokyo 
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in the head of Archimedes than 


in that of Homer’’—Voltaire 


And there’s more imagination, astonish- ‘‘Students of recreational mathematics 
ment, and wit in the pages of Archi-_ will be delighted with this new collection 
medes’ Revenge than in any number of of exciting problems, paradoxes, brain teas- 
other books about mathematics. ers and curiosities.” — Martin Gardner 
‘Other works have rhapsodized at length 
about the nature of infinity and the beauty “‘Engaging, highly readable...Algorithms, 
of ‘higher dimensions’,’ says the author, geometric theory, perfect numbers, and 
Paul Hoffman. ‘‘Such philosophical excur- a quantum approach te congressional 
sions have their place, but are far fromthe representation are a few of the topics 
concerns of most working mathematicians. explored.’ —Booklist 
I give a glimpse here of some of the other 
ol o Y actuallyde pure and ap ‘‘Fascinating.’’ —Isaac Asimov 
‘‘Paul Hoffman gives us a rare view into ‘‘Informative and entertaining.’’ 
some of the profound changes now tak- —Arno Penzias, Vice President of Research, 
ing place in mathematics...Archimedes’ Bell Labs, Nobel Prize-winner in Physics 
Revenge explains this all with 
clarity, wit, and wonderful 
insight.’’ 
—R.L. Graham, Adjunct 
Director, Research, 
Information Sciences 
Division, Bell Labs 


Illustrated. 


Now at 
your bookstore, 
or order 
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W.W. Norton & Company, Inc. d 
| Dept IKA | 
th Avenue 

500 Fifth A | 

New York, N.Y. 10110 
| Please send me postpaid ____ copy(ies) of Archimedes’ | 
| Revenge by Paul Hoffman @ $17.95. | 
| Ienclose L] check J money order for $ | 
| Name | 
| Address | 
. | City | 
NORTON: es “— | 
TO | NY and CA residents, please add sales tax. _| 


W.W. Norton & Company, Inc. 
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The Complete 
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Graphing Package 


Lascaux Graphics 
3220 Steuben Ave. 
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(212) 654-7429 
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Writing Mathematics Well, by Leonard Gillman 


64 pp., 1987, ISBN-0-88385-443-0 


Catalog Number - WMW __List: $6.00 MAA Member: $4.50 


Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put inta 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
the opportunities for better and more 


forceful writing. It does this both by pre- 
cept and by example. 

A book to be read for its sharpness 
and wit as well as for enlightenment, Writ- 
ing Mathematics Well should be on the 
shelf of anyone who writes or intends to 
write mathematics. It will amuse and 
delight the already careful writer and it will 
help reform and refine the sensibilities of 
those who may be somewhat careless 
about their writing. 


Order from 

The Mathematical Association 
of America 

1529 Eighteenth St. NW 
Washington, D.C. 20036 
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New Texts From Academic Press 


FRACTALS 
EVERYWHERE 
Michael Barnsley 

Rather than considering 
fractal images that have been 
generated randomly, Michael 
Barnsley’s approach is to start 
with a natural object and find 
a specific fractal to fit it. The 
applications to computer 
graphics, and in particular the 
compression of images suit- 
able for data transmission and 
reconstruction, are an exciting 
new development. 

Providing the foundation 
for future development, Frac- 
tals Everywhere is ideal as a 
textbook on fractal geometry 
and its applications. Beauti- 
fully illustrated, including 32 
full-color plates, it is also well 
suited as a reference work. 


September 1988, c. 516 pages 
$39.95 (tentative) 
ISBN: 0-12-079062-9 


ee 
CURVES AND 
SURFACES FOR 
COMPUTER AIDED 
GEOMETRIC 
DESIGN 

Gerald Farin 


This work concentrates on 
the concepts of Bezier and 
B-spline methods for curve 


and surface fitting, by which it 
is possible to explain the main 
concepts of computer aided 
geometric design in a very 
intuitive and geometric fash- 
ion. Suitable for a course in 
computer aided geometric 
design, the text contains more 
than 150 illustrations and 8 
color plates. 

August 1988, 300 pages 
$39.95 (tentative) 

ISBN: 0-12-249050-9 


ee 
THE NUMERICAL 
SOLUTION OF 
ORDINARY AND 
PARTIAL 
DIFFERENTIAL 
EQUATIONS 
Granville Sewell 

This book gives a solid intro- 
duction to the computer solu- 
tion of ordinary and partial 
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How I Became a Torchbearer for Matrix Theory* 


OLGA TAUSSKY, California Institute of Technology 


OLGA TAUSSKY is sufficiently well known to Monthly readers that further 
words here would be superfluous. Besides, this article is in itself biographical, 
as you will see [ Ed.]. 


Some time ago in our public library I picked up a book, The Best of All Possible 
Worlds, by Peter F. Drucker, a professor of social science at Claremont Graduate 
School. In that book, Drucker writes about a pupil of Riemann who was to write his 
thesis on quaternions. Riemann had seen their importance to his own areas of study, 
and his student saw that they would lead to the subject that we now call matrix 
algebra, which has become all the rage. 

But matrices were not always the rage. They have played a large role in group 
theory since the work of Elie Cartan, and they play a role in physics and in 
statistics. Still, matrix theory reached me only slowly. Since my main subject was 
number theory, I did not look for matrix theory. It somehow looked for me. In what 
follows a number of instances of such events are sketched. 


1. Algebraic number theory. In a proof given by Minkowski in 1900** there 
appears the theory of matrices with dominant diagonal. He was reproving some of 
Dirichlet’s results concerning units in algebraic number fields, and he observed that 
if a real matrix A has negative off-diagonal elements but only positive row sums, 
then its determinant is nonzero, in fact even positive. A similar theorem can be 
proved by a continuity argument. Such results are connected with the famous 
GerSgorin theorem, to which I will return in section 6 below. 


2. The theorem of Shoda and R. C. Thompson. The next eye-opener for me was 
much stronger and it appeared in stranger circumstances: one of my favorite 
theorems in matrix theory came to me via class field theory! It is K. Shoda’s 
theorem concerning matrices of determinant 1, and it states that for certain fields F, 
such a matrix is a commutator B~'C~!BC where B,C are matrices over F. 

How does class-field theory come into this? When I was working on my thesis in 
class-field theory, a subject created by Hilbert, some of the most important results 
in the subject were being found by Takagi, in Japan. I summoned up the courage to 
ask him for reprints, and not only did he send them, but a number of other 
mathematicians in Japan, even in quite different areas, sent reprints too, and one of 


*This is a slightly enlarged version of an invited lecture delivered in Raleigh, North Carolina, at the 
1985 SIAM Conference on Applied Linear Algebra, organized by R. Brualdi and H. Schneider. 
**Bibliographic references are collected at the end of the paper, and are arranged by sections. 
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them was Shoda. I adored this theorem and felt right away that some day it would 
play a role in my life (but it took a long time to do so). 

The problem was this. If we take a pair of nonsingular matrices X, Y over a field, 
then it is easy to see that they can be replaced by AB, BA, for suitable matrices A, B 
if and only if they are similar. I tried to find out under what circumstances there 
exist matrices A, B,C such that 


X = ABC, Y = CBA. 


The answer is, iff det X = det Y, or equivalently, iff det XY"! = 1. My proof 
depended on Shoda’s theorem, so it was valid in fields for which that theorem holds. 
Shoda himself had not found all of those fields, but R. C. Thompson, in his 1960 
thesis, was able to characterize them completely. The result is now called the 
Shoda-R. C. Thompson theorem (see also Sourour 1986, for recent work in this 
area). 


3. Topological algebra. Pontrjagin’s paper Uber stetige algebraische Korper fell 
into my hands quite by accident. Let F be a topological field, 1.e., a field that is also 
a topological space, so that 


lim(a, + b,) =lima,+limd,; ~~ lim(a,b,) = lima, limb; 
lim b-! = (limb,)* unless lim b, = 0. 


Then Pontrjagin proves that under certain topological assumptions, F must be 
isomorphic with either the reals, the complex numbers or the real quaternions. 
Under the same conditions, F' must contain a subfield D isomorphic with the reals, 
such that D commutes with every element of F. Furthermore, F contains a finite set 
of elements such that every element of F is a linear combination of these elements 
with coefficients in D. Hence F is actually a division algebra over the reals, and so 
by a theorem of Frobenius it is isomorphic with the reals, the complex numbers or 
the real quaternions. Frobenius used matrix theory for the proof of his theorem. 
Later I myself found a topological proof for the theorem. Pontrjagin’s proof 
partitioned the field into sets {A}, {u},{v} with the properties that A” — 0, pu” is 
divergent, and v” has no divergent subsequence nor is 0 a point of accumulation 
(analogous to the interior, exterior and circumference, respectively, of the unit disk). 

I became interested in studying such J sets for real algebras. The matrices over 
the reals form such a set in their natural topology. John Todd and I wrote the paper 
‘Infinite powers of matrices’ on this subject. 

It is well known that matrices C whose powers approach zero play a big role in 
iteration processes. They have been characterized by P. Stein as being precisely 
those matrices C for which there exists a matrix X such that X¥ — CXC * is positive 
definite. Later I showed that Stein’s theorem is equivalent, by a Cayley transforma- 
tion, to the Lyapunov theorem for stable matrices. See also section 6 below for more 
on Lyapunov’s theorem. 


4. Integral matrices. An integral matrix is one whose entries are rational integers. 
My first work on this subject was with John Todd, during the last weeks before the 
war broke out, in Great Britain. Integral matrices had made me return to my major 
subject of number theory in a big and unexpected way. It happened at Bryn Mawr 
College, where I held a scholarship during a year when Emmy Noether was also 
there, and she was known as a champion of completely abstract approaches, even to 
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number theory. But a strange thing happened. Another Fellow at the College, Grace 
Shover (now Quinn), introduced me to her thesis adviser MacDuffee, who was an 
expert in matrices. I learned more about MacDuffee’s work, some of which can be 
traced back to Poincaré’s studies on matrices that are attached to ideals in algebraic 
number fields. This led to my work on so-called ideal matrices. Another paper that 
was influential for me was that of Latimer and MacDuffee, because it provides an 
important link between algebraic number theory and integral matrices. 


5. The theorem of McCoy. John Todd and I spent the first year of World War II, 
1939-40, in Ireland on leave from our positions in the University of London. I had 
no assigned duties during the first term, and found the library at Queen’s University 
very appealing. Among items that I had not seen before was the work of McCoy, in 
particular his well known characterization of pairs of matrices which can be 
transformed simultaneously to upper triangular form. 

Precisely, here is McCoy’s result. Let A, B be two n X n matrices, and suppose 
they have the following property: for every polynomial f in two variables, the 
matrix f(A, B) has for its eigenvalues the numbers f(A, ».), where A, » are suitably 
ordered eigenvalues of A and B, respectively. McCoy showed that for A, B to have 
this property it is necessary and sufficient that a matrix T exists such that both 
T~'AT and T~!BT are upper triangular. 

Since this work was kind enough to jump off of the library shelves into my hands, 
I started a correspondence with McCoy, and my interest in matrices was further 
strengthened. Drazin, Dungey, and Gruenberg [1951] gave a more elementary proof 
of McCoy’s theorem, but my 1957 proof, via the radical in abstract algebras, takes 
only a few lines. See also Flanders’ treatment. 


6. The new numerical mathematics. In the early forties, numerical mathematics 
was in a rather primitive state. The standard textbook in Britain was Whittaker and 
Robinson, which concentrated on interpolation, the solution of equations by New- 
ton’s method, numerical quadrature and Fourier analysis. The solution of linear 
equations was studied, not for its own sake, but as an appendage of the theory of 
least squares approximation. Characteristic value problems and differential equa- 
tions received little attention. A book with a more comprehensive point of view was 
that of Frazer, Duncan and Collar, but it was not as widely used. 

In the decades that followed there was intensive study of the approximate 
solution of continuous problems such as differential equations, and many of these 
involved matrix methods in a natural way, either iteratively or directly. In particu- 
lar, the whole subject of sparse matrices grew largely out of the study of characteris- 
tic value problems and the discretization of partial differential equations. I had 
several contacts with these developments, which I will now discuss. They brought 
me irnfto matrix theory again, whereas previously, numerical mathematics had 
interested me only for special problems in number theory. 


(A) The Gersgorin theorem 

The war was on, WWII, and I was working in London at the National Physical 
Laboratory under R. A. Frazer in the flutter group. I was assigned to the study of 
flutter in supersonic aircraft, which leads to boundary value problems in hyperbolic 
partial differential equations. Hence this work did not immediately contribute to my 
matrix enthusiasm. However, I had read Frazer’s article on how the flutter calcula- 
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tions were to be carried out. A large group of young girls, drafted into war work, did 
the calculation on hand-operated machines, following the instructions of Frazer and 
his assistants. 

The relevance of these calculations to aircraft design is that in flight the 
interaction between the elastic forces in the airframe and the aerodynamic forces 
induces self-excited vibration which, above a certain speed, is unstable. This 
phenomenon is called flutter. It is, therefore, important to know what the flutter 
speed is before the aircraft is built and flown. 

By a mere accident I had heard about the GerSgorin theorem, whose statement is 
given in a Zentralblatt review. It showed me how to reduce the amount of 
calculation, in a way that I will now try to explain. 

The theorem itself states that the eigenvalues of an n X n matrix A with complex 
entries lie in the union of the closed disks (‘GerSgorin disks’) 


Iz—a,|< Vila,| (=1,2,...,n) 
k#i 
in the complex z plane. We will call the union of these disks the Gersgorin set of the 
matrix A, and will denote it by I. 

In the case that I was working on, the question came down to showing that a 
certain 6 X 6 matrix of the form —w*A + iwB + C, where w, the flutter parameter, 
is taken as 1 in the example (see Fig. 1), had no real eigenvalue to the left of the 
small circles. The matrix entries were in the neighborhood of 20 or so, but in itself 
that told us nothing about the whereabouts of the eigenvalues. 

However, it turned out that the GerSgorin disks looked like the ones shown in 
Fic. 1. 


Circles of the sixth order case (c) 


-23 -18 -13 -8 -3 -2 -1 0 


Fic. 1. The GerSgorin disks. 
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This meant that we were lucky indeed, because the GerSgorin theorem can be 
applied as follows: 

(i) If I’, the GerSgorin set, falls into two connected components, one generated 
by r of the disks and the other by s of them, then there are r eigenvalues in the first 
component and s in the other. 

(ii) A similarity transformation S~1!4S does not change the eigenvalues of A, 
but it may well change the estimates that are given by GerSgorin’s theorem. Hence, 
by a careful choice of S, we may get sharper estimates. The easiest kind of an S$ to 
use, it turns out, is one that agrees with the identity matrix except in one position, 
say S|, = r, where r # 0. This similarity will multiply the radius of the ith GerSgorin 
disk by 1/r while leaving all the centers of the disks unchanged. 

(ii1) The intersection of two GerSgorin sets, the original one and the one obtained 
after the similarity transformation, is again a region in which all of the eigenvalues 
must lie. 

In Fic. 2, for example, it turned out that the larger circle could be replaced by 
the small circle far above the x-axis, by a similarity transformation of the diagonal 
type discussed above. 


Circles for sixth order case (c) 


—-5 


-23 -18 —13 ~8 -3 -2 -1 0 


Fic. 2. Similarity transformations shrink the regions. 


The small circle cannot be shrunk by the same method into a point unless the 
diagonal element is itself an eigenvalue. If we vary the diagonal similarities then at a 
certain point the other circles will overlap the isolated one and it will no longer be 
isolated. I raised the question of describing when this happens. Henrici, with a 
followup by F. Gaines, had the first contribution, in a special case. John Todd and 
Richard Varga gave different solutions of this problem. The school of A. Brauer at 
Chapel Hill worked on related questions. 

Once again, I didn’t ask to be assigned to matrix problems. They found me. 
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(B) The Stein-Rosenberg theorem 

Mordell asked me to look at a manuscript of Stein and Rosenberg, before it went 
out to a referee. I liked it, and it has since become a classic. It has been particularly 
studied by Francois Robert in France, and is discussed in detail in Varga’s 1962 
book. The Perron-Frobenius theorem, which concerns the eigenvalues of matrices 
with nonnegative entries, plays a big part in this paper. 

The Stein-Rosenberg theorem itself concerns iteration. The two classical iterative 
methods for the solution of linear equations Ax = b can be described as follows. 
Assume A = J — L — U, where L, U are lower and upper strictly triangular and I 
is the unit matrix. Make a guess x. Then, in the Jacobi method, we improve the 
guess by means of 


xO = -(L+U)x© +b. 


Anyone who tries this soon discovers that it works better if one replaces the 
components of x as fast.as the improved ones are calculated, instead of continu- 
ing to use the old values until all new ones have been found. Formally, if we do that, 
the iteration process is defined by 


x® = (T— L)*(Ux® + b) 


and the procedure is called the Gauss-Seidel method. It seems plausible to assume 
that the Gauss-Seidel method, since it uses improved estimates of the components of 
the unknown vector, would converge faster than the Jacobi method. This is not the 
case always. However, the Stein-Rosenberg theorem asserts that if L and U are 
nonnegative, the two processes converge or diverge together, and the Gauss-Seidel 
process is at least as fast, whichever happens. 


(C) The Lyapunov theorem 

The Lyapunov theorem, too, is of great interest in flutter work. It is a criterion 
for the stability of an n X n complex matrix, where a stable matrix is one all of 
whose eigenvalues have negative real parts. The economist Arrow had used this 
theorem for a measure of the stability of an economic system. 

The Lyapunov theorem gives the following criterion for a complex matrix A to 
be stable: there should exist a positive definite matrix H such that AH + HA* = 
—I, This criterion is useful for theoretical purposes though it is not well suited to 
computation in particular instances. 


(D) The Hilbert matrix 
The Hilbert matrix A is defined by 


a,,=1/Uitj) (i, 7 =1,2,..., 2). 


Its inverse is integral. 

This matrix, too, made a surprise entry into my research interests. In late 1947, 
after we had settled down at the U.S. National Bureau of Standards, I received a 
letter from Professor G. Temple in London. He wrote that the Oscillation Subcom- 
mittee of the British Aeronautical Research Council was interested in the Hilbert 
matrix, and would appreciate comments from me. 

In due course I wrote a paper explaining the slow convergence of the largest 
eigenvalue A, of A to its limiting value of 7. John Todd then studied the condition 
of A and he and others used it as an example of an ill-conditioned matrix. Its 
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determinant, for example, is nonzero, but extremely small. Later H. S. Wilf and 
N. G. de Bruijn described the behavior of A, more precisely. While I had proved 
that 


A, = 7{1 + O(1/log n)} 
they showed 
h, = 7 — 4n(logn)~* + O( (log log n ) (log n) >). 


n 


The literature on A is extensive and interest in A continues (see Wilf, Ergebnisse, 
volume 52, 1970). 


7. The Perron-Frobenius theorem and combinatorial matrix theory. The Perron- 
Frobenius theorem concerns the eigenvalues of matrices that are irreducible and 
have nonnegative entries. Among its conclusions, for instance, is the fact that the 
matrix must have a positive real eigenvalue that is not exceeded, in absolute value, 
by any other eigenvalue. 

It received a special lift through the proof by Wielandt (1950), cf. Gantmacher, 
Matrix Theory II. Graph theory plays a role there, in fact one can define irreducibil- 
ity of a matrix by connectedness of a certain graph that is determined by the 
positions of the nonzero matrix entries. 

I studied N xX N incidence matrices A of projective planes, and observed that 
AN~?} always has strictly positive entries. I further raised the question of determining 
the exponent of A, 1.e., the least power of A that has positive entries only. A. L. 
Dulmage and N. S. Mendelsohn showed that A* > 0 and that permutation matrices 
P,Q exist such that (PAQ)? has some zero entries. These authors used purely 
combinatorial arguments rather than the Perron-Frobenius theorem. For other 
combinatorial matrix work I refer to, e.g., Brualdi, Schneider, Engel, M. Hall, Ryser, 
etc. 


8. Connections with topological algebra. In the late thirties I suddenly realized 
that the Cauchy-Riemann equations and the fact that they imply the Laplace 
equation can be expressed via matrix theory and can be connected with the fact that 
the field of complex numbers has no zero divisors. I then studied the values of n for 
which generalized ‘Cauchy-Riemann’ equations for n functions u, in n real vari- 
ables x, lead to the n-dimensional Laplace equation. I used algebras over the reals 
which have no zero divisors. 

At that time the fact that such algebras must have dimensions 1, 2, 4,8 was not 
yet completely established, although great progress had been made by topological 
methods, in particular by E. Stiefel. However, Stiefel reproved my result about 
dimensions 1,2, 4,8 and for complex variables as well, without topology, by using 
representation theory of algebras and matrix theory. 


9. M. Marden’s book on geometry of polynomials. Marden’s book, which ap- 
peared as Mathematical Surveys, No. 3, AMS, 1966, especially section 31, contains 
applications of matrix theory to the study of geometry of the zeros of polynomials, 
e.g. GerSgorin’s theorem applied to the companion matrix of a polynomial gives 
information on the location of its zeros, and the theorem of Perron-Frobenius does 
likewise (cf. H. S. Wilf (1961)). 
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10. The Schur matrix. The Schur matrix is the matrix (e?7'""/7) forl < m,n < q, 
and it has generated many problems of great interest. Its trace, for example, is 
obviously a Gaussian sum. The eigenvalues of this matrix were obtained by Carlitz, 
and the eigenvectors by P. Morton answering a question of Hugh Montgomery. 
Landau’s famous book on number theory had introduced me to this matrix, and it is 
a valuable bridge between matrices and the theory of numbers. 

If we take the Schur matrix in the form 


S=S, = (¢-DU-D)! (f= exp2mi/n) 


then we find it also in the theory of the Fast Fourier Transform. If v is a vector, 
then Sv is essentially its discrete Fourier transform. Theilheimer showed that the 
Fast Fourier Transform, which speeds up the calculation quite a bit, amounts 
essentially to a factorization of S, namely, 


ao T Olll I 
Sn Pl f Mlb “0 | 
where P is a permutation matrix, T = ({77~YU~)), and D = diag(1, ¢..., §"7+). 


11. Cramped matrices. A matrix theorem that fascinated me at a very early time 
came from the book by Speiser on finite group theory. It is connected with the fact 
that every finite group is isomorphic to a group of unitary matrices. Let A, B be 
such matrices, and put C = ABA~'B™!. Let the eigenvalues of A lie on an arc less 
than a semicircle of the unit circle. If 4 and C commute then C = J. 

Zassenhaus wrote a paper, as did M. Marcus and R. C. Thompson, on this 
subject, and so had I done. Papers in functional analysis were stimulated too, and 
Berberian introduced the term ‘cramped’ for such matrices A. The theorem itself 
goes back to C. Jordan. 


12. Other areas. While I was still in Britain, the idea had come to me that while 
there are a number of inequalities tying the eigenvalues of A, B and A + B there 
had not been much done on explicit relationships between them. With these 
questions in mind I became very enthusiastic about Mark Kac introducing me to 
what I later called the L-property, L for linear, because the eigenvalues of 
AA + pB are then Aa, + pf, with a,;, B, suitably ordered eigenvalues of A, B. 
Motzkin joined me in this work, and we proved among other things the following 
theorem: if for all A and p the matrix 1A + pB is diagonalizable, then A and B 
commute. This result was reproved by Tosio Kato, using perturbation theory, and 
was later generalized by S. Friedland. Kaplansky generalized the L-property to 
operators. 

After the war, in 1947, we came to the USA when John Todd was invited to work 
at the National Bureau of Standards on the uses of high speed computing machines, 
and soon after my arrival I was given employment as well. At that time I picked up 
the torch. Matrix theory had become a subject for me. Matrices were not any longer 
just used, they were algebraic structures like rings, groups, lattices.... 

At first I collected relevant references and they went into my chapter in John 
Todd’s Survey of Numerical Analysis. But I was also working on other projects 
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linked to matrices, both algebraically and arithmetically. Stimulated by McCoy’s 
work, I became interested in pencils of matrices and in matrix algebras. The Caltech 
theses by F. Gaines and by H. Shapiro generalized McCoy’s idea of simultaneous 
triangularization to simultaneous block triangularization, and further contributions 
were made by R. C. Thompson. All of this led to the study of generalized 
commutativity, commutators, higher order commutators, and to my idea of poly- 
nomials in the commutator operator. 

While still at NBS, in 1951, I was asked to organize a conference in numerical 
analysis and, of course, I chose matrix theory as the theme. It was perhaps the first 
matrix conference ever. The Proceedings of this conference contain Givens’ rotation 
method for determining the eigenvalues and eigenvectors of real, symmetric matrices. 

In my position at NBS I encouraged people to turn to matrix theory, and I 
thought of bounds for eigenvalues as my major project there. The school of W. V. 
Parker at Auburn also worked on this subject at the time. I had a master’s student, 
Marion Walter, at NYU, who wrote a thesis on limits for the characteristic roots of 
a matrix, and I gave a course on matrix theory at NYU in 1955. I also looked after a 
group of highly talented postgraduate students and high ranking visitors at NBS. 
Strangely enough, they all became interested in matrix theory. 

Matrix theory changed my life quite a lot. During the war and my civil service 
work I had lost my favorite subject to a large extent, though not entirely. However, 
integral matrices brought it back to me quite unexpectedly. 

During our 1947 visit to Princeton I met Chowla. At that time he and two other 
mathematicians talked to me about similarity classes of integral matrices with the 
same characteristic polynomial. This reminded me of the theorem of Latimer and 
MacDuffee; Chowla urged me to find a new proof for it, and I managed to do so. 

Since then integral matrices have played a major role for me, and I helped to get 
a number of other people, like Zassenhaus, Dade, M. Newman, Estes, and Gural- 
nick interested in them. We found simpler ways to prove many classical number- 
theoretic theorems by methods that used integral matrices. I have given an exposi- 
tion of the connection between algebraic number theory and integral matrices in an 
appendix to a book by H. Cohn on algebraic number theory. 


Some advice. When you observe an interesting property of numbers, ask if 
perhaps you are not seeing, in the 1 X 1 case, an interesting property of matrices. 
Think of GL(n, F) or SL(n, F), GL(n, Z) or SL(n, Z). 

When you have a pair of interesting matrices study the pencil that they generate, 
or even the algebra. 

When the determinant of a certain matrix turns out to be important, ask about 
the matrix as a whole, for instance as in the case of the discriminant matrix, as 
suggested by the discriminant of an algebraic number field. 

When a polynomial in one variable interests you, ask about the matrices of which 
it is the characteristic polynomial. 

When people look down on matrices, remind them of great mathematicians such 
as Frobenius, Schur, C. L. Siegel, Ostrowski, Motzkin, Kac, etc., who made 
important contributions to the subject. 

I am proud to have been a torchbearer for matrix theory, and I am happy to see 
that there are many others to whom the torch can be passed. 
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Area-minimizing Surfaces, Faces of Grassmannians, and Calibrations 


FRANK MorGan, Williams College 


FRANK MorGan: I went to MIT and Princeton, where my thesis advisor, 
Fred Almgren, introduced me to minimal surfaces. Since then I have taught 
at MIT, Rice, and Stanford. In the fall of 1987, I began teaching at Williams 
College. (Photo courtesy of Gordon Graham /Prism.) 


1. Faces of Grassmannians 


A one-dimensional curve that heads in a constant direction (i.e., a straight line 
segment) is length-minimizing among all curves with the same boundary. 


~~ Lue OT 


Fic. 1(1). A straight line is length-minimizing. 


This fact has an interesting generalization to higher dimensions. Instead of curves, 
one considers m-dimensional surfaces S in n-dimensional space R”. Instead of a 
constant direction, one considers a certain collection of tangent plane directions. If 
all the tangent planes to a surface S belong to the same collection, then S is 
area-minimizing among all oriented surfaces with the same boundary. 

These collections of tangent plane directions are called “faces.” The set of all 
oriented m-dimensional planes through the origin in R” is called the Grassmannian 
G(m,R”). The definition of these faces depends on viewing the Grassmannian 
G(m,R") as sitting inside a higher-dimensional Euclidean space R”. The dimension 
N equals (”) = n!/m\(n — m)!, the number of m-dimensional axis planes in R”. 
Indeed, these axis planes give an orthonormal basis for RC). The (”) coordinates 
of any given m-dimensional plane  € G(m,R”) are just the areas of the projec- 
tions of a unit area in onto the (”) axis planes. Each coordinate must be signed + 
according to whether € and the axis plane have similar or opposite orientations. It 
turns out that the sum of the squares of these coordinates is always 1, so that 
G(m,R”") sits in the surface of the unit sphere in RO), 

The shape of the Grassmannian G(m,R”) ¢ R() is independent of the original 
choice of orthonormal basis for R”, which determines the ( ) axis planes. 
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Now bring a hyperplane in R% in from infinity until it first touches the 
Grassmannian. Usually it will hit at a single point, but sometimes there will be a 
larger contact set. In either case, this contact set is called a face of the Grassman- 
nian. In section 3.2 we will prove the following: 


1.1 THE Face THEOREM. Jf all of the tangent planes to an oriented m-dimensional 
surface S in R" lie in the same face of the Grassmannian G(m,R") C R®%, then S is 
area-minimizing. 


Fic. 1.1(1). A face of the Grassmannian (schematic). 


Let’s return for a moment to the Grassmannian G(1,R*) of oriented lines 


through the origin in the plane. G(1,R7) sits inside Ri) another copy of R*. The 
coordinates of the line at angle @ to the x-axis are the oriented projections of a unit 
length from that line onto the axes, namely (cos @, sin). Hence G(1,R7) is just the 
unit circle in R*. Its faces are all single points. The Face Theorem 1.1 just says that 
if all of the tangents to a curve are the same, 1.e., if the curve is a straight line, then 
the curve is length-minimizing. 


(cos @, sin @) 


Fic. 1.1(2). The Grassmannian of lines in R* is represented by the unit circle. 
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Fic. 1.1(3). Each face of G(1,R7) is a single point. 


Actually whenever m= 1 or n—1, the Grassmannian G(m,R”) is just the 
(n — 1)-dimensional sphere in R”. The faces are all single points. The Face Theorem 
just says that straight lines and hyperplanes are area-minimizing. 

In general dimensions, the Grassmannian G(m,R”) is an m(n — m)-dimensional 
submanifold of the unit sphere in RC), and it has some larger faces. These faces 
give rise to interesting area-minimizing surfaces. 

The main reference is the beautiful foundational paper of R. Harvey and H. B. 
Lawson [12]. There is also a nice survey by Harvey [11]. 

Before discussing the theory, we begin with some examples and applications. 


2. Examples and applications. 


2.1 Complex geometry. Let us identify R*” with C” and suppose 1 < m < n. One of 
the faces of the Grassmannian G(2m,R°*”) turns out to be the set of m-complex- 
dimensional complex planes in C”. Since the tangent planes to a complex analytic 
variety are all complex planes, it follows that a compact portion of a complex 
analytic variety is area-minimizing. For example, the complex curve 


C= {(z,w) © C?: w? = Zz, Iz/ +|wl <1] 


is a two-real-dimensional, area-minimizing surface in the unit ball in R*, bounded 
by the 2-3 torus knot 


dC = {(z,w) €C*: z=e", w = e 3/2) 0<6<47r}. 


C is a smooth submanifold with boundary, except for the isolated singularity at the 
origin. H. Federer’s observation [8], that all complex analytic varieties are area-mini- 
mizing, provided some of the first examples of singularities in area-minimizing 
surfaces. 
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2.2 Special Lagrangian geometry [12, Chapter III]. Again identify R?" = C”. The 
special Lagrangian face of G(n,R°") consists of the real plane & = {z © C”: 
Im z = 0} together with all of its images under the group SU, of special unitary 
maps: 


SU, = {n X n complex matrices 4: AA‘ = I and det A = 1}. 


This special Lagrangian face is an (n + 2)(n — 1)/2-dimensional subset of the 
n°-dimensional Grassmannian G(n,R7"). By comparison, for n even, the face of 
n-real-dimensional complex planes has dimension n?/2. 

An n-dimensional surface, with tangent planes in the special Lagrangian face, is 
called special Lagrangian. There are lots of these surfaces, and they must all be 
area-minimizing. For example, if M is any compact 2-dimensional minimal surface 
with boundary in R°, then its normal bundle, 


S = {(x,v) © R®:x © M and v isa vector normal to M at x}, 


turns out to be a 3-dimensional special Lagrangian surface in R°. Here M is not 
assumed to be area-minimizing, merely minimal. This term minimal means that the 
first variation of the area is 0, or equivalently, that the mean curvature is 0, or 
equivalently, that a parametrization in isothermal coordinates is harmonic. Even 
branch points in M are allowed, producing branch points in S. 

B. Cheng [3] recently has discovered some interesting special Lagrangian surfaces. 
Let m > 3. By thinking of SU,, as a set of m X m complex matrices, we may view 
SU, as a subset of the sphere of radius vm in C” = R?””. Cheng has observed that 
the cone C over SU,, 


C= {tx:x © SU,,0<t<1} 


is an m?-dimensional special Lagrangian surface in R*”’, bounded by SU,,. 

For m odd, this cone C has an interesting feature: the singularity at the origin is 
not real analytic. Indeed, if x © C — {0}, then tx © C if and only if ¢>0; 
whereas, if a real-analytic variety contains a half line, it must contain the whole line. 
This is possibly the simplest known example of an area-minimizing surface which is 
not real analytic. 

A function /:R” — R” whose graph is special Lagrangian satisfies a beautiful 
analog of the Cauchy-Riemann equations (see [12, III.2]). 


2.3. The angle conjecture [2, Open problem # 5.8]. A basic question in the study of 
singularities in m-dimensional area-minimizing surfaces asks which pairs D, D’ of 
oriented unit discs, centered at the origin, are area-minimizing. No pairs of 2-dimen- 
sional discs in R? are area-minimizing, as Figure 2.3(1) indicates. It happens that 
certain pairs of 2-dimensional discs in R‘ are area-minimizing. For the general case 
of m-dimensional discs in R”, the answer depends on the geometric relationship 
between the two discs, which can be described by m angles,0 < 6,<0,< --: < 
6, as follows. 

Choose unit vectors v, © D, vj © D’ to minimize the angle 0, between them. 
Choose unit vectors v, perpendicular to v, in D and v3 perpendicular to vj in D’, 
to minimize the angle @, between them. Continue. Finally, choose v,, to complete an 
oriented orthonormal basis for D, v/, to complete an oriented orthonormal basis for 
D’, and let @,, be the angle between them. 
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Fic. 2.3(1). Comparsion surfaces of less area than a pair of discs in R?. 


The angle conjecture says that the pair of discs is area-minimizing if and only if 
the largest angle is less than or equal to the sum of the others: 


6<0,+--- +6... 


Note that for a pair of discs in R°’, m = 2, the smaller angle 6, is always 0, and the 
condition @, < 0, = 0 fails to hold. 

D. Nance [16] recently has proved this condition sufficient by showing that such 
a pair of planes lie in a common face of the Grassmannian. The face turns out to be 
a clever generalization of the special Lagrangian face, based on the quaternions. 

More recently G. Lawlor [14] has proved the condition necessary. He shows that 
planes violating the condition are not area-minimizing by providing comparison 
surfaces of less area. These comparison surfaces are special Lagrangian, and hence 
actually area-minimizing. 

The proof of the angle conjecture is now complete. 


2.4. The classification problem. What are all of the faces of the Grassmannian 
G(m,R")? This classification problem has been solved only in low dimensions. 

For m= 1 or m =n — 1, the only faces are single points, and the associated 
area-minimizing surfaces are just lines and hyperplanes. 

For m = 2, every face consists of the complex lines in some even-dimensional 
subspace of R” (under some orthogonal complex structure). For example, one face is 
a round $* = CP! of complex lines in some R* = C*. For m = n — 2, a complemen- 
tary classification holds. Thus for dimension or codimension 2, nothing but classical 
complex geometry appears. 

The faces of the 9-dimensional Grassmannian G(3,R°), the first nonclassical 
case, were Classified in 1982 by J. Dadok and R. Harvey [4] and by [15]. In addition 
to the singleton and round $? faces inherited from lower dimensions, two new types 
of faces appeared. First, there are large, five-dimensional, special Lagrangian faces 
as previously described, with lots of associated three-dimensional, area-minimizing 
surfaces in R®°. Second, there are doubleton faces, consisting of certain pairs of 
three-planes. The pairs which occur are precisely the ones needed to prove the 
sufficiency of the Angle Conjecture condition for a pair of planes to be area-mini- 
mizing. 

The last Grassmannian to be classified to date is the 12-dimensional Grassman- 
nian G(3,R’) [13]. In addition to five types of faces inherited from lower dimen- 
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sions, five new types of faces appear. The largest faces, eight-dimensional associative 
faces, produce lots of 3-dimensional area-minimizing surfaces in R’. They are called 
associative because of a relationship with the partial associativity of octonion 
multiplication. 

A second new type of face consists of two round $*’s which intersect in a single 
point. The other three new types of faces are $!’s, $7’s, and $°’s which are not 
round. They provide the first examples of faces which are not totally geodesic in the 
Grassmannian. 


3. Theory of calibrations 


3.1. DEFINITIONS. Recall that a face of the Grassmannian G(m,R") C R() is 
defined as the initial contact set with a hyperplane translated in from infinity. The 
hyperplane is the level set of some linear function g on R (a), 


Fic. 3.1(1). A face is a level-set of a linear function Q. 


It is easy to check that the linear functions on R ‘) are precisely the constant-coeffi- 
cient, differential m-forms on R”. Indeed, an orthonormal basis for R Cn) is given by 
the axis m-planes in G(m, R”) C RG): 


{ e, Ast: Ae; i <<: <in <n}; 
and the dual basis for the constant-coefficient differential m-forms is given by 
{ dx; A+++ Ndx,; :1<ip< +e: <i, <n}. 


Thus a face of the Grassmannian G(m,R”) CR (.) is the set of points in G(m, R”) 
where an m-form attains its maximum value 


llpl* = max{ (é): € © G(m,R")}, 


called the comass of ». A (constant-coefficient)m-form gq, normalized to have 
comass 1, is called a calibration. The faces of the Grassmannian are precisely the 
sets of maxima of calibrations 9. 


3.2. THE FACE THEOREM [12, Introduction]. Let S be an m-dimensional surface 
with boundary in R" such that every tangent plane to S lies in a common face of the 
Grassmannian. Then S is area-minimizing, i.e., no other surface T with the same 
boundary has less area. 
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Proof. Since there is a face of the Grassmannian containing every tangent plane 
to S, there is a calibration @ such that m(€) < 1, with equality for € tangent to S. 
Hence /,g = area S, while {;g < area T. Since @ is a constant-coefficient 
differential form, dp = 0. Hence it follows by Stokes’s Theorem that {oy = /7q, 
because S and 7 have the same boundary. Therefore, 


area S = fo- [oo < area T, 
Ss T 
as claimed. 


3.3. Examples of calibrations. This section gives explicitly the calibrations that yield 
the faces of Grassmannians heralded at the beginning of the article. 
Complex lines in C” comprise the maxima of the Kahler form 


w= “(de A dz, + +++ +dz, A dz,). 


= dx, A dy, + 7 +dx, A dy,,. 


Similarly, complex m-planes in C” comprise the maxima of w’/m! (here normalized 
to have maximum value 1). These facts follow immediately from Wirtinger’s 
Inequality (cf. [6, 1.8.2, 5.4.19]). 

Special Lagrangian m-planes in R*” = C™ comprise the maxima of the special 
Lagrangian form 


m = Re dz = Redz, A -::- A dz,,. 
= Re(dx, + idy,) A --- A(dx, + idy,). 
For example, in R°, 
m = dx, A dx, A dx, — dx, A dy, A dyy — dy, A dx, A dy3 — dy, A dy, N dx3. 


R. Harvey and H. B. Lawson [12] discovered this and other beautiful calibrations in 
1976. 

The Nance forms, used to prove half the Angle conjecture, generalize the special 
Lagrangian form by replacing i by more general imaginary unit quaternions 
uy, eees Un: 

gp = Re(dx, + u,dy,) A +++ A( dx, + Up»DWn)- 
Given a pair of m-planes whose characterizing angles satisfy the criterion 


6,<6,4+ ---4+6,-1, 
a calibration can be constructed as follows. The inequality lets us construct a 
polygon in the 2-sphere of imaginary unit quaternions with sides 6,,...,6,,, as 
shown in FIGURE 3.3(1). Now the u,,...,u,, for defining » are just the exterior 
poles of the corresponding arcs. (For example, if @, lies on the equator, and the 
polygon lies above the equator, u, is the south pole.) The whole proof [16] is short 


and elegant. 
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Fic. 3.3(1). Admissible angles form the side-lengths of a spherical polygon. 


J. Dadok and R. Harvey [5] recently have used spinors to construct new 
calibrations. 


3.4. Calibrations on manifolds. The following theorem shows how the theory of 
calibrations, presented above in Euclidean space, extends to general Riemannian 
manifolds. 


THEOREM [12, II.4]. Let S be an oriented m-dimensional surface, with or without 
boundary, in a smooth Riemannian manifold M. Let g be a smooth closed differential 
m-form on M. Suppose that for all x © M, for all unit m-planes &, p(x)(€) < 1, with 
equality if € is tangent to S at x. Then no oriented surface T homologous to S has less 
area. 


The proof is the same as for Theorem 3.2. 


3.5. Least-area representatives of homology classes. Theorem 3.4 sometimes can be 
applied to find explicitly surfaces of least area in homology classes in Riemannian 
manifolds. If the manifold M is complex and the Kahler form w is closed, 1.e., if M 
is a Kahler manifold, then by Wirtinger’s inequality the normalized powers w?/p! 
of w calibrate all complex subvarieties. Hence these complex subvarieties minimize 
area in their homology classes. For example, every complex algebraic subvariety of 
complex projective space CP” is homologically area-minimizing. 

In 1972 M. Berger [1, §6] established an analog of Wirtinger’s inequality for a 
quaternionic form {2. Consequently, if M is “quaternionic-K4dhler,” so that M 
admits‘a closed quaternionic form Q, then the normalized powers of () calibrate all 
quaternionic subvarieties. Hence these quaternionic subvarieties minimize area in 
their homology classes. For example, every quaternionic subvariety of quaternionic 
projective space is homologically area-minimizing. 

A. T. Fomenko began the search for homologically area-minimizing surfaces in 
compact Lie groups and symmetric spaces. He proved for example that inside the 
Lie group M = SU,, the subgroup SU, is homologically area-minimizing [9, 
Corollary 8]. Dao Chong Thi [20] used methods equivalent to calibrations (cf. [12, 
V.1]) to obtain this and other new examples. It is still an open question whether SU,, 
is homologically area-minimizing in SU, for 2 < m <n. 
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As a final example, let M be the Grassmannian G(k,R/’”) of oriented k-planes 
through the origin in R ?. Sub-Grassmannians, as totally geodesic submanifolds, are 
excellent candidates for least-area representatives of homology classes. For k = 2, 
G(2,R”) is a Kéhler manifold, and the various sub-Grassmannians G(2,R‘) 
(q < p) are complex submanifolds and hence homologically area-minimizing. For 
k = 4, G(4,R 7”) is a quaternionic-K4hler manifold, and the various sub-Grassman- 
nians G(4,R”) (q < p) are quaternionic submanifolds and, hence, homologically 
area-minimizing. More recently, by using new calibrations related to the Euler form, 
H. Gluck, W. Ziller, and the author [10] have proven that for any even k, G(k,R7) 
is homologically area-minimizing in G(k,R ”). 

However, G(2,R*) is not homologically area-minimizing in G(3,R°), despite the 
fact that it is totally geodesic and homologically nontrivial. (The inclusion of 
G(2,R*) in G(3,R°) is given by § > £ A eg.) 

The analysis leads to an open question involving the special orthogonal group 


SO, = {3 X 3 real matrices A: AA’ = J and det A = 1}. 
Question: Does the cone C over SO;, 
C= {ix € R°: x € SO;,0<t< 1}, 


minimize area among all oriented four-dimensional surfaces bounded by SO;) 
This stubborn question seems to require a new idea for its solution. 


Added in proof. G. Lawlor has proved that the cone C over SO, is area-minimiz- 
ing by a powerful new method. However, many questions remain open. 


The author acknowledges partial support from the National Science Foundation. 
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Let M be a compact Riemannian manifold. The Laplace-Beltrami operator (or 
Laplacian, for short) is the operator 


A(f) = —div(grad f). 
One way of understanding this operator is by the Heat Equation. If f(x) denotes 
the temperature at time ¢ and a point x, and if we assume that heat travels in the 
coolest direction, then f(x) satisfies the equation 


0 
—f+Af=0 
71 + Af 


as one sees from the usual physical interpretation of div and grad. 

Now suppose that f,(x) is an eigenfunction of A with eigenvalue X, that is, fh 
satisfies 

A( fo) => fo. 

Then the function f(x) = e~*“f,(x) will satisfy the Heat Equation. One may think 
of f, describing a “heat wave” with “frequency” e~”. 

Standard results in partial differential equations tell one that there are countably 
many eigenvalues ’, and for each A, a finite-dimensional family of eigenfunctions 
;, sO that A, and f, satisfy 


Afi) = fi. 
Furthermore, the A,’s are positive, and tend to oo as i > oo. The collection of 
values {A,}, together with the multiplicities with which they occur, is called the 
spectrum of M. 
An arbitrary function f(x) may then be decomposed into its “fundamental heat 
waves” 


f(x) = Las(x), ACL) =A,(f) 


in exactly the same way that the motion of a string decomposes into its fundamental 
overtones, the sine and cosine functions of Fourier analysis. 

If one considers that sound satisfies a similar equation to that of heat, then one 
may regard the A,’s as the sounds emitted by M, if one struck M with a mallet. 


*Partially supported by NSF grant DMS-8501300 
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A natural question, in the best tradition of Riemannian geometry, is to ask for 
the geometric significance of the numbers A, whose existence we found purely from 
analysis. It is hard to imagine this question not being asked even by the more 
speculative-minded of the Pythagoreans, but in the 20th century this question took 
on increasing interest, as the tools to answer it were developed. 

In the first half of this century, there was a great deal of speculation that the 
eigenvalues {A,} might completely determine the geometry of M. This was perhaps 
fueled by the striking result of Hermann Weyl [22] that the eigenvalues determine 
such geometric invariants as the volume of M. In his beautiful survey article [15], 
Mark Kac framed this train of thought in the question, “Can you hear the shape of 
a drum?” 

In recent years, attention has been focused on examples which show this 
speculation to be somewhat overambitious. Let us say that two manifolds are 
isospectral if the eigenvalues of the Laplacian are the same for both M, and M.,. 
Then we will see below how one may give rather explicit constructions of isospectral 
manifolds which are not isometric. 

The first example of isospectral manifolds which were not isometric was found by 
John Milnor in [17]. We will discuss his example briefly in § 1 below, but for the 
present let us say that his method was to reduce the isospectral problem for certain 
classes of manifolds to a number-theoretic problem that had a well-known solution. 
His method was taken up, after a long period, by Ikeda [14] and Vignéras [21], who 
were able to find examples of isospectral manifolds in other classes, by similarly 
looking for solutions to a companion problem among classical examples. 

It is fair to say that these examples all had the flavor of happy, if somewhat 
exotic, coincidences, and it seemed as if the search for isospectral manifolds was 
leading mathematicians into increasingly esoteric subjects in order to find such 
coincidences. 

The situation changed dramatically in [20]. Here, Sunada showed how the 
phenomenon of isospectral manifolds could be understood in a systematic way. His 
insight was to reduce the problem to a straightforward problem in the theory of 
finite groups. This reduction is quite simple, and we will present a complete 
argument in § 2 below. Furthermore, the problem in group theory admitted large 
numbers of relatively straightforward solutions and was understood in a fairly 
systematic way. Indeed, this same group-theoretic problem arose in a basic problem 
in number theory—see [24] and [25]. 

At this point, the construction of isospectral manifolds became fairly simple, and 
actually rather fun. 

In this article, we would like to share these developments with you. We hope to 
share some of the excitement that occurs when certain kinds of objects which were 
thought not to exist—or at least to be extremely rare—can be constructed at home 
using Scissors, paper, and tape. 

The literature on the spectrum of the Laplacian is quite extensive, and we will not 
make an attempt to give a complete overview of it. This topic has been the subject 
of three previous review articles in the American Mathematical Monthly ({9], [15], 
[23]). A comprehensive bibliography concerning the spectrum of the Laplacian as of 
1982 has been compiled by Bérard and Berger in [1]. Furthermore, the foundational 
material of the subject has been collected in at least two monographs, [2] and [6], 
which we heartily recommend. 
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We would like to thank K. Layman and R. Perlis for their critical readings of this 
paper. 
1. The Jacobi Inversion Formula 


Among the simplest manifolds are the flat tori. To describe them, let us pick two 
points (x,, y,) and (x, y,) in R* which are linearly independent. Let A be the 
lattice generated by (x,, y,) and (x,, y,)—that is, all linear combinations m - 
(x1, ¥,) + n- (x5, y). Then the torus T* is determined by identifying points which 
differ by an element of A: 


T?=R°/A. 


Topologically, T? is the surface of a doughnut, as one may see by cutting out the 

parallelogram spanned by (x,, y,) and (x5, y,), and gluing opposite sides together. 

Furthermore, two lattices A, and A, give rise to the same torus if and only if A, 

differs from A, by a rotation. There is a 3-dimensional family of distinct flat tori. 
In this section, we will prove: 


THEOREM. Any two flat tori which are isospectral are isometric. 


This theorem is basically the Jacobi inversion formula, which we will describe 
below. Although this theorem seems a little outside the mainstream of our discus- 
sion, because it is a theorem showing that certain manifolds cannot be isospectral, 
all of the ingredients of the subsequent theory can be found here, at least in some 
form. 

A function H,(x, y) will be called a fundamental solution of the Heat Equation 
(or a heat kernel) if 


7) 
(a) a1 + A. H=0 
(b) lim, 9 f Hix, y)£(y) dy = FC) 


for any smooth function fon M. 
From (a) and (b), we see that the general solution f(x) of the heat equation with 
initial condition f,(x) =f is given by 


f(x) = J acs, y) f(y) ay. 


We will need the following properties of H,: 

(i) H,(x, y) is uniquely determined by (a) and (b). 

(ii) If M is compact, and { f,} is an orthonormal basis of eigenfunctions with 
eigenvalues A, then 


H(x,y) = Le ™£(x) f(y). 


We now want to use (ii) to describe the A,’s, but we need to eliminate the f,’s 
from the picture. To do this, let 


tr(H,) = J BCs. x) dx. 
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Then 
tr(H,) = J Vie f(x) dx = ref f? dx 
M M 
= Yie~*" (since the f,’s are orthonormal). 


Now we claim: tr(,) determines all of the A,’s. 


Proof. Assume Xo, A,,..., A, have been found. Then A,,, is the largest value A 
such that 


k 
tr(H,) — > eM 
i=0 


lim 5 is finite. 
e 


 iomal?,2) 
Now suppose that M’ is a covering of M—this means that there is a map 
gy: M’-—>M which near every point is an isometry. Such a manifold M’ is 
determined by a subgroup 7,(M7’) of the fundamental group 7,(M). When 7,(M’) 
is a normal subgroup of 7,(M), M’ is said to be a normal covering of M, and the 
quotient group G = 7,(M)/7,(M’) acts on M’ as a group of isometries of M’. G 
is called the covering group. 
In this situation, we would like to solve for the heat kernel H,“(x, y) of M in 
terms of H!"(x, y): 


Lemma. Hx, y) = Lee; (x, g-Y) where X and y are points of M’ covering 
x and y. 


Proof. It is evident that the right-hand side satisfies (a) and (b) above. The 
lemma now follows from the uniqueness statement of (1). 
We now Specialize this discussion to the case M a flat torus, that is 


M=R?/A, A a lattice 


and M’ = R?. We observe that H® (x, y) is given by the “Gaussian distribution” 


2 1 2 
HP (x,y) = laale" me 


so from the lemma, we see 


1 ; 
we)=(2) Een 


LEA 
and so 
area( M , 
tr(H™) = | yell “), 
Ant le 


We may now argue as in the claim, but now letting t — 0 + instead of — oo, to 
show that tr(H,”) determines area (M) and 


{|Jj: LE A}. 
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Since (H/’) depends only on the eigenvalues {A,}, we conclude: 

If two flat tori M, = R*/A, and M,=R?/A, are isospectral, then area 
(M,) = area(M,), and {|/|: /e© A,} = {|/|: Je A,}. 

To complete the proof of the theorem we observe that a lattice in R? is 
determined up to isometry by its shortest length, the shortest length of a linearly 
independent lattice point, and the area of the parallelogram they span. Since all of 
these are determined from tr(H,”), which in turn is determined from the ),’s, we 
see that M is determined from the 4,’s, proving the theorem. 

To complete our discussion, we observe that for a flat torus, we may explicitly 
solve for the A,’s as follows: given a lattice A in R*, the dual lattice A* is the set 
/* © R? such that /* -/ is an integer for all / © A. Then the eigenfunctions f, are 
given by 


f(x) =e?" for * © A* 


with eigenvalue 477|/*|*, so that 


tr(H,.”) = 3 oT 4m yt 
i*e A* 


and we conclude the Jacobi inversion formula: 


yee area( M ) | 
e = ——— 
i*e A* Ant 


3 ew) 


ieEA 


It is evident that this argument generalizes to any dimension n. There were only 
two ingredients of this argument which were two-dimensional. The first of these was 
the expression for the heat kernel for R*. But the heat kernel for R” differs only 
slightly from the heat kernel for R?. The second is the assertion that a lattice in R? 
is determined by its area and lengths. This argument breaks down in higher 
dimensions, but we are left with the conclusion: if 7, = R”/A,, and T, = R"/A, 
are isospectral, then the volumes of 7, and T, are the same, and the set of lengths of 
lattice points are the same. 

Milnor observed in [7] that there were two lattices in R'° which were well known 
to number theorists to have this property, and he concluded that there were two 
16-dimensional flat tori which are isospectral. 

One can generalize this argument for any manifold for which you can write down 
the heat kernel explicitly—for instance, symmetric spaces of noncompact type. For 
the hyperbolic plane, the same argument gives (after some more elaborate calcula- 
tion} the Selberg trace formula ({16] and [18]), which says that, when S is a surface 
of constant curvature —1, the eigenvalues of S determine the lengths of all the 
closed geodesics of S, as well as the area of S. 


2. The Sunada Trace Formula 


In this section, we will show how to extend the ideas of the last section in order 
to construct isospectral manifolds. In fact, the striking thing about this argument is 
that it is in fact a simplification of the previous argument, since it will involve only 
finite groups, and hence finite sums. Furthermore, it will not involve computing any 
complicated integrals or functions. 
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Let us start with a normal covering M’ of M, which we now assume to be a finite 
covering. As before, we wish to solve for H.“(x, y) in terms of H,“ (x, y), and as in 
the lemma, we have 


HM@(x,y) = HM (%, 5). 
geEG 


Setting x = y and integrating over M, we have 
1 
tr(H™”) = | 4x, x) d& = —— H™(&, eX) dX. 


We now observe that the terms in the right-hand side are unchanged when g is 
replaced by a conjugate hgh~' of g, since: 
(a) for any isometry h of M’, 


HM (hk, hy) — Hy" (&, y) 
(this follows from the uniqueness statement (i) of § 2) so that 
(b) / H™(&, hgh") d¥ = i] HM (h-%, gh) dx 
M’ M’ 
(by (a)) 
= f H/"'(%, g(X)) d& (changing variables). 
Mt 


We conclude that 


where [g] denotes the conjugacy class of g in G. 
Now suppose we have a tower of coverings 


where M’ is a normal covering of M, (with covering group #7), and also of M, (with 
covering group G), but M, is not necessarily a normal covering of M,. 
Then we have from the discussion above that 


# ([h 
tr( HM) = = Say Lee hX) dx, 


but we may now use the symmetries of G to observe that if h and h’ are conjugate 
in G, then 


[ HAs, hk) dz = [ H,(&, h'%) dz 
M’ M’ 
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so that 


MY #([g] 0 H) 
tr(H; ) = X, #(H) 


Now the point is that the integrals on the right-hand side, as uncomputable as 
they may be, depend only on G, and not on H. Only the coefficients depend on H. 
So if we can find two subgroups H, and H, of G satisfying 


for all g € G, #([g] N A,) = #([g] N A) (}) 


then the corresponding manifolds M, and M, will have identical tr(.H,) and hence 
will be isospectral. 
We have thus proved: 


J H.(&, g&) dk. 
w 


THEOREM ([20]). Let us choose groups G, H, and H, satisfying (7), and 
M,, M,, M,, M’ a diagram of covering space as follows: 


H M’ H 
Yo) 
M, M, 
NZ 
M) 
Then M, is isospectral to M,. 


It remains to check that M, is not isometric to M,. If H, is conjugate to H, inG 
(or even in a larger group of isometries of M’) then of course M, will be isometric 
to M,. But it is easy to check that if H, 1s not conjugate to H,, and the metric on 
M, is sufficiently “bumpy” so that M’ has no extra isometries not in G, then M, is 
not isometric to M,. 


3. Some Finite Groups 


It now remains to show that there are examples of groups (G, H,, H,) satisfying 
the condition (f). Such a condition may seem strange to an analyst, but such groups 
are known in abundance to finite group theorists. We present below an arsenal of 
examples, taken from [3], [12], and [20]: 


Example 1. Z/8 X (Z/8)*. 

We consider ordered pairs (a, b) with b € Z/8, a = 1, 3, 5, or 7, with multipli- 
cation rule (a, b) - (a’, b’) = (aa’, a’b + b’). 

The group G thus defined is the semi-direct product of Z/8 and (Z/8)*, and 
has 32 elements. Let H, = {(1, 0), (3, 0), (5, 0), (7,0)} and AH, = {(1, 0), (3, 4), (5, 4), 
(7,0)}. It is easy to check that (G, H,, H,) satisfies (+), and that H, is not conjugate 
to H, inG. 


Example 2. G = SL(n, Z/p), n > 3, 


* OK OK OK * 000 
0 * 
M=| 9 yx] o=le x 
0 * 
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To see that #((g] N H,) = #([g] A H,), notice that g will appear in H, if and 
only if 


oo Cr 


is an eigenvector of H,, and hence the number of times [g] meets H, depends only 
on the number of eigenvectors g has. Since this is the same for g and its transpose, 
#([g] AN H,) = #(g] 0 A,). In this example, note that H, is conjugate to H, 
under the outer automorphism g — (g‘)~', but that if n > 3, H, is not conjugate 
to H, in G. To see this last, note that if m > 3, the elements of H, do not have a 
common eigenvector (when n = 2, the vector (°] is an eigenvector of all of H,). 


Example 3. Let G be the symmetric group on p? letters, and let H, = (Z/p) ® 
(Z/p) ® (Z/p) and H, be the “mod p Heisenberg group” 


1 a b 
A,=(‘10 1 ec 


:a,b,c,€ 2/0) 
0 0 1 


We may view H, and H, as subgroups of G by the Cayley-Sylvester theorem, which 
says that any finite group H is a subgroup of the symmetric group on #(/Z) letters. 
We claim that both H, and H, share the following property: any nonzero element 
of H,, i = 1,2, has order exactly p. For H, this is obvious, and for H, this can be 
verified from the formula 


(k)(k — 1) 


1 a b\* 1 k-a k-b + 

0 1 cc] = 2 

00 1 0 1 k-c 
0 0 1 


We now claim that #([g] N A,) = #([g] N H,) for all g © G. But the conjugacy 
class of g is determined by its decomposition into cycles, and for [ g] to meet either 
H, or H,, it must consist of p* p-cycles, in which case #([g] N H,) = #([g] A A) 
= p? — 1, or g = id, in which case #([g] N H,) = #([g] N A>) = 1. Note that not 
only is H, not conjugate to H,, but H, is not even abstractly isomorphic to H,, 
since H, is abelian and H, is not. 


Example 4. We may extend example 3 by letting H, and H, be any two groups 
of order p%, such that each nonzero element has order exactly p, and letting G be 
the symmetric group of order p*. For instance, taking a large and considering the 
H,’s to be direct sums of Heisenberg groups and (Z/p)°’s, we see that there are G’s 
with arbitrarily many subgroups H,, all pairwise satisfying (+) and all pairwise 
distinct. 

The above collection of examples will convince the reader that groups satisfying 
(+) exist in abundance. Somewhat surprising, then, is the fact that groups satisfying 
(+) can be more or less systematically understood. See [12] for a discussion. 
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4. Isospectral Manifolds 
The theorem of § 2 and the examples of § 3 have now amply illustrated: 
THEOREM. There exist manifolds M, and M, which are isospectral but not isometric. 


Proof. It suffices to take a triple of groups (G, H,, H,) satisfying (f) and a 
sufficiently bumpy manifold M, with a surjective homomorphism g: 7,(M,) > G. 
We may then take M, i = 1,2 to be the covering of M, with fundamental group 
yp *(H;). 

For mstance, we may use the fact that every finitely presented group occurs as 
the fundamental group of a compact manifold. Then, using Example 3, we may find 
M, and M, which are not even homeomorphic—they even have different funda- 
mental groups. 

An important class of manifolds to which the Theorem of § 2 does not apply 
directly is the class of Riemann surfaces—surfaces equipped with metrics of 
constant curvature. The constant curvature condition restricts how “bumpy” one 
can make the metric on M), so it is not clear that M, and M, so constructed can be 
made non-isometric. Fortunately, surfaces are sufficiently simple to allow one to see 
directly that they are not isometric. It is an interesting challenge to find examples 
which are particularly simple— that is, with smallest possible genus. 

The best we can do along this direction 1s: 


THEOREM ({[4], [5)). 
a) For each genus g > 4, there are Riemann surfaces S, and S, of genus g which are 
gS gS 1 298 gS 
isospectral but not isometric. 
b) There exist surfaces S, and S, of variable negative curvature and genus 3 which 
1 2 §' & 
are isospectral but not isometric. 


The case of genus 5 and genus > 7 was done by Buser in [5]. Here we will sketch 
the construction of genus 4. 

In order to construct examples of low genus, one must first make a nice choice of 
groups (G, H,, H,). Since the area of a Riemann surface is 47(g — 1), by the 
Gauss-Bonnet theorem, and since areas multiply under coverings, we want to choose 
the genus of M, small and the index of H, in G small. 

In fact, we will want to choose the genus of M), so small that M, must be 
singular. We will then have the further problem of guaranteeing that the singulari- 
ties of M, smooth out in the coverings M, and M,. 

For our construction, we will take the group G = SL(3, Z/2). We choose this 
because in this case [G: H,] =[G: H,] = 7, and this can be shown to be the 
smallest index which can occur for groups satisfying (7). 

Fot our base manifold M, we will choose a torus with one singular point of order 
k, where k will be determined in a moment. Then the fundamental group (suitably 
interpreted, because M, is singular) is 


{A, B,C: ABA1B-1=C, CK =1}. 


A moment’s reflection will show that one must impose special conditions on k in 
order for the singularity of M, to resolve itself in M, and M,. The condition is that 
any element of z,(M,) corresponding to a singular point of M, should act freely on 
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the cosets of G/H,, i= 1,2. For the group G given above, one verifies that the 
condition is that k = 7. 

We now seek two matrices A and B in SL(3, Z/2) which have the properties 
that: 

(i) A and B generate G. 

(ii) ABA~'!B™! is of order 7. 

One must be somewhat clever to find such matrices, but they do exist. One choice 
iS: 


0 1 1 1 0 0 
A=|0 1 0 B={0 0 1}. 
1 0 0 0 1 1 


To construct our surfaces M, and M,, we break open our torus M, to get the 
six-sided polygon shown in- FIGURE 1. The reader can check easily that when one 
identifies the sides corresponding to the same letters one obtains a torus with one 
singular point. 

Now make seven copies of this polygon and label them according to the cosets of 
H, in G. A convenient way of doing this is by identifying G/H, with column vectors 
which are 0 or 1, but not all 0, by the identification 


1 
{0} 


0 


We then glue the polygon labeled >) with the polygon labeled A , along the 


edges corresponding to the foot and head, respectively, of the arrow labeled A in 
FiGure 1, and similarly for B and C = ABA~'B7~!. When we are done, we obtain 
the gluing diagram for M, shown in FIGURE 2. 


Fic. 1 
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FIG. 2 


We now do a similar construction to obtain M,, but here we identify the cosets 
G/H, with column vectors, where a matrix g © G acts on a column vector by 


(x,y,z) > (x,y z)(8') 


where g’ is the transpose of g. 

Rather than find the inverses of A’, B’, and C’, it turns out to be convenient to 
merely find the transposes of A, B, and C, and simply reverse the senses of the 
arrows in FIGURE 1. In this way, we obtain the gluing diagram for M, shown in 


FIGURE 3. 
J. 


Fic. 3 


To see that these surfaces are not isometric, notice that there is a short geodesic 
joining the two sides labeled 1, whose length is different from any other geodesic. 
Also, in FIGURE 2 there is a geodesic which joins the two pairs labeled A and B, 
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which again has length different from any other geodesic, and an analogous 
geodesic in FIGURE 3 joining the sides labelled E and F, which has the same length 
as the geodesic in FIGURE 2. In FIGURE 2, these two geodesics cross, but in FIGURE 
3 they do not. Hence M, 1s not isometric to M,. 

It is difficult to see directly that these surfaces are isospectral. However, one may 
see readily that the set of lengths of closed geodesics is the same for both, by 
drawing closed loops on one and looking for the analogous loops on the other. 

A somewhat simpler, but completely analogous, construction gives the genus 3 
examples mentioned above. Here we begin with a sphere with three singular points, 
rather than a torus with one singular point, and the problem in group theory is to 
find matrices A and B in SL(3, Z/2) such that: 

(i) A and B generate SL(3, Z/2) 

(ii) A, B, and AB are all of order 7. 

One such choice is: 


1 0 1 0 QO 1 
A=]1°1 0 B=)/1 0 OJ]. 
0 1 QO 1 1 0 


The resulting gluing diagrams are shown in FIGURES 4 and 5. 


It turns out that if one starts with a metric on the sphere with three singular 
points which is too symmetric (for instance, the unique metric of constant curvature 
—1), then M, will be isometric to M,. To see this, one may take the gluing diagram 
of FiGuRE 4 and reflect it in a mirror. After relabelling the letters in a suitable way, 
one sees that FIGURE 4 becomes the same as FIGURE 5. However, if the metric on 
the sphere is sufficiently bumpy, it will not be respected by reflection in the mirror, 
and M, and M, will not be isometric. 

It is evident that this and similar constructions are mechanical, and I have found 
it amusing and simple to carry out some of these constructions with paper and 
scissors. Here is my favorite recipe to do this—the reader may enjoy trying his own: 
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Fic. 5 


Let us begin with the group (2/8) xX (Z/8)* of Example 1. Here we may 
identify G/H, = G/H, = Z/8, by noting that (a, b)- (a,0) = (1, ab), ab © Z/8. 

Now we consider the three-pronged figure of FIGURE 6. Each prong is identified 
with a generator of G (notice that G cannot be generated by two elements). For ease 
in pasting, we have taken each generator to be of order 2. 


(3, 2) 


(7, 1) 


(5,4) 
Fic. 6 


Now make eight copies of FIGURE 6, and label them from 0 to 7, according to the 
cosets of G/H,. Then glue them together according to the action of the generators 
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A B C C D E F G 
0 ! K y'tJ IM 3 O 
J L N P 
Q R S S T U V V 
E D H H B A G F 
f 
P N L J 
4 5 6 7 
O M K I 
R O Ww Ww U T xX. X 


Fic. 7 


on these cosets. Instructions for doing this are contained in FIGURE 7. Finally, 
repeat this with eight more copies, now using the gluing instructions from FIGURE 8, 
to obtain a second figure isospectral to the first. 

For some of the fine points of paper constructions, see [5]. 


A B C D B A E E 
M O 
9) J 1 K 2 ao 
J L N | P 
| | 
Q Q R S T T U V 
FG D C G F H FA 
N L J 
P 6 
O M K I 
Ww Ww S R X xX V U 


Fic. 8 


5. Isospectral Deformations 


A natural question arises from our discussion of § 4. We saw how one can 
construct examples of two manifolds M, and M, which are isospectral but not 
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isometric. Indeed, using Example 4 of § 3, one may find arbitrarily many manifolds 
M,,..., M,, all of which are isospectral but none of which are isometric. Is it 
possible to find an infinite family M,, for ¢ ranging over the real numbers, such that 
all the M,’s are isospectral, but none of them are isometric? 

To understand the extra challenge of this question, we observe that a theorem of 
Guillemin and Kazhdan [11] asserts that this can never happen when the M,’s have 
negative curvature. On the other hand, the construction of § 4 works perfectly well 
with manifolds of negative curvature, and indeed this is the easiest case, for then the 
fundamental groups are complicated enough to do the construction of § 4. 

The problem of finding such isospectral families was solved by Gordon and 
Wilson in [10], and indeed their work came before the work of Sunada [20]. But it is 
not difficult to understand what they did in the framework we have set up. One may 
think of their construction as an analogue of Sunada’s theorem for infinite groups. 

Recall that the group-theoretic part of the argument of § 4 was to construct 
groups G, H, and H, satisfying the condition (+), such that H, was not conjugate 
to H, in G. One way to réalize condition (}) is to take an automorphism of G 
which takes H, to H,, such that » takes each element / in H, to an element p(h) 
which is conjugate to h, but in such a way that the element of G which conjugates h 
to g(h) varies as h runs through H,. Such an automorphism we will call an 
almost-inner automorphism of G (an inner automorphism being one which is 
conjugated by a fixed element of G). 

Now, certain Lie groups admit families py, of such almost-inner automorphisms, 
and it was the observation of Gordon and Wilson that such families could be used 
to construct isospectral families. Perhaps the simplest such Lie group is 


1 x, x, z 0 0 O 
01 0 y, 0 0 O 
00 1 y 0 0 O 
0 0 0 1 0 0 OJ]! Xt. %2> Vir Ya. 2122 ER 
0 0 0 0 1 x, 2, 
00 0 0 0 1 yp 
0 0 0 0 0 0 1 
with almost-inner automorphism 
1 x, x, z 0 0 O 
0 1-0 y 0 0 0 
00 1 y 0 0 O 
G(x1.%2 Ie %22)=10 0 0 1 00 +0 
0 0 0 O0 1 x, 24+ tx, 
00 0 0 0 1 =¥»y 
00 0 0 0 0 1 


The construction of isospectral families from this example of an almost-inner 
automorphism of G is carried out in detail in [8], to which we refer the reader for 
details. 
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6. The Future 


It happens occasionally in mathematics that a problem that has vexed mathema- 
ticians for decades is so completely understood that it is difficult to look to the 
future. The present circumstance is one of those occasions. It appears that using 
some variation of these ideas allows one to find relatively easily examples that were 
difficult to imagine in the past (see, for instance, [3] or [19] for examples of this). 
Here, however, we will make the attempt to guess what may be profitable lines of 
research for the future. 

A natural question is whether the techniques given here exhaust the methods of 
finding isospectral manifolds. Historical experience suggests that the answer to this 
question should be “no,” but perhaps in sufficiently well-defined cases the answer 
may be “yes.” For instance: 


Question: Suppose S$, and S, are two isospectral Riemann surfaces. Are there 
Riemann surfaces S’ and Sj such that S$, and S, are sandwiched between them, as 
in the Sunada construction? We remark that the analogous problem in the number- 
theoretic case was settled in the affirmative by Robert Perlis [25]. 


Question: We constructed in § 4 two isospectral Riemann surfaces of genus 4. Are 
there examples of genus 2 or 3? 


This question really has two parts: the first part asks if the group-theoretic 
construction given here for genus 4 can be made to work in lower genus. One can 
write down a finite (but long) list of all the possibilities, and in principle one can 
check each possibility individually. There are a few cases that almost work, but 
many of the cases are easily shown not to work. The bulk of the cases are 
unchecked. The second part of the question is to ask whether in genus 2 or 3 there 
might be other constructions which will work. Perhaps one can show that there are 
no isospectral Riemann surfaces of genus 2. See [13] for some ideas which tend in 
this direction, and which may or may not be able to be pushed forward. 

A direction which promises to bear fruit in the future is the study of the small 
eigenvalues, such as A,. A beautiful characterization of A, in terms of geometry was 
found by Cheeger in [7]. One believes that the behavior of A, can play an interesting 
role in studying groups and manifolds which arise in number theory. But a complete 
elaboration of even the interesting questions of this line of thought is outside the 
scope of this paper. 

A major message of the work described in this paper is the impact of finite group 
theory on fundamental problems in geometry and analysis. Since finite group theory 
is so well-developed, but in language quite far from the practicing geometer, it 
makes sense to ask: 


Question: What do they know that we don’t? 
More precisely: 
Question: What do they know that we are afraid to ask? 


One cannot help but conclude with the observation that the techniques given here 
do not work for domains in the plane R?. So we can only repeat the question of 
Mark Kac: 


Question: Can you hear the shape of a drum? 
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The Editor’s Corner: n Coins in a Fountain 
ANDREW M. ODLYZKO* AND HERBERT S. WILF** 


In Richard Guy’s article in last month’s Monthly [6] there appeared a number of 
elegant questions, one of which we will answer here. 

An (n,k) fountain is an arrangement of n coins in rows such that there are 
exactly k coins in the bottom row, and such that each coin in a higher row touches 
exactly two coins in the next lower row. In FIG. 1 below we show a (28, 12) fountain. 
Let f(n, k) be the number of (n, k) fountains, and let f(n)=%,f> ©) be the 
number of fountains of n coins. 


Fic. 1. A (28, 12) fountain. 


The question of determining { f(”)} was raised in [6], and attributed to J. G. 
Propp. The values of f(n) for 1 <n < 18 are 1, 1, 2, 3, 5, 9, 15, 26, 45, 78, 135, 234, 
406, 704, 1222, 2120, 3679, 6385. 

We will show that { f(7)} has the remarkable generating function 


1 
F(x) =1+ )) f(n)x" = ————x——_. (1) 


n>1 1— 


This continued fraction was first studied by Ramanujan (see [1], [2], [4], [7]). That 
the power series coefficients of this fraction have a combinatorial meaning is not 
new. Already in 1968 Szekeres [8] found such an interpretation and others were 
found later by Flajolet [5]. However the interpretation as the number of n-fountains 
seems particularly attractive, and it lends itself to a very transparent proof. 

We Say that an (n, k) fountain is primitive if its next-to-bottom row contains no 
empty positions (i.e., contains k — 1 coins), and let g(n,k) be the number of 
primifive (n, k) fountains. By peeling off the bottom row of a primitive fountain we 
see that 


f(ln-—k,k-l)=g(n,k) (n>k;k>1). (2) 
Next, let A be an arbitrary (n, k) fountain, and suppose that the first empty 


position in the next-to-bottom row is the rth (1 <r < k — 1) (in FIG. 2 we have 


*AT & T Bell Laboratories, Murray Hill, NJ 07974. 
**Supported in part by the United States Office of Naval Research. 
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r = 3). Then by drawing a dotted line to the right of the rth coin in the bottom row, 
as in Fig. 2, we split F into an (n’,r) primitive fountain and an (n — n’,k — r) 
not-necessarily-primitive fountain. 


Fic. 2. Split off a primitive fountain. 


The factorization is unique, and so we have 
f(n,k)= do g(n',r)f(n—-n',k— 1) (3) 
n’,r>0 


for all n, k > 1, where f(0,0) = 1 and g(0,0) = 0. 

Hence if F(x, y) and G(x, y) are the ordinary generating functions of the 
Sequences f, g respectively, then from (3) we find that F = 1 + FG and from (2) 
that G(x, y) = xyF(x, xy), whence 

1 
1 — xyF(x, xy) 
1 
1 ~ 
1 — x°yF(x, x*y) 


F(x, y) = 


-— (4 


In particular, if we put y = 1 we obtain the generating function for the numbers 
of n-fountains in the form (1), as claimed. 


Remarks 

1. An easier problem would have resulted if we had asked for h(k) = &, f(n, k), 
the number of fountains whose bottom row contains k coins, without regard to the 
total number of coins. If we put x = 1 in-(4) we find that {h(k)} is generated by 
H(y) where H(y) — yH?(y) = 1. Hence the {h(k)} are the Catalan numbers. 

2. The continued fraction (1) of Ramanujan can be rewritten in various forms, for 
instance as 


xt) 
» (“Gate ox) 
F(x) = aM (5) 


4) T= 


r>0 
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We will now use this expression for F(x) to deduce the asymptotics of f(”). More 
precisely, we will show that 


f(n) = exo" + O((5/3)") (n> &) (6) 
where c = 0.312363324596741 ... and x9 = 0.576148769142756... . 

Let p(x) and q(x) denote, respectively, the numerator and the denominator on 
the right side of (5), and let p,(x), q,(x) denote the nth partial sums of the series 
for p(x) and q(x). It is clear that p and gq are analytic in |x|< 1, so F is 
meromorphic there. We will show that g(x) has a simple, real zero x) € (0.57, 
0.58), and no other zeros in the disk |x| < 0.62, while p(x,) > 0. It will then follow 
that (6) holds, and more precise numerical calculations yield the more accurate 
values of x, and c = —p(X))/(X0q'(X_)) stated above. 

The proof of our claim about x, can be obtained fairly easily using numerical 
methods to compute multiplicities of zeros of analytic functions. Here we sketch a 
proof that can be carried out by hand. 

Write g;(x) = a(x)/(1 — x)(1 — x*)(1 — x?)) where 

a(x) =1-—2x — x7 4+ x3 4+ 3x4 4+ x° — 2x8 — x? — x?, 

and consider b(x) = TT 1(x — x,), where the x, are 0.57577, — 0.46997 + i0.81792, 
0.74833 + i0.07523, —1.05926 + i0.36718, 0.49301 + i1.58185, in that order (these 
x ,’$ are approximations to the zeros of a(x)). An easy hand calculation shows that 
if {a,} and {b,} are the coefficients of a(x) and b(x) respectively, then LJa, — b,| 
< 1.7 xX 107%, and so ja(x) — b(x)| < 1.7 X 10% for all |x| <1. Another such 
calculation shows that |b(x)| > 8 X 10~¢ for all |x| = 0.62. 

On the other hand, for u © (0, 0.62) and |x| = u we have 


yrk th u? 


<< - XL SO 
— — 
1 — yk*} 1—wuw 


2. 32 
(kt D?-k 


1 _ x kt 


(k > 5) 


and, therefore, 


x - Te _ x/) 


m 


6 u? 
<—— <6 x 10-4. 
1—u* xl; | 


k?2 


Therefore, by Rouché’s theorem, g(x) and b(x) have the same number of zeros 
in |x| < 0.62, namely, 1. Further, since g has real coefficients, its zero is real, and 
we call it x9. A brief calculation shows that x, € (0.57, 0.58). 

Finally, we must show that p(x,)) > 0. However, since the successive summands 
in the definition of p(x) decrease in absolute magnitude for each fixed real x, 
0 < x <“0.6, we have p(x) > 1 — x*/(1 — x), which is positive for 0 < x < 0.6, 
and the claim is established. 

We note that the asymptotic expansion that we have proved is very accu- 
rate, even for moderately small values of n. For instance, /(120) = 
1.700213368... X10°8, while f(120) — cx9 7° = 1.59... X10. 

3. To make the correspondence between our problem and the one that Szekeres 
solved quite explicit, we draw the slant lines in Fig. 1 that are shown in FIG. 3, and 
let x, denote the number of coins on the ith slant line. If we let g;=x,;+i-1 
(i = 1, k) then we see that f(, k) counts partitions with a fixed ‘rate of climb.’ 
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Precisely, f(n, k) is the number of partitions of the integer n + ; , whose largest 


part is kK and whose ith-from-smallest part is > i for each i = 1, k, and that is 
exactly the problem that Szekeres treated in [8]. 


Fic. 3. Draw the slant lines. 


4. A problem that is similar to this one has been considered by Auluck [3]. He 


dealt with arrangements of coins that satisfy our conditions and are further subject 
to the condition that the set of coins in each horizontal row forms a single 
contiguous block. 


REFERENCES 

1. George E. Andrews, An introduction to Ramanuyjan’s ‘lost notebook,’ this MONTHLY, 86 (1979) 
89-108. 

2. , The theory of partitions, Encycl. Math. and its Appl., vol. 2, Addison-Wesley, 1976. 

3. F.C. Auluck, On some new types of partitions associated with generalized Ferrers graphs, Proc. 
Cambr. Phil. Soc., 47 (1951) 679-686. 

4. §. Bhargava and Chandrasekhar Adiga, On some continued fraction identities of Srinavasa 
Ramanujan, Proc. Amer. Math. Soc., 92 (1984) 13-18. 

5. P. Flajolet, Combinatorial aspects of continued fractions, Discrete Mathematics, 32 (1980) 125-161. 

6. Richard Guy, The strong law of small numbers, this MONTHLY (October, 1988). 

7. M. D. Hirschhorn, A continued fraction of Ramanujan, J. Austral. Math. Soc. (Ser. A) 29 (1980) 
80-86. 

8. G. Szekeres, A combinatorial interpretation of Ramanujan’s continued fraction, Canadian Math. 


Bull., 11 (1968) 405-408. 


LETTERS TO THE EDITOR 
Editor: 


I was appalled to read C. Smorynski’s vicious attack on James Henle’s elegant 
and beautiful book, An Outline of Set Theory, and was amazed that the editors of 
this journal let such a vituperative review be printed—the MONTHLY has an 
enormous circulation, and its reviews are highly influential. To allow this fine book 
to be excoriated in front of such a large audience is extremely unfortunate. 

The gist of Smorynski’s diatribe is that Henle did not write the sort of book 
which Smorynski would like to see written. It is as if an orange is being accused of 
not being an apple. 

1. Henle is accused of not writing a text organized on historical lines—he didn’t 
set out to write one. 

2. Henle is accused of writing a book in which mathematics is motivated by 
having the students actually. do it, rather than passively consuming it. That’s just 
what he wanted to do. 

3. Preposterously, Henle is accused of being a mere problem solver. I find this 
charge incredible, since the problems are designed exactly to lead students to the 
basics of modern set theory, and to its connection with the rest of mathematics. This 
is not problem solving for its own sake! 

4. Henle is accused of not including enough material. But he didn’t set out to 
write a comprehensive text. He wrote an introduction to undergraduate set theory. 
In fact, Henle covers far more material than any other book I know of on this level, 
even covering topics that are often missing in beginning graduate texts. 

5. Henle’s history is wrong. Smorynski may be right on this, but he should cast 
no stones, since his own history is wrong: it was not Zermelo who formalized the 
hierarchy of sets; the process of doing so was a long one, beginning with Mirimanov 
in 1917 and ending with von Neumann in 1925. With all due respect to the 
importance of the history of mathematics, it is astounding that the bulk of 
Smorynski’s comments concerns a few paragraphs in a 135 page book whose major 
focus is elsewhere. 

If Smorynski would like to write a comprehensive set theory text organized on 
historical lines, he is welcome to write one. He should not condemn others for 
writing different books. 

I encourage the reader interested in an undergraduate set theory text to consider 
Henle’s book (which, by the way, is designed so that it can be used in a conventional 
class as well as in a Moore-method class). In fact, Henle’s book is so well organized 
and written that it would make a pleasurable introduction for anyone wanting to 
learn basic set theory. For more detailed comments, see my own review in the 
Journal of Symbolic Logic, vol. 52 No. 4, December 1987, p. 1048. 

I also urge the editorial board of the MONTHLY to carefully consider their 
editorial policy on book reviews. Polemical diatribes are appropriate in letters to the 
editor. They are not appropriate in reviews. 


Sincerely, 
JUDITH ROITMAN 


Professor of Mathematics 
University of Kansas 
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UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions 
ordinarily encountered in undergraduate mathematics. Each problem should be accompanied by 
relevant references (if any are known to the author) and by a brief description of known partial results. 
Manuscripts should be sent to Richard Guy, Department of Mathematics and Statistics, The 
University of Calgary, Calgary, Alberta, Canada T2N 1 N4. 


Conjectured Isoptic Characterization of a Circle 


M. S. KLAMKIN 
Department of Mathematics, University of Alberta, Edmonton, Alberta, Canada T6G 2G1 


The isoptic curve of a given smooth convex region C is the locus of the 
intersections of pairs of tangents to C which meet at a constant angle, say 7 — B. It 
is known that if the isoptic curve is a circle, the region C need not be a circle. For 
example, it can be an ellipse, a hyperbola or a deltoid [1]. However, Green [2] has 
shown that if 8 is an irrational multiple of 7 or if B = (m/n)a where m/n in its 
lowest terms has an even numerator, then C must be a circle concentric with the 
isoptic locus. Nevertheless, it is conjectured, that if two isoptic curves of the region 
C are concentric circles, then C is also a circle. Prove or disprove. In view of the 
result of Green, it is assumed that the two angles 8, and B, are each of the form 
ma/n with m odd. 
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A Conjecture Concerning Magic Squares 


J. DENES 
Csaba Utca 10 V 42, 1122 Budapest, Hungary 


A. D. KEEDWELL 
Department of Mathematics, University of Surrey, Guildford, Surrey, England GU2 5XH 


A magic square of order n is an n Xn matrix whose entries are n? usually 
distinct, and usually consecutive, integers so arranged that the sums of the entries in 
each row, each column and each main diagonal of the matrix are equal. A classical 
method of constructing such squares is by means of pairs of orthogonal latin 
squares. See, for example, [6] or, for a more recent account, Chapter 6 of [4]. By a 
latin square of order n is meant an n Xn matrix containing n different symbols 
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each repeated n times in such a way that each symbol occurs just once in each row 
and column of the square. The definition of orthogonal is given in the next 
paragraph. If L, and L, are two orthogonal latin squares of order n with suitable 
diagonal properties and having appropriately chosen integers as entries, then the 
matrix L, + L, is a magic square. The question which we wish to raise is whether it 
is true conversely that every magic square can be represented as a sum of two latin 
squares. We provide some evidence for the truth of our conjecture and pose two 
alternative weaker conjectures. 

Let M be an n Xn magic square whose entries are the consecutive integers 
0,1,..., 7 — 1 and let us write these integers as numbers to base n. Then the first 
digits of each such number define a square L,, the second digits form a second 
square L, and we have M =nL, + L,. Both nL, and L, are equi-n-squares: that 
is, each involves just n symbols each repeated n times but they need not be latin. 
However, if M is to have entries which are consecutive integers, the squares L, and 
L, must be orthogonal: that is, if they are superimposed, each of the n different 
integers which occur as the entries of one square must occur in the same cell with 
each of the n different integers which are the entries of the other square exactly 
once. (Of course, the squares L, and L, presently being discussed both have the 
same set 0,1,...,— 1 of integers as entries but later in this paper this will not 
always be the case.) 

Also, if S, denotes the sum of the elements of one of the rows, columns or main 
diagonals of L, and S, denotes the sum of the elements of the same row, column or 
main diagonal of L,, then the diophantine equation nS, + S, = 4n(n* — 1) must 
be satisfied for that row, column or main diagonal. To see this, we note that the sum 
of the integers 0,1,..., 2”? — 1 is 4n*(n* — 1). So if each row of the magic square is 
to have the same sum, that sum must be 4n(n’ — 1). The common sum of each row, 
column and diagonal of a magic square is usually called its magic sum. E. Stern [7] 
has pointed out that these conditions are sufficient for L, and L, to define a magic 
square. He has also pointed out that, if L, and L, are themselves magic (in which 
case, S, = S, = 5n(n — 1) for each row, column and main diagonal), then the last 
condition is automatically fulfilled. 

In example 1 below, L, and L, are both latin and magic and then M = nL, + L, 
is certainly a sum of two latin squares. 


Example 1. (n = 5) 


In [3], J. Chernick has shown that every magic square of order 3, whether 
involving consecutive integers or not, can be expressed in terms of three variables 
k, a,b as in Figure 1 and that every magic square of order 4, again involving 
not-necessarily-consecutive integers, can be expressed in the form shown in Figure 2 
which involves eight variables. The latter form had been obtained earlier by E. 
Bergholt [1] but without a satisfactory proof that it was universal. 
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A-a Ct+tat+cec Brtb-c D-b 
D+a-d B C A-atd 


k-a-pb k+b 
k k+a-—b 
k+atb k-a 


C-—bt+d 
B+b 


FIGURE 1. FIGURE 2. 


From Figure 1, it is easy to see that every magic square of order 3 is the sum of 
two orthogonal latin squares in many ways. Two such decompositions are shown in 
Figure 3. When the square has the consecutive integers 0 to 8 as its entries, we 
necessarily have k = 4 (since the magic sum 3k is 12) and so a + b = 4, implying 
a = 3 and b = 1 (since a # 5b). Then the decomposition of Figure 3(b) gives the 
form M = nL, + L,. 


Lk-b lk+b 
tk = tk-b 
tk+b tk 


FIGURE 3(b). 


Bergholt has obtained a somewhat more restricted result for magic squares of 
order 4: namely, that if such a square has the extra property that the sums of the 
elements of each quarter are equal, then the magic square is the sum of two 
orthogonal latin squares. In Figure 2, the requirement that the elements of each 
quarter have the same sum requires that d = a + c. If we then eliminate the variable 
a and writeA +c=a, B+b=£,C+d=y and D = 4, we get the form shown 
in Figure 4 which is clearly the sum of two orthogonal latin squares. Because any 
latin square of order 4 which has an orthogonal mate must be isotopic to Klein’s 
four-group Z, X Z, (see [4], pages 34 and 129) and can only be magic if it has one 
of the three forms shown in Figure 5, it is clear that, conversely, a magic square of 
order 4 having consecutive integers as entries and constructed as the sum of two 
orthogonal latin squares of that order has equal quarters. Thus, for example, the 
magic square shown in Figure 6 cannot be expressed as the sum of two orthogonal 
latin squares but, despite this, it can be expressed as the sum of two orthogonal 
squares one of which is row-latin and the other column-latin, as shown. An 
equi-n-square is row-latin if each symbol occurs just once in each of its rows but not 
necessarily in each column. A column-latin square is similarly defined. The idea of 
constructing a magic square as the sum of a row-latin square and a column-latin 
square goes back at least as far as Ozanam: see pages 99 and 100 of [6] for magic 
squares of orders 6 and 8 constructed in this way. 
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FIGURE 4. 


FIGURE 5. 


FIGURE 6. 


The above results lead us to pose the following two questions (though we are not 
sure which is likely to prove the more difficult to resolve): 


Problem 1. Can every magic square be expressed as the sum of two orthogonal 
squares one of which is row-latin and the other column-latin? 


Problem 2. Can every magic square be expressed as the sum of two not-neces- 
sarily-orthogonal latin squares? 


We draw the attention of the reader to some further results and questions which 
are relevant. 

First, in [3], Chernick has given a form of expression valid for every magic square 
of order n, n > 5, showing that it depends on n? — 2n arbitrary constants. (The 
entries of the square are not restricted to being consecutive integers.) 

Second, we draw attention to a theorem of G. Birkhoff [2] concerning the 
representation of a doubly stochastic matrix as a sum of permutation matrices. A 
square matrix with nonnegative entries is called doubly stochastic if the sum of the 
elements of each of its rows and columns is equal to unity. Thus, if all the elements 
of a magic square are divided by the magic sum, the resulting matrix is doubly 
stochastic. Likewise, if all the elements of a latin square are divided by the sum of 
the elements of any one row (or column), the resulting matrix is doubly stochastic. 
Birkhoff’s theorem states that any n Xn doubly stochastic matrix D can be 
expressed in the form D = )}_,6,P,, for some positive integer s, where P,, P,,..., P, 
are n X n permutation matrices and 6,, 6,,...,6, are nonnegative numbers whose 
sum is unity. An example of such a decomposition is given in Figure 7. A proof of 
the theorem can be found, for example, on page 36 of [5]. Let us note further that 
any doubly stochastic latin square L of order n can be expressed in the form 
L = U_,0,P., where P,, P,,..., P, are disjoint permutation matrices and 17_,6, = 1. 
Here, each different element of L defines one of the permutation matrices. An 
example is shown in Figure 8. 
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Eee deep ee ales 
Lo o4 2 01 001 1 0 01 0 
50 0 0 1 0 0 0 0 1 0 0 
FIGURE 7. 
0 2 t\_g 0 0\42/0 1 0) 4a 0 1 
2 1 9 0 1 1 0 0 0 1 0 
1 9 2 1 0 001 1 0 0 
FIGURE 8. 


These observations suggest the following question: 


Problem 3. Is it possible to write every doubly stochastic magic square M as a 
weighted sum of doubly stochastic latin squares L,, L,,..., L,, say 


M= )@L,, where )) 0, = 17 
‘ i=l i=l 


If the answer to Problem 3 is “YES”, then we have solved a weaker version of 
Problem 2 above: namely, we have shown that every magic square can be expressed 
as a sum of latin squares. 
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NOTES 


EDITED BY DENNIS DETURCK, DAVID J. HALLENBECK, AND ANITA E. SOLOW 
A Simple Cover-up Game 
V. J. BASTON AND F. A. BOSTOCK 


Two decks, each with n cards numbered 1 to n, are shuffled separately, and the 
top card from each deck is laid face down on the table. One of two players now 
looks at the numbers on the cards and chooses to turn one of them face up. The 
other player selects one of the cards and receives from his opponent an amount 
equal to the number of his selected card. What is a fair entrance fee for the second 
player to play this game? 

The problem can be modelled mathematically as a two-person, zero-sum game Q),, 
as follows. Each element of an ordered pair (x,, x,) is chosen independently from 
{1,2,...,} with probability 1/n. Only player 1 sees (x,, x,) and, after seeing it, 
he covers either x, or x, before showing it to player 2. Player 2 now either accepts 
the uncovered number or rejects it and chooses the covered one. The number so 
chosen is the payoff to player 2. 

Note that the value of 2, to player 2 is then the entrance fee required. 

Before presenting a solution of &, we will find some preliminary discussion 
useful. We will denote by EX, Y) the expectation to player 2 when player 1 plays 
the strategy X and player 2 the strategy Y. Notice that player 2 is trying to 
maximize E( X, Y) whilst player 1 is trying to minimize E(X, Y). If X is a fixed 
strategy for player 1, then a best reply to X is any strategy Y* such that E(X, Y*) 1s 
the maximum of ECX, Y ) over all strategies Y for player 2. Similarly, a best reply to 
a strategy Y of player 2 is any strategy X* such that E( X*, Y) is the minimum of 
E(X, Y) over all strategies X for player 1. Let X* and Y* be strategies for the 
players such that X* is a best reply to Y* and Y* is a best reply to X*. Then it is 
immediate from the definition of optimal strategy that X* and Y* are optimal. 
Technically, a solution of the game is obtained when an optimal strategy for each 
player has been found. Although a game can have many solutions, E( X*, Y*) is the 
same whatever optimal strategies X* and Y* are used for the players, and it is 
referred to as the value of the game to player 2. The reader who is not familiar with 
the rudiments of elementary game theory is recommended to consult [3]. 

In playing Q, player 2 clearly has simpler decisions to make than player 1, so it is 
natural to first consider what strategies this player can adopt in order to do well. It 
is routine to calculate that the maximum and minimum of the two numbers seen by 
player 1 have expected values (n + 1)(4n — 1)/6n and (n + 1)(2n + 1)/6n, respec- 
tively. Since player 2 knows he is seeing either the maximum or the minimum, a 
reasonable strategy for him to adopt is to accept the uncovered number if and only 
if itis at least (m + 1)/2. A consideration of cases in which n is small suggests that 
this strategy is indeed optimal and leads to the following theorem. 


THEOREM. The following strategies are optimal in ,. For player 1, cover a 
maximum number if and only if the sum of the numbers is at least 2[(n + 2)/2] where 
[x] denotes the integral part of x. For player 2 accept the uncovered number if and 
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only if it is at least [(n + 2)/2]. The value of Q, is (In* + 6n + 2)/12n if n is even 
and (In? + 6n — 1)/12n if n is odd. 


Proof. Let d= [(n + 2)/2] and denote the strategies in the statement of the 
theorem by X* and Y%, respectively. Let player 2 adopt Y*. Then, for a best reply, 
player 1 should cover a maximum if both are at least d and cover a minimum if 
both are strictly less than d; in the other cases, he can cover either number. Hence it 
is easy to see that X* is a best reply to Y*. 

Now let player 1 adopt X*. Suppose player 2 sees m, where m > d. This could 
have arisen via the pairs (m, m), (m, t), and (t, m), where m<t<norl<t< 
2d — m. Hence if player 2 rejects m, he obtains m for the pair (m, m) and ¢ for 
each of the pairs (m,t) and (t, m), where m<t<n or 1 <t< 2d—m; since 
there is a total of 4d — 4m + 2n — 1 such pairs (occurring with equal probability), 
player 2’s conditional expectation would then be 


n 2d—m-1 
m+2 yy t+2 > | [(¢¢ 4m an 


t=m+1 t=1 

= (n> +n + 4d? — 4dm — 2d + m)/(4d — 4m + 2n - 1). 

Thus he will not do worse by accepting m provided 
m(4d—-—4m+2n-—1)>n?+n+ 4d? —- 4dm—2d+™m, 

1.€., 

2(m —d)(n-2m+2d—-—1)+ Qd-—n-1)n>0. 
By considering separately the cases n odd and n even it is easy to check that all the 
brackets are nonnegative for [(n + 2)/2] = d < m < n, So the inequality holds. 

Now suppose player 2 sees m where m < d. This could have arisen via the pairs 

(m, m), (m, t) and (t, m), where 0 < t <m or2d—m<t<n. Hence if player 2 
rejects m, he obtains m for the pair (m, m) and ¢ for each of the pairs (m, ¢) and 
(t,m), where 0 <t<m or 2d—m«<t<«<vn; since there is a total of 4m — 4d + 


2n + 1 such pairs (occurring with equal probability), player 2’s conditional expecta- 
tion would then be 


m+20 +2 x | [lam saan 
t=1 t=2d—m 
= (n?+n— 4d? + 4dm + 2d — m)|(4m — 4d + 2n + 1). 
Thus he will not do worse by rejecting m provided 
n>+n-—4d*+ 4dm+2d—m>m(4m — 4d + 2n +1), 
1.€., 


2(d-—m-—1)(n+1-—2d+ 2m) + (n+2)(n+1-—2d)+4ms0. 


Since all the brackets are nonnegative for 0 < m < d= [(n + 2)/2] the inequality 
holds. 
It follows that Y* is a best reply to X*, whence X* and Y* are optimal. 
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We obtain the value of Q, by calculating E( X*, Y*). Let player 1 adopt X* then, 
by our previous analysis: 

(i) player 2 will see the number m > d with probability (4d — 4m + 2n — 1)/n? 
and, in this case, since he accepts m, there will be a contribution m(4d — 4m + 2n 
— 1)/n? to his expectation; 

(ii) there will be contribution (n* + n — 4d? + 4dm + 2d — m)/n’ to his ex- 
pectation when player 2 rejects m < d. Hence 
n°*E(X*, Y*) = )) m(4d—- 4m + 2n - 1) 

m=d 
d-1 
+ Y (n*+n—- 4d? + 4dm + 2d—-m) 


= =n(n + 1)(6d — (2n + 13)/3) — ~d(d — 1)(2n + (16d — 8)/3) 


after some routine manipulation. 


The result now follows easily. 

It is natural to consider the corresponding game where there is only one deck of 
cards and the top two members are laid face down on the table. Trivial modifica- 
tions in the proof of our theorem show that the players have the same optimal 
strategies as in the original game and that the value is 7(n + 1)/12 if n is odd and 
(Tn? — 4)/12(n — 1) if n is even. 

From a mathematical point of view it 1s also natural to consider the infinite game 
(2 corresponding to 2, in which the two numbers are chosen independently from the 
closed interval [0,1] by means of the uniform distribution. By making n large our 
theorem, suitably normalized, suggests that the following strategies U* and V* are 
optimal in 2. For player 1, cover a maximum number if and only if x, + x, > 1, 
and for player 2, accept the uncovered number if and only if it is at least 1/2. 
Although we shall shortly prove that these strategies are indeed optimal the reader 
should not jump to the conclusion that they are obviously so. To see this we shall 
consider the following trivial game A,. Each of two players simultaneously chooses 
a number from {1,2,...,”}. If they choose the same number no money changes 
hands. Otherwise, the player choosing the strictly larger number gets a dollar from 
his opponent unless he has chosen n, in which case he gives a dollar to his 
opponent. Clearly it is optimal for each player to choose the number n — 1. Now let 
A be the infinite game in which players 1 and 2 simultaneously choose numbers x 
and y, respectively, from the closed interval [0, 1]. The payoff M(x, y) to player 1 is 
given by 

O forx=y- 
M(x,y)=(-1 forx<y<lory<x=1 
1 fory<x<lorx<y=1. 


By making vn large in A, and normalizing suitably, it might be thought that 
optimal strategies in A would be for each player to choose the number 1. However it 
is well known that A has no solution (see [2, p. 115}). 

As promised, we now prove that the above strategies U* and V* are optimal in Q 
by showing that each is a best reply to the other. If player 2 adopts V* then, for a 
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best reply, player 1 covers a maximum if both numbers are at least 1/2, a minimum 
if both are less than 1/2, and either number otherwise. Hence it is easy to see that 
U™* is a best reply to V*. 

Now let player 1 adopt U*. Suppose player 2 sees 8, where 1 > B > 1/2. This 
could have arisen via the pairs (x, 8B) and(8, x), whereB <x <lor0<x<1-8. 
Thus if player 2 rejects B his conditional expectation is 


[fixe n [*xae) [0 —~B+1-8)=1/2. 


Hence he will not do worse by accepting 8B when B > 1/2. 

Now suppose player 2 sees 8 where 0 < B < 1/2. This could have arisen via the 
pairs (x, 8) and (B, x) whereO0 < x < B orl — B < x < 1. Thus if player 2 rejects 
B his conditional expectation is 


B 1 (4 _ 
[fisac fxs) (a (1 — B)) =1/2. 


Hence he will not do worse by rejecting B when B < 1/2. 

Thus V* is a best reply to U%*. 

We will find the value of Q by calculating the expectation to player 2 when the 
players adopt the strategies U* and V*. Suppose player 1 sees x and y where 
y > x > 1/2. Then he covers y and player 2 accepts x. Thus the contribution to the 
expectation for these cases is 


af f xdxdy = 1/6. 
1/271/2 
Suppose player 1 sees x and y where y > 1/2 > x. Then, whichever he covers, 
player 2 gets y, and the contribution to expectation is 


2p J vaedy = 3/8. 


Suppose player 1 sees x and y, where 1/2 > y > x. Then he covers x and player 2 
rejects y. Thus the contribution is 


2/0" [xdedy = 1/24. 


Hence the value of Q is 7/12. 

The reader may think that the corresponding games in which three numbers are 
chosen independently and player 1 covers two of them have equally simple solu- 
tions. Our investigations suggest that this is not the case and that a general solution 
would be difficult to find. Somewhat surprisingly, we have not been able to find any 
reference to such games in the literature and the nearest we have found is that to 
deception games, the study of which was initiated by Mark Thompson in an 
undergraduate thesis at Harvard University (see [4]). The only difference is that in 
deception games, instead of covering some of the numbers, player 1 tries to confuse 
his opponent by apparently changing their values. Little has been done on these 
problems, and they appear to be hard, but it has been proved recently that very 
general deception games always have a solution (see [1]). 
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A Note on Probabilities in Proofreading 


FASHENG LIU 
Computer Science Department, Shandong Teacher’s University, P.R. China 


Polya [1] estimates the number M of misprints in a proofsheet using the moments 
methods for two independent proofreaders. It is sometimes desirable to estimate M 
for one proofreader proofreading twice, and it is also meaningful to estimate the 
operational level of proofreading (i.e., the probability that the proofreader notices 
any given misprint). 

A proofreader reads the proofsheet two times by picking out misprints without 
replacement. Let M denote the number of misprints examined, p the probability 
that the proofreader notices any given misprint. Let X, and X, denote the numbers 
of misprints picked out in the first and second proofreadings. Using the moments 
methods as in [1], we have 

Mp ~ X,, (M - X)p ~ X, 
and so 

X? X, — X. 
M = —— p=-—_—2. 
X, — X, xy 
They are the desired estimates. Since the joint distribution of X, and X, is 
M! 

~ (M— X, — X,)!X!X! 
the above estimators are also MLE (maximum likelihood estimator). The proof- 
reader reads two proofsheets of the same book independently. As he finished, X, 
misprints were noticed in the first proofsheet, X, were noticed in 2nd proofsheet, 
Xj, were noticed in both. So the desired estimates of M and p are: 


_ X,X) P Xo(X + X,) 


X19 7 2X; X5 
Two proofreaders read, independently of each other, the proofsheets of the same 
book. Let p, and p, be the operational levels-of proofreading of the two proofread- 
ers, A’ and B’. A misprints were noticed by A’, B misprints were noticed by B’, C 
misprints by both. Then the desired estimates of p, and pz, are: 
C 


P,=—-, P,=—. 
AB BA 


2M—2X,—X> 
P(X, = X,, X, = Xp) pX+%(1 — p)™ * “2 


These are useful in quality control. 


Reference 
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A Proof of the Lucas-Lehmer Test 


MICHAEL I. ROSEN* 
Department of Mathematics, Brown University, Providence, Rhode Island 02912 


E. Lucas proposed and D. H. Lehmer proved a very elegant test for the primality 
of the Mersenne number M, = 2” — 1. Unfortunately, it is hard to find a proof in a 
modern text. Most books which state the result do not prove it, but do prove a 
related and weaker result, e.g. [1], [2], and [4]. In [5] a complete proof is given which 
follows the original proof of D. H. Lehmer (see [3]), but it is lengthy and detailed. In 
this note I give a fairly short proof using the simplest properties of algebraic 
numbers. Another proof using algebraic numbers was given long ago in [6]. 

Define a sequence S, recursively by setting S, = 4 and S, = S’?_, — 2. So, 
S, = 4, S, = 14, S; = 194, etc. 


THEOREM (LUCAS-LEHMER). Let p be a prime number. Then, M, = 2? — 1 is a 
prime if and only if M, divides S,_). 


Before giving the proof we need a few definitions. Let 
1+7¥3 © 1-y3 
T= v2 ’ T= v2 ’ 


Note that tT = —1 and ww = 1. 


w=77=24+ 73, ando=77=2- 73. 


— Qn - I 


LEMMA. S,, = @” + @ 

Proof. Let T, = 0 + 6°" .Thenn T,=0+6=4=S,and T,,, = 72-2. 
Thus, T,, = S,, for all m. 

Lema 2. If we assume M, is prime, then 7“%** = —1 (mod M,). 


Proof. The congruence is understood to take place in the mng of algebraic 
integers. We temporarily set q = M,,. 

Write V27 = 1 + ¥3, raise both sides to the gth power and take congruences 
modulo g. We find 


7929-D? V2 = 14+ 34-73 (mod q). 
Since g = —1 (mod8) we have 2°%~)/* = (2/q) =1 (modgq). Since g = 
1 (mod 3) we have 3°4—)/* = (3/q) = —1 (mod q). It follows that 
ri=F (mod q) and r7tl=c7r= -1 (mod q). 


We can prove the theorem. We first prove that S,_, = 0 (mod M,) implies M, is 
prime. This part of the proof makes no use of Lemma 2. 

From the condition S,_, = 0 (mod M,) we find w”* + @" * = 0 (mod M,) so 
that w?” = —1 (mod M,) and w*” = 1 (mod M,). 


Let / be a prime dividing M,, and O= Z [V3]. The coset of w in (@/10)* has 
order 2? by the above congruences. 


*This work was partically supported by NSF Grant DMS-8606407. 
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Suppose / splits in @. Then it follows that 
(O/10)* = (Z/IZ)* x (Z/IZ)*, 


and so 2” divides /— 1. Thus, /= 1+ 2?k for some k > 1. This is impossible 
because it implies />1+2?> M.,. 

Suppose / stays prime in @. Then (0//0)* has order /* — 1 and 2? divides 
7—1=(/-—1)(/+ 1). If /=1 (mod4) we must have 2?7! divides /— 1, or 
1=1+ 2?~'k for some k > 1. This cannot happen since 2/ > 2 + 2? > M,. If 
1 = 3 (mod 4), then 2?~! divides /+ 1 so that /= —1+2”7!k for some k > 1. 
One cannot have k = 1 since 2?~! — 1 does not divide 2? — 1. The only possible 
case is k = 2. Then, / = 2? — 1 = M,, and so M, is prime. 

Finally, suppose M, is prime. From Lemma 2 we have 


qMptl= —] (mod M,) ort’ +1=0 (mod M,). 


Since 7? = w, w2” + 1 = 0 (mod M,). Multiply both sides of this congruence with 


—2p-2 


w and recall ww = 1. The result is 
_ QP-2 —2P-2 — 
S,-1= 07 + @” = 0(mod M,). 
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The Behaviour of Means Under Equal Increments of 
Their Variables 


J. ACZEL 
Department of Pure Mathematics, University of Waterloo, Waterloo, Ont. Canada N2L 3G1 


ZS. PALES 
Institute of Mathematics, L. Kossuth University, H-4010 Debrecen, Hungary 


In [6], after noting that the arithmetic (M = A), geometric (M = G), harmonic 
(M = #) means, and the root-mean square (or quadratic mean M = Q, denoted in 
[6] by R) all satisfy 


M(a,x,...,a,x) = M(a,,...,4,)x (1) 


(here and in what follows, except in Proposition 4, a,,..., a,, x are supposed to be 
positive), and 


A(a,+x,...,a4, +x) =A(a,,...,a,) + x, (2) 
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the authors prove separately the following inequalities: 


(3) 


G(a,+x,...,a,+x) > G(a,,...,a,) + x and 
G(a,+x,...,a, + x) — x increases with x, 
H(a,+x,...,a, +x) > H(a,,...,a,) + x and (4) 
H(a,+x,...,a, + x) — x increases with x, 


(5) 


QO(a,+x,...,a,+x) < QO(a,...,4,) te | 


O(a, + x,...,a, + x) — x decreases with x. 
The authors’ next aim is to dispel the worry that Q(a,+x,...,a, +x) -— 
A(a, + x,..., a, + x), which in view of (5) and (2) decreases, could go below 0 in 


contradiction to Q > A and similar worries in opposite direction concerning (4) and 
(3). For this purpose the authors prove 


lim |M(a,+x,...,a, +x) —A(a,+x,...,a, + x)] =0 


or, what is equivalent by (2), 


lim [M(a, + x,...,a, +x) —x] =A(a,,...,a,), 


again individually for M=G, M=H, and M=Q (for M =A this is obvious 
from (2)). 
In what follows we give short elementary proofs for all root-mean powers 


n 1/p 
M= Mylar dy) = ( Sar (p #0) 
7 (6) 


and 


M =M,(q,...,4,) = G(a@,...,4,) = [J ai” (p = 0) 
i=1 


(clearly M, = A, M_, = H, M, = Q) of the following statements (and, later, some 
generalizations ). 


PROPOSITION 1. Equation (1) holds for all M, (p © R), as defined in (6). 


PROPOSITION 2. As a function of x, 
M,(a, + x,-..,4, +x) — x 


decreases or increases (or is constant) and is smaller or greater than (or equal to) 
M,(4@,,...,4,) according as p 21 (or p= 1). This increase or decrease and the 
inequalities are strict unless a, = +--+ =4,,. 


PROPOSITION 3. For all p€&R 


a t+:+: +a, 
lim | M, (a, + x,...,a,+ x) — x| = A(a,,...,a,) = ———_-.. (7) 


xXx— 00 nN 
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Proofs. 

Prop. 1: Trivial. O 

Actually (see, e.g., [5]) the equation (1) characterizes the root-mean powers (6) 
among the quasi-arithmetic means (cf. {1]) 


M(x,,...,x,) =¢ +{ SoG, | 


(where ¢ is continuous and strictly monotonic). 


Prop. 2 (All sums and later products are with respect to i and go from 1 to n): 


d (Z(a, + x)’)” 
—|M,(a, + X,...,a, +x) - x| = — | ———_—__ - | 


yn /P 


1/p)-1 
yr" Vi(a,+ x)? > -120ifp S11. 


1 
= (La, + x)" 


Indeed, for p > 1, we have with the notation y,=(a,+ x)? (i=1,...,n), by 
Ho6lder’s inequality (see, e.g., [5]) 


y p/Pyl-a/p) < (¥1)'77( yy) —(1/p) 


with equality iff y,/1 = --- =y,/l,1e, a, = --- =a,. For p < 1 the inequality 
reverses. Accordingly, 
M,(a, + x,...,4,+x)—-xSM,(q,...,4,)ifp 21. O 


An even simpler proof is the following. By Minkowski’s inequality (see, e.g., [5]) 
we have for p > 1, whenever x < y, 


M,(a,+x,...,a,+x)—x=M,(a,+x,...,a, +x) 


+M,(y—x,...,y—-x)-y>M,(a,+y,...,a,+y)—y 
with equality iff a, = --- =a,. If p <1, the inequality signs are reversed. 
Minkowski’s_ inequality _Bives also directly M,(a, + x,...,4, +x) S$ 


M,(a,...,4,) + x for p 2 1, that is (in generalization of (3), (4) and (5), 
M,(a, + x,...,4,+x)—xS M,(a,,...,4,) 


o (9) 
for p 2 1, except ifa, = --: =a,,. 


Prop. 3: Using the notation t = 1/x and the Bernoulli-l’Hospital theorem, 


‘ 4 yyr\l? \/p ayy 
lin (2 x) | ~ «|= (X(a,t + 1)”) n 


x— 0O n t—0 ni/Py 


_ Ip (X(a,t + 1)? pS at + 1)?~“a, 
= lim eS 
10 n/P 


1/p)-1 
n/P) La, La, 
ni/P n 
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Remarks. (1) The second proof of Proposition 2 remains unchanged with p = 0. 
Since (see, e.g., [5]) lim, 9M, = Mo, Proposition 3 follows also for p = 0 and the 
first proof of Proposition 2 ‘almost’ covers the case p = 0: except the uniqueness of 
the equality case. However, a direct proof for M, = G is very simple too: 


d d Lyn 
Gy lala +Xx,...,a,+x)—-x|= Gy (TT (ai + *) / — x) 


1 1 

— a +x)/"¥ ——-1>0, 
[1 (a, ) 

n a.+x 


z 


because the geometric mean [](a, + x)'/” is greater than the harmonic mean 
n/X(a, + x)~* if not all a, are equal. 

(11) All three propositions remain true (with unchanged or slightly modified 
proofs) for weighted root-mean powers ((Xq,a?)'/? for p #0, Ila” for p =0 
(q,> 0; i=1,..., 2; Xq; = 1), with A(a,,..., a,) = Lq;a; in (7). 

(iii) In generalization of Proposition 3 and with a very similar proof we have the 
following. 


PROPOSITION 4. Let I be one of the intervals R ,=]0, co[, R =] — 00, oof, and let 
M be homogeneous, that is, 


M(v,x,...,0,x) = M(v,,..., 0,)x (10) 
for allv, € I (i =1,...,n) and for all positive x, 
let M satisfy 
M(1,...,1)=1 (11) 
| 0M 
and be differentiable at (1,...,1). Write q, = 9p tb ..., li =1,..., 2). Then 
U; 
lim [M(a,+x,...,a4,+x)-—x] = ¥q,a; and Yq, = 1. (12) 


Proof. Introducing again ¢ = 1/x we get from (10) (by using the definition of the 
derivative instead of the Bernoulli-l’Hospital theorem) 


. . ay a, 
lim [M(a, + x,...,a, +x) — x| = lim x m( + 1,...,—+ 1 — 1 
x 


x— 0O x— 0o Xx 
 M(at+,...,a,t+1)-1 
= km —-_ 
t—0 t 


0M 
Lg Gh Da, = \)9,4;.- 
From (10) and (11) we get M(x,..., x) = x for all x > 0 and thus 
OM 
Le Fy, 9%) = 1, 


in particular Yg,= 1. O 


Note. Here a,,..., a, do not need to be positive anymore; neither do all the gq. 
However, if M is symmetric, then gq; = --- = q, = 1/n and the limit is A = La,/n 
as before. 
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(iv) Losonczi [7] gave necessary and sufficient conditions for quasi-linear means 
with weight functions 


M(a,,...,4,) = o UY f(x,)o(%)/XS(x;)) 
to satisfy (cf. (9)) 
M(a,+u,...,a,+u)—u>M(a,,...,a,) foru> 0. (13) 


By the way, it follows from (13) that the function u > M(a, + u,...,a, +u)—u 
is strictly increasing. Indeed, take an arbitrary y > x and u = y — x > 0. Then (13) 
yields 


M(a,+y,...,a,+y) -y=M(a,+x+(y—x),...,a,+x+(y-~x)) 


—(y-x)-x 
> M(a,+ x,...,4,+x)-—x, 
as asserted. 

(v) In the second proof of Proposition 2, Minkowski’s inequality is used. In [8] 
Minkowski type inequalities were proved under certain conditions for mean values 
even more general than quasi-linear means with weight functions. Proposition 2 
holds in all those cases. 


We think that the above, in particular Propositions 1, 2, 3, 4, Remark (ii) and the 
second part of (iv), show that more general statements may have at least as simple 
elementary proofs as their particular cases. 

(vi) Independently, Brenner and Carlson [3] have proved (personal communica- 
tion) a result somewhat stronger than (12), namely, 


x|M(a, + xX,...,a,+x)-x- » 94;| is bounded as x > oo 


under somewhat stronger conditions than those in Proposition 4 above, supposing, 
in particular, instead of differentiability at (1,...,1), the existence and continuity of 
all second order partial derivatives (everywhere). Previously Brenner [2] has proved 
Proposition 3 for positive integer p by binomial and other expansions and for 
noninteger p by the monotonicity of p > M,(a,,..., a,). Recently Bullen [4] has 
proved (personal communication) Proposition 2 independently by a variant of our 
first proof, also using that M, is increasing in p, and Proposition 3, again by use of 
binomial and multinomial expansions (both for weighted means; cf. the above 
Remark (11)). 

The simplicity of the proof of Proposition 4 and of the second proof of 
Proposition 2 here may be of some comparative interest. 
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1. Introduction. The classification of surfaces is one of the truly beautiful theo- 
rems of modern mathematics. There are many proofs in the literature already, most 
of which are quite elementary. This note presents yet another proof, one that 
involves some very simple graph theory, an area increasingly popular with, and 
familiar to, students today. It also involves rudimentary surgery and thus could 
serve as an introduction to some of the methods of modern differential topology. 

A surface is a compact two-manifold without boundary. For those readers for 
whom this is not very helpful, [7, Chap. 1] and [4, Chap. 1, 2] both give a very 
leisurely, well-illustrated explanation. 

Perhaps the most familiar surfaces are the sphere and the torus (FIG. 1). 


oO S| 


sphere torus 


Fic. 1. 


Two more exotic surfaces are the projective plane and the Klein bottle (FIG. 2; see 
also [6, pp. 64—67]). 


(> > LL 
=YSeE 
Ne = 


projéctive plane 


Klein bottle 
FIG. 2. 
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Most mathematicians are familiar with the Mobius strip (FIG. 3). 


cylinder MoObius strip 


FIG. 3. 


This is not a surface but, like the cylinder, a surface-with-boundary. The boundary 
of the cylinder is, of course, two disjoint circles, while the boundary of the Mobius 
strip is just a single circle. The Mobius strip will be quite important in our proof 
because it is so closely related to the projective plane. In fact, if the interior of a 
small disc is removed from a projective plane the resulting surface-with-boundary 1s 
a Mobius strip (see [5, p. 33]). 

Any two surfaces can be combined to produce a third by forming what is called 
their connected sum (see [5, p. 79]). A small open disc is removed from each surface, 
and the two resulting boundary circles are then glued together (FIG. 4). 


. 
te Connected sum of a torus and a projective plane. 


FIG. 4. 


The simplest and weakest form of the classification theorem says that any surface 
can be built from the few examples described above using only the operation of the 
connected sum. 


THEOREM. Any surface is homeomorphic to the connected sum of a sphere, m tori, 
n projective planes, and r Klein bottles, where m, n, and r are greater than or equal to 
zero. 


The basic idea of this proof is the same as that of [5, pp. 88-89]. We attempt to 
simplify the surface by removing tori, projective planes, or Klein bottles. Begin by 
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looking for a nonseparating simple closed curve y on &, that is, a curve y for which 
= — y is still connected. If W is a small open neighborhood of y on =, then W (the 
closure of W) is either a cylinder or a Mobius strip. Consider = — W the 
surface-with-boundary obtained by removing W from =. The boundary of > — W 
will be one or two circles since bdry(= — W) = bdry(W). Glue a disc (or discs) 
onto the boundary circle(s) of 2 — W to get a new surface X’. In essence we go 
from = to =’ by removing a cylinder (handle) or a MObius strip. A little care must 
be taken here. The handle could have been attached nicely (FIG. 5(a)) or badly (FIG. 
5(b)). It is a good exercise to show that the nice handle in FIG. 5(a) is homeomor- 
phic to a torus with a disc removed and that the bad handle in FIG. 5(b) is 
homeomorphic to a Klein bottle with a disc removed. 


(a) “nice” handle (b) “bad” handle 


Fia. 5. 


We continue by looking for a nonseparating simple closed curve y’ on >’ and 
repeating the process. This leaves two major questions: Will we ever reach a surface 
with no nonseparating curves? What can we say about such a surface? The answers 
are quite simple. After a finite number of these surgeries, we wil] obtain a surface 
with no nonseparating closed curves and such a surface is homeomorphic to a 
sphere. The verification of these statements is usually rather lengthy, though. 

In this proof we use triangulated surfaces and some graph theory to describe a 
very concrete execution of the program just sketched. We show where to look for 
the non-separating curve y and demonstrate easily that the procedure produces a 
sphere in a finite number of steps. For convenience, we use a slightly different 
technique for the surgery. Instead of removing an entire neighborhood of y, we 
merely cut the surface open along y. The surface-with-boundary created is homeo- 
morphic to the one described above and so the ultimate effect is the same. 
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The idea of this proof came from Paul Melvin, who credits it originally to E. C. 
Zeeman. (The referee has pointed out that a similar version of this proof [also 
credited to Zeeman] appears in an excellent textbook by M. A. Armstrong [1]. 
Regrettably, this book does not seem to be very well known or readily available in 
North America. We hope that those readers who find this article useful will be able 
to examine Armstrong’s text in its entirety. The author would also like to acknowl- 
edge financial support from Williams College, McMaster University, and Wilfrid 
Laurier University.) 


2. Some Graph Theory. For purposes of this note, we assume that all graphs are 
connected. Recall that a spanning tree in a graph G is a subgraph of G that is a tree 
containing all the vertices of G. It is easy to see that every graph has a spanning 
tree. 

Let V(G) (resp., E(G)) denote the number of vertices (resp., edges) in a graph 
of G. 


DEFINITION. The cyclomatic number of a graph G, denoted by C(G), is defined 
by the equation 


C(G) =1-V(G) + E(G). 


The cyclomatic number measures the number of independent cycles in G. We need 
only one of its properties. Some deeper results about C(G) may be found in [2, 
Chap. 2]. 


PROPOSITION. C(G) > 0. Furthermore, C(G) = 0 if and only if G is a tree. 


Proof. The proof follows quite easily using induction on the number of edges. 


3. Proof of the Theorem. As in most proofs of this theorem, we use combina- 
torial techniques and so assume that all surfaces are triangulated. It is a nontrival 
result that this introduces no loss of generality in dimension two (see, e.g., [3, 


§16.a]). 


DEFINITION. A surface is a finite collection of triangles in some Euclidean space 

such that: 
(1) the intersection of any two triangles is empty, a single vertex, or a single edge; 
(11) any edge is the intersection of exactly two triangles; and 
(111) the triangles incident with a given vertex can be ordered A,, A,,..., A, so that 
A, has exactly one edge in common with A,,,, for 1 <i<k, and A, has 
exactly one edge in common with Aj. 

For a surface 2, let A,,A,,...,A, be a list of all the triangles of =. The 
collection of vertices and edges of these triangles forms a graph that we also denote 
by 2. Let B, be the barycenter (centroid) of A,;. Form a new graph Gs whose 
vertices are the barycenters B,, B,,..., B,. There will be an edge B,B, in Gs if 
and only if A; and A, have an edge e in common. In this case we say that B,B, 
Crosses eé. 

Let T be any spanning tree of G; and K, be the subgraph of = consisting of all 
the vertices of 2 and those edges of 2 not crossed by T. It is easy to verify that K- 
is connected since T is a tree. 

It is perhaps helpful to think of T as lying along the edges of b(2), the first 
barycentric subdivision of 2. It is not, however, a subgraph of b(2) because the 
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vertices of T are just the barycenters of the triangles in =. We can use b*(=), the 
second barycentric subdivision of =, to produce neighborhoods U > T and V > K, 
on the surface > satisfying: 

G) UNV = 4, 

(ii) UU V = 3, and 

(1) bdry U = bdry V. 

For example, U will be T together with the interiors of all the triangles and edges of 
b*(=) that intersect J. There is a similar description for V. We should remark here 
that U is homeomorphic to a two-dimensional disc since T 1s a tree. 

FIGURE 6 shows a triangulation of the projective plane. (Note that there are 
identifications on the boundary of the hexagon.) One of the possible choices of T is 
drawn with heavy lines. The corresponding K, is drawn with broken lines. The 
shaded region is U and the unshaded one 1s V. 


FIG. 6. 


Now focus on the cyclomatic number L = C(K,). We consider two cases. 


Case 1: L = 0. In this case K; is a tree. Then V is a disc, as observed above. 
Thus 2 is the union of two discs, U and V, joined by an identification of their 
common boundary. This means that = is homeomorphic to a sphere. 


Case 2: L # 0. In this case, Ky is not a tree so there must be a cycle y in Ky. 
This is the curve on which we do surgery. 


LEMMA. The cycle y does not separate =. 


Proof. The tree T has a vertex in each triangle of = and does not cross y. Thus 
any two points of = — y can be joined by a path in 2 — y. 


Instead of removing a whole neighborhood of y, we just cut the surface open 
along y to create a triangulated surface-with-boundary 2,. Just as in our earlier 
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discussion, the boundary of 2, consists of one circle if y has Mobius strip 
neighborhood and of two circles if y has a cylinder neighborhood. 

In the example of Fic. 6, L = 1 and essentially the only choice of y is the cycle 
ADFA. The neighborhood of y is shown in Fic. 7. It is obviously a MObius strip. 


Fic. 7. 


The next step is to cap off the boundary of 2, with a suitably triangulated disc 
(or discs) as shown in FIG. 8. This creates a new surface 2’. The heavy lines in 
FIGURE 8 show how the existing tree T can be extended into each new triangle to get 
a new tree T” that will be a spanning tree for Gy. We then compute L’ = C( K,.) by 
simply counting the vertices and edges added to or lost from K, to produce K+. 


é 
: ° 
’ ‘ 4 ‘ 


a 

e 
é 
a 


(a) one boundary component (b) two boundary components 
FIG. 8. 


LEMMA. If 2, has one boundary circle, then L' = L — 1; if &, has two boundary 
circles, then L’ = L — 2. 
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The whole procedure can now be iterated, decreasing the value of L each time. In 
a finite number of steps, then, we arrive at the L = 0 case, which corresponds to a 
sphere. As outlined in Section 1, each surgery (the process of cutting along y and 
capping off the boundary of 2,) is equivalent to removing a torus, a projective 
plane or a Klein bottle. This completes the proof of the theorem. 
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Another Required Reading Program for Mathematics Majors 


JAMES C. REBER 
Department of Mathematics, Indiana University of Pennsylvania, Indiana, PA 15705 


I appreciated the article by Robert L. Brabenec, “A Required Reading Program 
for Mathematics Majors” [this MONTHLY, April 1987]. For precisely the reasons he 
presented we have introduced a reading program at Indiana University of Pennsyl- 
vania. The differences between programs, and respective advantages and disad- 
vantages might be worth noting. 

Our program is tied to individual courses required for all majors, and uses 
readings on reserve in the library. Each instructor teaching a course in the program 
requires the readings for that course, specifies how the requirement is met (typically 
with a short reaction paper), and determines how the assignment will be graded. 
General readings are used in courses populated primarily with mathematics majors; 
readings explicitly tied to the course are used in courses that include many 
non-mathematics majors. 

One disadvantage of this approach: though encouraged to complete the assign- 
ment early in the semester, students often complete it amid the pressure of other 
work. 

Advantages: students minoring in mathematics or taking just a few courses 
complete some readings; faculty are more likely to integrate the readings into the 
course; there is no departmentwide bookkeeping; and the program is easy to start 
since it can be phased in several courses at a time. 

The explicit readings we use might be instructive, though I agree with Brabenec’s 
comment, “The specific content of what our students are reading is not nearly as 
important as the fact that they are reading.” 

Calculus I. Read two of three essays on the nature of mathematics from 
Mathematics: People /Problems /Results: “Math and Creativity” by Alfred Adler; 
“The Meaning of Math” by Morris Kline; “Math as Creative Art” by Paul Halmos. 
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Calculus IIT. Read specified selections from either The Mathematical Experience 
by Davis and Hersh or from How to Solve It by Polya. 

Calculus IIT. Read specified selections from either Infinity by Lieber or from 
Bridges to Infinity by Guillen. 

Introduction to Linear Algebra. Read three articles from Scientific American as 
follows. Either “Linear Programming,” August 1954, or “The Allocation of Re- 
sources by Linear Programming,” June 1981. One of these articles: “Input-Output 
Economics’’, October 1951; “The Structure of U.S. Economy,” April 1965; or “The 
World Economy by the Year 2000,” September 1980. One article from the 
“Mathematics and Modern World” issue of September 1964. (“Math in Social 
Sciences” is most relevant to this course.) 

Introduction to Algebraic Structures. Read either Flatland by Abbott, or specified 
selections from Kline’s Mathematics, The Loss of Certainty and from Kasner and 
Newman’s Mathematics and the Imagination. 


A Principal Ideal Domain That Is Not a Euclidean Domain 


Oscar A. CAMPOLI 
Facultad de Matematica, Astronomia y Fisica, Valparaiso y R. Martinez Ciudad Universitaria, 
5000 Cordoba, Argentina 


Introduction. In most advanced undergraduate and graduate algebra texts a very 
simple argument is used to show that a Euclidean domain is a principal ideal 
domain (PID). And then it is mentioned that the converse is not true, sometimes 
together with the claim that the subring A = Z[#] = {a+ bO#ja,beZ, O= 
(1 + ¥—19)/2} of the complex numbers is a PID but is not a Euclidean domain. I 
have not been able to find a proof, accessible to beginning graduate students, in any 
standard reference (e.g., [1, 2, 3, 4]). 

In what follows it is shown in an elementary fashion that A has both properties. 

The proof that A is not a Euclidean domain is in [5] but we use here a shorter 
argument suggested by the referee. 

One way to see that A is a PID can be found in algebraic number theory books 
where the class number of the field @ (y — 19) is computed. The proof given here 
uses that A is “almost” a Euclidean domain in the sense that it has a “generalized” 
Euclidean algorithm. A criterion (sometimes attributed to Dedekind and Hasse) is 
then proven and used to show that A is a PID. 


A is not Euclidean. In general, it is not clearly stated what Euclidean domains 
are. A definition is as follows: 

A Euclidean domain consists of an integral domain A together with a map 
||:A — Z (the Euclidean norm) that satisfies the following conditions: 

(i) | (a) = ja| > O for all a € A; |a| = Oif and only if a = 0. 

(ii) |ab| = |a|- |b| for all a, b € A. 

(iii) (Euclidean algorithm) Given a, b € A, b # 0, there exist g,r © A so that 
a=qb+r with |r| < [b|. 
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It is interesting to note that condition (11) of the definition can be weakened to 
(11) |a| < |b| whenever a divides b (for nonzero b), 
which follows easily from (ii). In fact (i1’) will be used instead of (ii). 

To show that A is not Euclidean it is sufficient to prove that .A does not admit a 
function || satisfying the three stated properties. Thus assume that || is a Euclidean 
norm in A. This leads to a contradiction. 

Indeed, let U be the set of nonzero elements in A with minimal norm. Since 
every unit of A divides every nonzero element, (ii’) implies that every unit is in U 
and (ili) implies that every element of U divides every nonzero element of A; so U 
consists precisely of the units of A. 

We next show that U = {1, —1}. In order to prove this and other assertions a 
few specific calculations in the ring A are needed. 

The following identities can be proved directly from the definition of 6 = 
(1 + ¥-19)/2. For ae A, a denotes the complex conjugate of the complex 
number a. 


(I) 6=1-6 
(II) 06 = 5 
(Ill) 07 =6-—5 


(IV) For any x = a+ bOEA, 0x = —S5b+4+ (at b)8. 

From (I) it follows that A is closed under complex conjugation. Identity (II) 
implies that the integer 5 is not a prime in A. Later it will be clear that @ is not a 
unit in A and it will then follow that 5 is reducible in A. From (III) it follows that 
6? © A and hence A is closed under complex multiplication (a fact not obvious 
frm the definition of A). 

If N(z) = zz is the usual complex norm, then the preceding identities yield: 

(V) N(a + b0) = (a+ bO)(a + bO) = a? + ab + 5b’. 

Moreover, the function N: A — Z satisfies 

(a) N(xy) = N(x)N(y) for all x, y € A, and 

(b) N(x) > O for all x € A and N(x) = 0 if and only if x = 0. 

This immediately implies that if an element a + b@ © A is a unit then a* + ab 
+ 5b* = N(a + b0) = 1 and hence, if ab > 0, then b = 0 and a = +1. Also, since 
a+ b0=a+b-— b0and1=N(a+ bd) = N(a+ bO) = (at b)? — ab + 4b’, it 
follows that when ab < 0 then again b = 0 and a = +1. This concludes the proof 
of the fact that U = {1, —1}. 

Now assume that m is of minimal norm among the elements of A different from 
0, 1, — 1. Condition (iii) implies that 2 = gm + r, with |r| < |m|; therefore r is one 
of 0,1, or —1. Hence either m divides 2 or m divides 3. We claim that m must then 
be one of +2, +3. 

This claim is a consequence of the fact that 2 and 3 are primes in A, which is 
shown as follows. Suppose 2 = (a + b@)(c + d@). Then 4 = N(2) = N(a + b@) 
N(c + d@@) and assuming that a + b@, c + d@ are not units in A, it follows that 


2=N(a+ b0) =a? + ab+ 5b? = N(a+ b0) = (a+b) — ab + 4b?. 


Therefore, considering the cases ab > 0 and ab < 0, we conclude that b and d 
each equal zero. 

Thus 2 = (a + b@)(c + d@) = ac is an integral factorization. Since 2 is a prime 
in Z, 2 is a prime in A. A similar argument shows that 3 is also a prime in A. 
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Now, again using (ili), 8 is congruent to 0,1, or —1 modulo one of +2 or +3. 
Hence # or 6—1 or @+ 1 is divisible by 2 or 3. But this is impossible since 
N(@) = 5 = N(@ — 1) and N(@ + 1) = 7, while N(2) = 4 and N(3) = 9. 


A is a PID. As stated in the introduction, to show that A is a principal ideal 
domain (PID) it is enough to show that A is “almost”? a Euclidean domain. More 
precisely, it may be seen that given elements a, B € A, B # 0, if B does not divide a 
and N(a) > N(f) then there exist y, 6 € A such that 

0 < N(ay — BS) < N(B). (1) 

This property implies that A is a PID by an argument similar to the one usually 
applied to show that Z is a PID. Let J # 0 be an ideal in A. Let B © I be an 
element such that N(f) is minimal among the nonzero elements in J. Then BA = I. 
Indeed. since clearly BA C I, consider the possibility of having an element a € J 
such that B does not divide a. Then a # 0 and hence N(a) > N(B). Now using (i) 
it is possible to obtain another nonzero element ay — 86 in J which contradicts the 
minimality of N(£). 

To show (i) take a, 8B € A, B # 0. If B does not divide a and N(a) > N(B) write 


b6 
—-=art , 
B 


where a and 5b are rational numbers and at least one of them is not an integer. This 
is possible since the inverse of 8 as a complex number is in Q[8], which is a subfield 
of C. 

A case by case consideration leads to elements y and 6 € A such that 


0<a(Fy—3| <1, whence N(ay — 6B) < N(B). 


There are seven cases. 


Case 1: b © Z. Then a € Z and we may take y = 1 and 6 = {a} + b@ (here 
{ x } denotes the integer nearest x, with {n + 1/2} = n). Now, 


a 1 
0<Fy-8| <7 <1. 
Case 2(a): a& Z and 5b¢ Z. Then ru =a-+5b-—aé and we may take 
y = 0,5 = {a+ 5b} — af. 
Case 2(b): a = Z and 5b @. Take y= 1,5 =a+ {bY80. 
Case 3(a): a,b € Z and 2a, 2b © Z. Then, although we proved IV for a, b € Z, 


it is clearly valid also for a, b rational and hence 8a/B = —5b+ (a+ b)6@ and 
a+béZ. Therefore, we may take y = 0, 8 = {—5b} + {a+ b}8. 

Case 3(b): a,b € Zand2a,2b € Z. Then either |b — {b}| < 1/3 or [2b — {25}| 

< 1/3. In the first situation take y = 1 and 6 = {a} + {b}@ and estimate 
O<N 5 » 1 
<Nij-—y-46|]< — <1. 
a | 36 
In the second situation take y = 2 and 6 = {2a} + {2b}6 with the same estimate. 
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Case 3(c):a,b € Z,2a © Zand2b € Z. When5b © Ztakey = 5andé = {5a} + 
5b@ and when 5b & Z take y = 26 and 6 = {2a + 10b} — 2a8. 


Case 3(d): a,b € Z, 2b € Z and 2a € Z. Take y = 2, 6 = {2a} + 2b8. 
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ELEMENTARY PROBLEMS 
E 3289. Proposed by C. Alsina, Universitat Politecnica de Catalunya, Barcelona, 
Spain. 
Let J = [0,1]. Prove that there are exactly two mappings T from I X J to J 
which have the following five properties for all x, y, z in I: 


(1) T(T(x, y), z) = Tx, T(y, z)), 

(2) T(x, y) = TV), x), 

(3) T(x,z) < T(y,z)ifx<y, 

(4) T(x,1) = x, 

(5) T(zx, zy) = z*T(x, y) for some fixed positive k. 


E 3290. Proposed by Douglas B. West, University of Illinois, Urbana, IL, and 
Douglas H. Wiedemann, Institute for Defense Analyses, Princeton, NJ. 
Given nonnegative integers /,m,n with /,n < m, evaluate the double sum 


l 


EE om NIA) 


i 
i j=0 


E 3291. Proposed by Jerrold R. Griggs, University of South Carolina, Columbia, 
SC. 


Suppose y > x > 0. Is it always true that y* > x”? 
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E 3292. Proposed by Solomon W. Golomb, University of Southern California, 
Los Angeles, CA. 


(a) What is the maximum number of consecutive days with no occurrences of 
Friday the thirteenth? 

(b) What is the minimum number of consecutive days with four occurrences of 
Friday the thirteenth? 


E 3293. Proposed by Joseph Keane (student), Carnegie-Mellon University, 
Pittsburgh, PA, and Gregg Patruno, The First Boston Corporation, New York, NY. 


Suppose that the distinct circles C, and C, intersect at P and Q. Suppose that 
the tangent to C, at P intersects C, again at A, the tangent to C, at P intersects C, 
again at B, and the line AB separates P and Q. Let C, be the circle externally 
tangent to C,, externally tangent to C,, tangent to line AB, and lying on the same 
side of AB as Q. Prove that the circles C, and C, intercept equal segments on one 
of the tangents to C, through P. 


E 3294. Proposed by Daniel Goffinet, Saint Etienne, France. 


Characterize the topological spaces in which the following assertion is true: For 
each open subset G of a closed subset F there exists a set A with G as its interior 
and F as its closure. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Zeros of a Combination of Orthogonal Polynomials 


E 2979 [1982, 757]. Proposed by Emeric Deutsch, Polytechnic Institute of 
New York. 


Let { p,} and {q,} (k = 0,1,2,...) be two sequences of real, monic, orthogonal 
polynomials, p, and qg, having degree k. Show that for any nonnegative integers m 
and n, the polynomial p,,g,41 + Pmn+19, has only real simple zeros. 


Composite Solution based on the solutions of Richard Askey, University of Wiscon- 
sin (Madison), David Secrest, University of Illinois at Urbana-Champaign, and the 
proposer. By Theorems 3.3.1 and 3.3.2 of Gabor Szegé’s Orthogonal Polynomials 
(Amer. Math. Soc., Providence, 1975) the zeros of p, are real and simple, the zeros 
of p,, and p,,,, iterlace, the zeros of qg, are real and simple, and the zeros of q,, 
and gq,.,, interlace. It follows (cf. op. cit., Theorem 3.3.5) that we have partial 
fraction decompositions: 


Py(X) — a, q,(x) by 
ny ) ry ) 
Pin+1(X) j=0% 4 Qnt(X) yay ¥ — Mx 
where uo, u,,..., u,, are the zeros of P,,41, U9, U1,---, U, are the zeros of g,,, and 


the coefficients a, and b, are positive. 
Now let S be the given polynomial p,.g,41 + Pm+i9, and put 
s ' : 
F —- Et P + An 


Pm+19n+1 Pm+1 Gn+1 
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Let r be the number of common zeros of p,,,, and g,,,. Then F has exactly 
m+n-+2-—~r poles, all of which are real. Since both p,/p,,,, and q,/q,4 , have 
positive residues at their poles, it follows by addition that the residue of F at each of 
its poles is positive. Accordingly F goes from +o to — oo on the interval between 
any two consecutive poles. Hence F must have at least m+n+1-—vr zeros 
between its poles; these zeros are also zeros of S, since S = Fp,,.49,4+ 1. However, 
since the poles of F are simple, the r common zeros of p,,,, and q,4, are also zeros 
of S and are distinct from the above mentioned zeros of F. Hence S must have at 
least m + n + 1 distinct real zeros; since deg S = m + n + 1, in fact S has exactly 
m +n -+ 1 real simple zeros. 


Trisecting a Triangle by Cevians 


E 3155 [1986, 482]. Proposed by Gene Bennett, John Glenn, and Clark Kimber- 
ling, University of Evansville.’ 


Prove that for any triangle ABC, there exist points A’, B’, C’ satisfying 
(1) A’ lies on side BC, B’ on side AC, and C’ on side AB. 

(2) A’C + CB’ = B'A + AC’ = C’B + BA’; and 

(3) AA’, BB’, and CC’ concur in a point. 


Composite Solution based on the solutions of Jeffrey M. Cohen and others. Let AB 
be the shortest side. For any choice of C’ on AB, determine points A’ and B’ on the 
perimeter of triangle ABC which are one-third of the way around the perimeter 
from C’ in the opposite direction from A and B respectively. (One of these will be 
on the longest side; the other one will be either on the third side or on the longest 
side.) 

The location of the point J of intersection of AA’ and BB’ is a continuous 
function of C’. It lies on the B side of CC’ when C’ = A and on the A side of CC’ 
when C’ = B. The Intermediate Value Theorem implies that, as C’ sweeps through 
all positions from A to B, there must be at least one position where CC’ contains J. 
When this happens, A’ will be on the side BC and B’ will be on the side CA. 


Editorial comment. Most solvers used the 17th-century theorem of Ceva from 
projective geometry. Suppose a line is drawn from each vertex of a triangle to a 
point on the opposite side, thus cutting each side into two segments. Ceva’s theorem 
asserts that the product of the lengths of the three segments clockwise from the 
vertices equals the product of the lengths of the three segments counterclockwise 
from the vertices if and only if the three lines drawn are concurrent. When the lines 
are concurrent and the two triple products are equal, the three points on the sides 
are called “Cevian points.” Interestingly, the solvers using Ceva’s theorem were 
almost precisely those not from North America; Ceva’s theorem seems much better 
known among mathematical communities outside North America. In the present 
instance the use of Ceva’s Theorem was not an indispensable part of the argument. 


Solved by J. M. Cohen, J. Dou (Spain), J. Fukuta (Japan), L. Kuipers (Switzerland), N. Lord 
(England), O. P. Lossers (The Netherlands), E. Morgantini (Italy), V. Pambuccian (Romania), A. 
Pedersen (Denmark), W. Raffke (West Germany), V. Schindler (East Germany), L. Smith (student, 
Canada), J. H. Steelman, P. Tzermias (Greece), P. J. Zwier, and the proposers. Two incorrect solutions 
were received. 


1988] PROBLEMS AND SOLUTIONS 875 


Inside a Difference Set 


E 3176 [1986, 733]. Proposed by Anatole Beck, The London School of Economics 
and Political Science, England. 


Let A be a measurable subset of R. Is it true that the interior of A — A (set of 
differences) either contains 0 or contains nothing? 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. 
The answer is “no.” This can be shown as follows. Define 


A, ‘= | e,2'-*":e, € {0,1} for all n}. 
n=1 


A, = f + Ye 2-*":e, © {0,1} for all n}. 
n=1 


Then A is obviously measurable (with measure zero). Since Ay = {2/3} — Apo, 
A, = {7/3} — A,, Ap + A, = [1,2], we obtain 


zi < 1—2n zi 
A, — Ay = Ay + Ap - (5 = €,2 re, © {0,1,2} forall n> — 5} 
n=1 


7 00 1 
A, — A, = A, + A, - (3 — | y e,2- "6, € {0,1,2} for all n| — ‘s}: 
n=1 


2 1 4 
Ay ~ Ay = A, + Ap~ {5} = 3°3) 


4 1 
Ay — Ay = |— 35-35]. 


Now A — A is the union of these four sets and so it has a non-empty interior. 
However, if —1/3 < x < 1/3 and both x + 1/3 and x + 2/3 have three consecu- 
tive ones in their binary expansion (e.g., if x = 27> + 27* “kh > 2) then x € A, - 
A, (i = 0,1), sox € A — A and hence 0 € (A — A)?. 


Also solved by O. MatouS (Czechoslovakia) and the proposer. 


An Inequality for Polynomial Roots 


F 3184 [1987, 71]. Proposed by Calin P. Popescu, Bucharest, Romania. 


Suppose the roots of the polynomial P(x) = L"_)a,x', n > 2, are real numbers 
lying in (0,1) and that a, > 0. Show that Ere ia, > 0, whereO < k <n — 2. 
Solution 1 by David Callan, University of Bridgeport, Bridgeport, CT. Without loss 


of generality. we may assume a, = 1. Let x,,..., x, be the roots of P. We evaluate 
the kth derivative P'*)(1) in two ways. First, using the fact that Lx, = —a,,_,, we 
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obtain 


raya ali prnal(t)—(G Es +n} 


where 
n—-2 


T,= [i Ja 


i=k 


is the desired sum. Alternatively we perform k differentiations of the expression 
P(x) = [I(x — x;) and then substitute x = 1. To each term in the resulting sum we 
apply the inequality [](1 — y,) > 1 — Ly,, which holds for any r numbers y,,..., y, 
between 0 and 1 by induction on +. The resulting computation, in which the sum 
runs over the sets { j,, j>,---, J,-,} Of n — k indices between 1 and n inclusive, 
gives 


P*)(1) HDT —x,)> a{() = | m cia)? ys, } 


n—k 


Equating the two expressions for P“*)(1) yields T, > 0. 


Solution II by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The 
Netherlands. Weierstrass’s Inequality, easily proved by induction on n, states that 
IV, — x,) > 1— X%_,x, for any set of n numbers x,,..., x, between 0 and 1. 
We apply this to the case k = 0. Letting x; be the roots of P(x), we have 
P(x) = a,1T"_ (x — x;) and L"_1x; = —a,_,/a,. This yields 


n nN 


n-2, , 
E (5)e=P0)~e..- a= a4] FLA) + x71 > 0. 
1=0 ‘ 


For k > 0, the kth derivative of P(x) 1is the polynomial Q(x) = KE ,a, i \xi-k 
of degree n — k. The leading coefficient of Q(x) is positive and, by Rolle’s 
Theorem, its roots lie between 0 and 1. Therefore we can apply the argument above 
to Q(x) to obtain KEP 2a, '}> 0. 


Also solved by A. Aziz (India), J. Ferrer (Spain), A. Riese, J. H. Steelman, A. Stenger, Univ. of S. 
Alabama Problem Group, and the proposer. 


A Coloring Problem for Subsets of a Set 


E 3188 [1987, 72]. Proposed by Ioan Tomescu, University of Bucharest, Romania. 


Let X be a nonempty set having n elements and C be a color set with p > 1 
elements. Find the greatest number p satisfying the following property: If we color 
in an arbitrary way each subset of X with colors from C such that each subset 
receives only one color, then there exist two distinct subsets A, B of X such that the 
sets A, B, A UB, AC B have the same color. 


Solution by Rhodes Peele, University of South Alabama, Mobile, AL. The answer 
is p =n. If |C| <n, then in any coloring of the subsets of X = {x,,..., x,,} the 
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pigeonhole principle implies that some pair of distinct subsets in the chain ©, 
{x,},{x,, x,},..., X have the same color. For such a pair A, B with A C B, we 
have A 1 B =A and A U B = B, so they form the required subconfiguration. 

Conversely, suppose |C| > n, and color the subsets of X in such a way that A 
gets color |A|. If A and B have the same color, then A M B and A U B have other 
colors. 


Editorial comment. Note that we are coloring the elements of the power set of _X, 
not the elements of X. O. P. Lossers pointed out that if the four sets A, B, AUB 
and A B are required to be distinct, then the maximum value of p can be much 
lower than n. In this case A 1 B and A U B must have different cardinalities from 
A and B, because |A| + |B| = |A U B| + |A 2 B|. When n = 6, giving A color 1 
if |A| © {0,1,4,6} and color 2 if |A| © {2,3,5} yields no monochromatic config- 
uration; since color depends only on cardinality, such a configuration requires a 
monochromatic solution i, j7,k,/ to k+/=i+j7 with i<k </<y. Thus for 
n = 6, the maximum p is 1. More generally, the maximum value of p is less than 
the number of colors required to color {0,..., 1} without having a monochromatic 
solution i, j,k,/tok+/=i+j withi<k<l</j. 

Also solved by R. E. Bernstein, N. Falsinger, J. Ferrer (Spain), O. P. Lossers (The Netherlands), J. H. 
Steelman, P. Tracy, and the proposer. 


An Alternating Ratio Sum 


E 3190 [1987, 181]. Proposed by Vasanth B. Solomon, Drake University, Des 
Moines, IA. 


Show that 
j (-1)"(w - 2r)(4) 
pao (N-r) ss (N-r-J) 


Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The 
Netherlands. The assertion is valid for all (complex) N € {0,1,...,27}. We begin 
with the partial fraction expansion 


lea} (Fa) BON Clee 


Applying this with z = N — r, we can rewrite the desired summation as 


SPE gow 


r=0 t=0 N-r-t 


= 0 for j>OQOand N > 27). 


Replacing (N — 2r)/(N —r-—t) by 1+(t—r)/(N — r— t) breaks this expres- 
sion into two double summations. The first is zero because £/_)(—1)' J) = 0, and 
the second is zero because its summand is skew-symmetric in r and ¢. 


Also solved by J. N. Fitch, I. Gessel, A. A. Jagers (The Netherlands), L. E. Mattics, B. E. Rhoades, J. 
H. Steelman, P. Tracy, K. H. Vijitha-Kumara, and the proposer. 
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A Characterization of Rapid Convergence 


E 3201 [1987, 371]. Proposed by Grahame Bennett, Indiana University, Blooming- 
ton, IN. 


Let us say that a convergent Series of positive terms, /a,, is rapidly convergent if 
there exists a sequence of positive numbers { 5, } such that 


X 4, y b, = O( by). 


m=1 

Prove that the series Ya, is rapidly convergent if and only if 
> 1 

a, = o| — | 
n=N N 


(This shows that the termwise sum of two rapidly convergent series is rapidly 
convergent.) 


Solution by University of South Alabama Problem Group, Mobile, AL. Let 
A,, = L%_,,a, and B, = L%_,b,. If La, is rapidly convergent, then by definition 
there exists a number c such that 


io @) io @) 
ch, >), a,B,=A,B,+ ) Axby 
k=n k=nt+]1 
for all n. If N > n, restricting the sums yields cb, > AyB, + Lh_,,,15,Ay = AnBy- 
Summing this yields cB, > NA,By, and hence A, < c/N. 
Conversely, if A, <c/N for all N, set b, = n~'/*. Then Y%_,a,B, = AyBy + 
ye An <2WeN 1/2 + crn 3/2 = O(N) = Ol by). 


Also solved by M. S. MacPhail (Canada), N. Martin, and the proposer. 


An Exact Difference Cover of Integers 


E 3202 [1987, 372]. Proposed by Paul Erdés, Hungarian Academy of Science, 
Budapest. 


Prove that there is an increasing sequence of positive integers a,, a,,... with the 
following two properties: 

(1) Every positive integer n can be uniquely expressed in the form n = a, — a,, 

(2) a,/k? is bounded. 


Solution by the proposer. Choose a, = 1, a, = 2. Suppose aj, ay,..., a, have 
already been chosen so that the k(2k — 1) differences a,—a,with 1 <i<j<2k 
are all distinct. Suppose m is the first positive integer that does not occur among 
these k(2k — 1) differences. We choose a,,,, and a,,,, so that a5,,5 = a),,, + 
m and so that the 4k differences 


Ann+1 4; 
Ann. — Ay = Angyy TM — a; 


differ from m and from the k(2k — 1) differences already obtained. In particular, 
Q5,4 1 Must avoid the values a,,a,,..., a, and 4k[k(2k — 1) + 1] other values. 
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Choosing the smallest value not forbidden, we have 
Oop.1 < 2k + 4k[k(2k — 1) +1] +1 = 8k? — 4k? 4+ 6K +1< (2k +1), 
Oop 09 = Ag EM <Any, +K(2k-1) +1 < (2k +2). 
The sequence so constructed begins 1, 2,5,7,15,22,.... 


Editorial Comment. Hang-Fai Yeung pointed out that this problem appeared as 
problem P290 in the Canadian Mathematical Bulletin, 23 (1980) 380, solved differ- 
ently by Erdés in 24 (1981) 505. In the same Bulletin, 24 (1981) 497-499, Andrew 
Pollington and Charles Vanden Eynden showed it is possible to choose a, in the 
interval [ck?, c(k + 1)°] for a suitable constant c. 


Also solved by I. Kozma (Israel), L. E. Mattics, and H.-F. Yeung (Australia). Two incorrect solutions 
were received. 


Uniquely Fibonacci 


E 3210 [1987, 457]. Proposed by Paul A. Smith (student), University of Wiirzburg, 
West Germany. 


Determine all pairs (m,n) of integers such that 
l<me<n, m? = —1(modn), n* = —1(modm). 


Solution by students in the 1987 Mathematical Olympiad Program, U.S. Military 
Academy, West Point, NY. Let (a, b) be any pair of integers such that 


l<a<b, a” = —1(mod dD), b? = —1(mod a), b> 1, 


and consider the transformation T(a, b) = (c, a), where c = (a* + 1)/b. Observe 
that c? = (a? + 1)*/b? = —(a* +1)? = -1 (moda) and a*=bc-1=-1 
(mod c). Also, 1 < c < a, with c = a only when a = c = 1. Thus, we can repeatedly 
apply 7 through pairs satisfying the constraints and obtain T*(a, b) = (1,1) for 
some k > 0. It now follows that (a,b) = T-*(1,1), where T '(m,n) = 
(n,(n* + 1)/m). 

Let F, denote the nth Fibonacci number. Since T~'(1,1) = (1,2) = (4, F;) and 
F? =F _4F,,,+(-1)" for n> 3, it follows by induction that T~*(1,1) = 
(Fy,_1, Fop43) for all k > 1. Thus 


(1,1), (1,2), (2,5),..-.(Pax-as acai): 
constitutes the list of all pairs that satisfy the given condition. 


Editorial Comment. Charles Aschbachér and Hugh Edgar independently noted 
that this problem was solved by James C. Owings, Jr., in “Solution of the System 
a” = —1 (mod db), b? = —1 (mod a),” Fibonacci Quarterly, 25 (1987) 245-249. 

The proposer remarked that the condition m? = —1 (mod n) implies that the 
Dedekind sum s(m,n) is zero. (Cf. Hans Rademacher and Emil Grosswald, 
Dedekind Sums, Carus Monograph No. 16, 1972, page 26.) Similarly the condition 
n* = —1 (mod m) implies that s(n, m) = 0. By the reciprocity formula for Dede- 


kind sums (op. cit., Chapter 2) 


m* +n* +1 = 3mn, 
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or 
(3n — 2m)’ — 5n? = —4. 


Thus (3n — 2m + nV5)/2 is a unit of norm —1 in the real quadratic field generated 
by V5, so that 


3n -2mtn¥5_ [1+ 75\7" 
2 7 2 
for some non-negative integer 7. Hence 
; ; 1 4 i5 2jt+1 1 _ V5 2j+1 
_ — + , 
n m 5 5 
5 1+ J5 2j+1 1 J5 2j+1 
me 1” 2 , 
so that 
1/145 \7" 1 [1-5 \0" . 
wo 5 \ 2 5\ 2 0 2D 
1/14 fi 2j+1 L/1- 5 2j+1 . 
— 5 ,) ~ 5 y) ~ “~2j+1° 


Also solved by the proposer and 27 other readers. 


ADVANCED PROBLEMS 


6583. Proposed by Moshe Laub, Jerusalem, Israel. 


(a) Let A be the set of positive integers z such that z’ is expressible as a sum of 
two nonzero squares. Prove that A has asymptotic density one. 

(b) Let B be the set of positive integers z such that the triangular number 
T, = z(z + 1)/2 is expressible as a sum of two nonzero triangular numbers. Prove 
that B has asymptotic density one. 


6584. Proposed by E. T. Parker, University of Illinois at Urbana-Champaign. 


Suppose a finite group G satisfies the following condition: If H is a cyclic 
subgroup of G of prime-power order or if H is an elementary abelian subgroup of 
G, then H lies in the center of the normalizer of H in G. 

(a) Show that G need not be abelian. 

(b) Show that G must be nilpotent. 


6585. Proposed by Alexandru Lupas, Sibiu, Romania. 
Prove that if —1 < x < 1, then 


sin(k arccosx) — yl — x ia 1-Pi(y) a 


k=l k 2 -~. I1l-y x-y 
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where 


PAX) = Sa 


is the nth Legendre polynomial. 


SOLUTIONS OF ADVANCED PROBLEMS 


6536 [1987, 194]. Proposed by Pei Yuan Wu, National Chiao Tung University, 
Hsinchu, Taiwan. 


Let T be a bounded linear operator on a complex Hilbert space H such that 
dim ker(T ) < oo. Define 


a(T) = inf {||T — S||: dimker(S) > dimker(T )} 
s 
and 
y(T ) = inf {||Tx||: x 1 ker(7) and ||x|| = 1}. 


It is proved on p. 365 of J. B. Conway, A Course in Functional Analysis (Springer 
Verlag, New York, 1985) that y(T) < a(T). Show that 


y(T) = a(T). 


Solution by John Henry Steelman, Indiana University of Pennsylvania. Given the 
result of Conway, it suffices to show that a(7T) < y(7). We show this to be true in 
any inner product space. 

Given any x 1 ker(T) with ||x|| = 1, let M be the subspace spanned by x and 
ker(7T). Since H = M ® M~, one can define a bounded linear operator S by 
setting S=0 on M and S=T on M~. Now any y € 4d can be written as 
y=m+Xx +m’ where m € ker(T) and m’ € M~. Then (JT — S)y = T(Ax) = 
A Tx, and by the generalized Pythagorean theorem, 


2 _ 2 2 mp2 2 
vil = [len[e + Axe + |lm’|]f > XY. 
Hence 


T-S ATx r 
P= S)yll IAPS IM ae 
Pal IPA Pal 
Thus ||T — S|| < ||7x||. Since dim ker(S) > dim ker(T), it follows that a(T) < y(T). 
Scott Hochwald considers the polar decomposition 7 = UP of T, and lets 
P =0 @ P, with respect to ker(P) ® (ker(P))~ . He shows that if o(S) denotes the 
spectrum of the operator S then 


y(T) = inf{A: A € o( P,)} 
— (inf{A >O:AE o(T*T)})'”’. 


Also solved by Mordechai Falkowitz, Jesus Ferrer (Spain), Ramesh Garimella, Scott Hochwald, Eero 
Posti (Finland), University of South Alabama Problem Group, and the proposer. 
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6537 [1987, 195]. Proposed by F. S. Cater, Portland State University, OR. 


Let (u,,) be a sequence of positive numbers. For each positive integer n and each 
x € (0,1), let (2”x) = 2"x — [2”x], where [2”x] denotes the greatest integer in 2”x. 
Let 

A = (2: 0 <x < land liminfu>'(2"x) = 0}. 

Prove that the measure of A is 0 or 1 according as Lu, converges or diverges. 

Editorial Comment. Jim Conklin observed that the solution of this problem can 
be obtained from Example 6.7, p. 85, of Patrick Billingsley’s Probability and 
Measure, Second edition, Wiley, N.Y., 1986. It is also a corollary of Theorem 2A in 
Walter Philipp’s article, Some metrical theorems in number theory, Pacific J. Math. 
20 (1967) 109-127. In terms of probability theory Billingsley’s Example 6.7 is a 
Borel-Cantelli lemma for weakly dependent events, whereas Philipp proves strong 
laws of large numbers for the indicator functions of such events. Here is how one 
can deduce the result from the above sources. 

Suppose du, = oo. Then, by a well known construction there exists a sequence 
yg, 10 such that Lum, = oo still holds. Now taking x, = u,@, in Billingsley or 
Philipp, for almost all x we have 


(2"x) < u,@, for infinitely many n. 
Hence 
liminf u~'(2"x) < limg, = 0. 


Now suppose Lu, < oo. Construct a sequence wy, foo such that Lu,w, < oo. By 
the same results as above, for almost all x we have 


(2"x) <u,~, for finitely many n’s only. 
Hence (for almost all x) 
u-*(2"x) >, foralln > N = N(x), 
so that for almost all x we have 
lim u,*(2"x) = oo. 
Thus the measure of A is 0 or 1 according to whether Liu, < oo or Vu, = oo. 


Solutions were also received from L. E. Mattics and the proposer. 
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New York, 1987. Vol. 1, Introduction. x + 469 pp. Vol. 2, Loop Amplitudes, 
Anomalies and Phenomenology. x11 + 596 pp. 


JOEL A. SHAPIRO 
Department of Physics and Astronomy, Rutgers University, Piscataway, NJ 08855 


The front-page article of the New York Times Magazine* called it “A Theory of 
Everything,” while to the Atlantic’ it was “The Gospel of String.” Controversy over 
its appropriateness within the Halls of Physics has arisen in magazines ranging from 
Physics Today* to the Chronicle of Higher Education.’ The Times had previously 
announced!: “There is a nagging suspicion abroad that the science of physics may 
be on its death bed.” 

What is this “Theory of Everything’ (TOE), and why does it arouse such 
passion? It is a set of highly mathematical approaches, collectively called “String 
Theory,’ proposed as the totally unified theory of all the forces of Physics. This 
would make it the culminating step in the series of unifications which has marked 
the dramatic progress in fundamental Physics in recent times. While unification has 
a long history, its modern era began with the melding of electromagnetism with the 
“weak” force responsible for the radioactive decay of nuclei. The combined theory, 
called Salam-Weinberg or electroweak theory, is the first really successful gauge 
theory based on a non-Abelian symmetry group, SU(2) X U(1). The next unifica- 
tion step attempts to unify this Salam-Weinberg theory with Quantum Chromody- 
namics (QCD), the gauge field theory based on the color group SU(3), which 
describes the strong interactions of hadrons. The resulting Grand Unified Theories 
(GUTS) require that at energy scales far beyond those available today, the underly- 
ing interactions are more symmetric, having at every point of space-time an 
enlarged symmetry group such as SU(5) or SO(10) or Eg. 

These theories are all gauge field theories, which means they are built on a fiber 
bundle with a compact Lie structure group and space-time as the base manifold. 
The connection on this bundle, viewed as an operator on the space of quantum 
mechanical states of the system, describes the quantum field associated with a point 
particle of spin 1. For electromagnetism this gauge particle is the photon, and 
corresponds to the one generator of U(1). In the electroweak theory, the SU(2) X 
U(1) generators give the W+* and Z bosons in addition to the photon. QCD is the 


*Oct. 18, 1987. 

April, 1986, p. 24. 

+P. Ginsparg and S. Glashow, “Desperately Seeking Superstrings?,” Physics Today, May, 1986. 

§Kim McDonald, “Controversial ‘Superstring’ Theory Captures Physicist’s Imagination but Could 
Prove Impossible to Verify,” July 23, 1986. 

{Malcolm W. Brown, “Physics May Be Up Against the Wall This Time,” The New York Times, July 
8, 1986. 
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theory of quarks and gluons, of which the hadrons are built. The quarks, which are 
not gauge particles, were invented first, but the eight generators of the SU(3) are the 
gluons, the glue of the strong interactions. This glue is so strong that neither the 
gluons nor the quarks can escape from their composite particles, which are 
the protons, neutrons, pions, and others, that populate the hadronic world. When 
GUT models unify these into a larger group, the standard model’s SU(3) X SU(2) 
x U(1) becomes a subgroup of a larger group. The extra generators needed have 
particles (so far unobserved) called X-mesons or baseballs. 

Gravity, the one remaining known interaction, is also a gauge field theory, but of 
a different type. In Einstein’s general relativity theory, the gauge transformations 
are “general coordinate transformations,” or diffeomorphisms, of the space-time 
manifold. The group is itself related to space-time, rather than representing an 
“internal” symmetry, and this causes the associated particle, the graviton, to have a 
higher spin. It is a spin-two, or tensor, particle. The quantum field theory of such 
particles does not provide an acceptable quantum mechanical description of gravity, 
because it is nonrenormalizable. Renormalization is the technique used in order to 
make sense of the problem that all four-dimensional (space + time) field theories, 
when evaluated by perturbation expansion about the noninteracting theory, produce 
infinite answers. These are due to the interactions taking place at the same point 
(ultraviolet singularities). In a renormalizable theory, it is possible to relegate the 
infinities into the unobservable parameters which describe the “bare” theory, 
expressing the resultant answers in terms of a finite number of finite “renormalized” 
coupling constants. But for gravity this is not possible. 

String theory is now seen as a theory which does successfully quantize gravity, 
along with the other interactions. In string theory, the basic objects are not point 
particles tracing out curves as they travel through space-time, as they are in field 
theory—instead they are strings, open or closed, which sweep out a two-dimen- 
sional surface, called the world sheet, as they travel through space-time. The 
coupling of three strings takes place over a smooth surface with no distinguished 
point. As a consequence, the ultraviolet behavior is nonsingular. There arises a 
perturbative expansion for scattering amplitudes in which a given term of the 
perturbative expansion is a functional integral over maps of a Riemann surface # 
into space-time .@. For theories which involve open strings, the surfaces may have 
boundaries. The full perturbation expansion involves summing over terms, that is, 
over the genus of the Riemann surface. For open strings we also need to sum over 
the topology of the boundaries. For each term, we must integrate over the moduli, 
and over the embedding maps X*: @ — @. In addition to the map into space-time, 
the “super” models involve fermionic functions on the Riemann surface. These 
models have an enhanced symmetry, called supersymmetry, which is a Lie algebra 
with a Z, grading. The particles corresponding to odd grade are fermions, such as 
electrons and quarks, which have half-integral spin. 

The string picture is a great change from ordinary field theory, much more tightly 
constrained by requirements of consistency. Unlike ordinary field theory, in which 
various forms of interactions can be postulated, string theory offers no choice of 
interaction, because the interaction is purely geometrical, a topological feature of 
the embedded surface. Among the inescapable requirements of string theory is the 
requirement that the manifold through which the string moves have a particular 
(and large) dimension, 10 for superstrings and 26 for bosonic strings. There is also 
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no way to get around having a spin-two massless particle, and one generally gets a 
number of spin-one massless particles. 

If superstring theory does turn out to be the TOE, historians of science will have 
a hard job explaining why it came into being. It originated in the model of 
Veneziano (1968), which described something completely different, the phenomenol- 
ogy of the scattering of mesons, most importantly the existence of a tower of 
unstable particles forming a linear plot in spin vs. mass-squared. The slope of this 
line defines a hadronic mass scale of about 1 GeV. After a dramatic proliferation of 
phenomenological papers, the field quickly changed emphasis in an attempt to make 
a consistent complete theory of hadronic physics. The outstanding problem of string 
theory in its hadronic phase (alias dual-resonance model) was the wrong dimen- 
sionality and the theory’s insistence on having massless spin-one and spin-two 
particles, something not present in the hadronic world. Its lack of ultraviolet 
singularities also implied softness in deep inelastic scattering, just when hardness 
was emerging as the new experimental feature on which all theorists should focus 
their understanding. Finally, in 1974, it became accepted in the high-energy com- 
munity that QCD provided the theory of hadrons, and interest in strings dropped 
dramatically. But just as the hadronic era of strings was collapsing, Scherk and 
Schwarz proposed that the same model, with its scale changed by a factor of 10”, 
could describe a quantum theory of gravity. This idea did not fall on immediately 
fertile ground—during the period from 1976-1981, almost no work was done on 
strings. It was a period of work on GUTS, and also on supergravity, which is based 
on gauging supersymmetry. In 1977 it was noticed that one such theory is the 
massless sector of supersymmetric string theory. In the early ’80s, Green and 
Schwarz developed the manifestly space-time supersymmetric form of string theory. 
Finally in 1984 they published their paper on anomaly cancellation, and almost 
overnight strings emerged from their long slumber and started the rise to the 
superstar status they currently enjoy. 

Anomalies are problems which arise in theories with symmetries during the 
renormalization process. To regularize the formally divergent integrals, one may 
need to impose a cutoff which violates the symmetry. If the effect of this cutoff does 
not disappear into a renormalization, one has an anomaly, with consequences 
ranging from loss of symmetry in some theories, to inconsistencies in the case of 
gauge anomalies. Thus there was a big impact when it was shown that the gauge 
anomaly problems which plagued ordinary GUTS theories were solved in an almost 
unique way in string theory, provided the internal gauge symmetry group was one of 
two possibilities, SO(32) or E, X E,. Furthermore, possible gravitational anomalies 
also cancelled under the same conditions. The excitement was enhanced because this 
was the first time a fairly abstract consistency requirement had picked out, almost 
uniquely, a symmetry group. At the time, only open string theories had gauge 
groups, and these could only be SO(N) or SP(N), so there was no Ey X Eg string. 
Soon thereafter, however, the heterotic closed string was invented. It requires an 
even, self-dual, 16-dimensional lattice, again picking out SO(32) and EF, X E;! It is 
even said that E, X E, has properties good for phenomenology. 

In this second active period, since the anomaly cancellation, string-theory re- 
search has been distinguished by the level of mathematical sophistication of much 
of the development. The bias against formal mathematics, intensely instilled into 
physics students in the ’60s, began to erode during the late ’70s, when everyone was 
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exploring gauge theories. The power of notions of algebraic topology, such as index 
theorems, was proven in Yang-Mills theory. A greater appreciation of differential 
geometry came about because the compactifications attempted in supergravity 
involved some rather unintuitive Riemann spaces, so that more sophisticated 
mathematics was needed to investigate them. Still more than Yang-Mills theories or 
supergravity, however, strings provide a rich field for the mathematically inclined. 
Even the simplest theory, the bosonic closed string, has a sum over all embeddings 
of Riemann surfaces into space-time. This requires a measure on moduli space, 
which must make the singular contributions from the boundaries of moduli space 
consistent with the physical requirement of unitarity. 

The bosonic theory is a rich development tool, but physics requires fermions as 
well, and this leads to the superstring. Actually, the inclusion of fermions is also 
required to get rid of one fatal flaw which the boson theory has—a particle which 
travels faster than light, called the tachyon. When fermions are included, it is 
possible to abolish the tachyon. One method of incorporating fermions is to include 
a fermionic field representing a space-time spinor defined on the world-sheet. This 
leads to the manifestly space-time supersymmetric theory known as the Green- 
Schwarz string. An alternative approach to an equivalent theory is to consider 
supersymmetry on the world-sheet. In fact, this Ramond-Neveu-Schwarz formula- 
tion of 1971 was the first place where the supersymmetry idea became popular. Here 
we have a supermanifold, with one or two Grassmann coordinates in addition to the 
two ordinary ones. Quantization involves summing over all functions from super 
Riemann surfaces into ordinary space-time. So now we need new mathematics, the 
study of supermoduli space. 

There is also a great deal of old but sophisticated mathematics now necessary in 
doing string theory. I will give one example. Physics requires that something be 
done with the six dimensions that are not manifest to our eyes and watches. 
Generalizing an old idea of Kaluza and Klein, we now believe that the ten-dimen- 
sional manifold, while nearly flat and very large in four dimensions, is tightly curled 
up in the other six. Thus the ten-dimensional space-time is approximately M* x K, 
where M* is four-dimensional Minkowski space and K is a compact six-dimen- 
sional manifold. A quantum theory of gravity has a natural scale of 10!? GeV in 
energy or 10 °° cm in distances. If we assume that the dimensions of K are on that 
scale, all current physics is very low energy indeed. Thus all particles we know of are 
essentially massless, corresponding to zero modes of differential operators on K. In 
general, these zero modes will only be there if there is a reason for them, which 
brings in the concepts of index theorems and cohomology classes. Massless particles 
exist because a nonzero index or a nonzero Betti number requires them. 

Another physical value which seems to be zero compared to the natural scale is 
the cosmological constant, which may be theught of as the energy density of the 
vacuum. In any ordinary field theory without supersymmetry, it is naively predicted 
to be infinite, because the vacuum energy density is the most divergent of all 
quantities in quantum field theories. Supersymmetry arranges a cancellation be- 
tween the contributions of bosons and fermions. If the cosmological constant is 
neither zero nor infinite, it would be naively expected to be ~ 1 in natural units, 
where Planck’s constant, the speed of light, and Newton’s gravitational constant are 
all set to one. A density of 1 in these units corresponds to 5 x 10”? g/cm?, whereas 
the experimental upper bound on the cosmological constant is 107°? g/cm’, 
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undoubtably the worst finite error of any prediction in science. Even if we begin 
with a theory that is supersymmetric before compactification, a compactification 
which breaks all supersymmetry at the gravitational scale could not be consistent 
with this density. This means that some supersymmetry must survive compactifica- 
tion; the state which corresponds to the vacuum is invariant under some supersym- 
metry transformations. The supersymmetry parameter then needs to satisfy the 
equations of a massless spinor field, which on K means there must exist a 
covariantly constant spinor field ». Examining the spin connection, which is a gauge 
field of SO(6) ~ SU(4) on the six-dimensional manifold K, we can ask about the 
holonomy. The existence of the covariantly constant spinor 7 implies the group 
fixes the spinor 7, so the holonomy must lie in the remaining SU(3). The covariant 
spinor defines a Kahler form 71; ,n, where I’’s are defined as antisymmetrized 
products of the Dirac matrices on K. Thus K becomes a three-dimensional complex 
manifold, and the restriction to SU(3) holonomy means that the first Chern class 
vanishes, and K is a Calabi-Yau manifold. Classification of three-dimensional 
Calabi-Yau manifolds is a growth industry these days. 

Writing a textbook which can give a graduate student the ability to access the 
current string literature is a monumental task. Superstring Theory represents a 
major effort by three of the most significant contributors to the field. Two of the 
authors are primarily responsible for the developments which caused the rebirth, 
and the other has been the most prominent exponent of strings since then. They 
have designed a text for advanced graduate students and others familiar with basic 
quantum field theory, and they have done an excellent job in covering all the most 
important areas without assuming more than is reasonable of the student’s prepara- 
tion. It is clear that a great deal of thought has been given to the approach, trying to 
use moderate tools and develop new ones as needed. Adequate discussion of results 
from the first incarnation are included, while maintaining a modern approach in the 
overall development. In an attempt to keep the total to 1,000 pages, many of the 
demonstrations are overly sketchy, so that working through the details is an 
educational but demanding process. A student attempting to do so will surely find 
places where the help of an expert in the field is required. The use of conformal field 
theory techniques is also not as well developed as I would have liked, and exercises 
would have been very useful. 

There are also many hot topics which are not covered. The last two years has 
been a period during which much of the research is attempting a broader and deeper 
fundamental understanding of the whole theory. Several new abstract approaches to 
reformulation are being pursued. For example, the role of general coordinate 
invariance in space-time is not transparent in the conventional approach, and might 
be better understood in string field theory. Such current topics, which may or may 
not eventually prove to help in understanding strings, cannot reasonably be covered 
in a text at this time. Thus it is appropriate that these volumes cover the essential 
fundamentals of the field widely and well. This is clearly the book on string theory. 
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Calculus with Analytic Geometry. By George F. Simmons. McGraw-Hill Book 
Company. New York, 1985. xiii + 950 pp. $46.95. 


UNDERWOOD DUDLEY 
Department of Mathematics and Computer Science, DePauw University, 
Greencastle, Indiana 46135 


This is about calculus books, and there are seven important conclusions. 

Let us go back to the beginning and look at the first calculus book, Analyse des 
Infiniment Petits, pour L’ intelligence des Lignes Courbes, published in Paris in 1696 
and written by Guillaume Francois Antoine L’Hospital, Marquis de Sainte-Mesme, 
Comte d’Entrement, Seigneur d’Orques, etc. To almost everyone, L’Hospital is just 
a name attached to L’Hospital’s Rule and almost no one knows anything about him. 
His memory deserves better. He displayed mathematical talent early, solving a 
problem about cycloids at fifteen, and he was a lifelong lover and supporter of 
mathematics who, unfortunately, died young, at the age of 43. Also, L’Hospital was 
no mean mathematician. He published several papers in the journals of the day, 
solving various nontrivial problems. I know that I could not have found, as he did, 
the shape of a curve such that a body sliding down it exerts a normal force on it 
always equal to the weight of the body. Further, as Abraham Robinson has written, 


According to the testimony of his contemporaries, L’Hospital possessed a very 
attractive personality, being, among other things, modest and generous, two 
qualities which were not widespread among the mathematicians of his time. 


His book was a huge success. There was a second edition in 1715, and there were 
commentaries written on it. I have the 1781 edition, with additions made by another 
author. Not many textbooks last almost 100 years. Birkhoff and MacLane is not yet 
50 years old. L’Hospital’s book is about differentials and their applications to curves 
and the style is exclusively geometrical. There are not many equations, but there are 
an awful lot of letters and pictures, just as I remember in my tenth grade geometry 
text. Mathematics was geometry then, and mathematicians were geometers. 

There are many things not in the book. There are no sines or cosines, no 
exponentials or logarithms, only algebraic functions and algebraic curves. There are 
also no derivatives, only differentials. Here is L’Hospital’s proof of L’Hospital’s 
Rule, using differentials. In Figure 1, points on the graph of f(x)/g(x) are found 
by dividing lengths of abscissas, except at a. But if you dx past a, the point on the 
graph will be df/dg. Since dx is infinitesimal, that ratio gives you the point at a. Is 
that not nice? 

It took some time for calculus to become generally taught in colleges. Eventually 
it made it, and calculus textbooks began to appear in the nineteenth century. I have 
a copy of one, Elements of the Differential and Integral Calculus, by Elias Loomis, 
Ll. D., Professor of Natural Philosophy at Yale College. His calculus was first 
published in 1851, and my copy of it is the 1878 edition. It sold in excess of 25,000 
copies, so it must reflect accurately the style and content of calculus teaching of the 
time. Just as with L’Hospital, the differential was the important idea. Loomis 
derived the formula for the differential of x” with no use of the binomial expansion, 
(d(xy)/xy = dx/x + dy/y, d(x")/x" = dx/x + +++ +dx/x, add and simplify) 
and his proof of L’Hospital’s Rule was short, simple, and clear, and also one which 
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does not appear in modern texts because it fails for certain pathological examples. 
Also, Loomis put all of his formulas in words, italicized words. After deriving the 
formula for the differential of a power of a function, he wrote 


The differential of a function affected with any exponent whatever is the 
continued product of the exponent, the function itself with its exponent 
diminished by unity, and the differential of the function. 


It was a good idea. It is also probably a good idea to do as L’Hospital and Loomis 
did and talk about differentials instead of derivatives whenever possible. Little bits 
of things are easier to understand than rates of change. It is a still better idea to 
strive for clarity and let students see what is really going on, which is what Loomis 
did, rather than putting rigor first. But nowadays authors cannot do that. They must 
protect themselves against some colleague snootily writing to the publisher, “Evi- 
dently Professor Blank is unaware that his so-called proof of L’Hospital’s Rule is 
faulty, as the following well-known example shows. I could not possibly adopt a text 
with such a Serious error.” It is a shame, and probably inevitable that calculus books 
are written for calculus teachers, but I have nevertheless concluded 


CONCLUSION #1: CALCULUS BOOKS SHOULD BE WRITTEN FOR 
STUDENTS. 


It Would be worth a try. Calculus Made Easy by Silvanus P. Thompson was quite 
successful in its time, which ran for quite a while. The second edition appeared in 
1914, and my copy was printed in 1935. It is still in print. The book has a motto: 


What one fool can do, another can. 


and a prologue: 


Considering how many fools can calculate, it is surprising that it should be 
thought either a difficult or a tedious task for any other fool to learn how to 
master the same tricks. ... 
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Being myself a remarkably stupid fellow, I have had to unteach myself the 
difficulties, and now beg to present to my fellow fools the parts that are not 
hard. Master these thoroughly, and the rest will follow. What one fool can do, 
another can. 


Chapter 1, whose title is “To Deliver You From The Preliminary Terrors” forth- 
rightly says that dx means “a little bit of x.”” Thompson did not include L’ Hospital’s 
Rule. 

Both Loomis and Thompson are like L’Hospital when it comes to giving 
applications and examples: they are all almost entirely geometrical. Loomis’s 
applications of maxima and minima are all about inscribing and circumscribing 
things, and so are all of Thompson’s except one. In fact, all three books are full of 
geometry. Thompson concluded with arc length and curvature, Loomis had invo- 
lutes and evolutes, cusps and multiple points, and lots of curve sketching. Did you 
know that the asymptote to y> = x? +x? is y= x + 1/3? I didn’t until I read 
Loomis. It is nice to know. There must have been some reason why calculus books 
for more than 200 years taught so much geometry. Mathematics may no longer be 
synonymous with geometry, but we have discarded, wrongly I think, the wisdom of 
the ages, and I have concluded 


CONCLUSION #2: CALCULUS BOOKS NEED MORE GEOMETRY. 


Before writing this essay, I examined 85 separate and distinct calculus books. I 
looked at all of their prefaces, all of their applications of maxima and minima, and 
all of their treatments of L’Hospital’s Rule. By the way, I found five different 
spellings of L’Hospital. There were the two you would expect, and Lhospital, as 
L’ Hospital sometimes spelled his name. In addition, one author, not wanting to take 
chances, had it L’Hdspital, and one thought it was Le Hospital. Why are there so 
many calculus books, and why do they keep appearing? One could be cynical and 
say that the authors are all motivated by greed. But I do not think so. I think that 
authors write new calculus books because they have observed that students do not 
learn much from the old calculus books. Therefore, prospective authors think, “if I 
write a text and do things properly, students will be able to learn.” They are wrong, 
all of them. The reason for that is 


CONCLUSION #3: CALCULUS IS HARD. 


Too hard, I think, to teach to college freshmen in the United States in the 1980s, 
but that is another topic. 

If you plot the books’ numbers of pages against their year of publication, you 
have achart in which an ominous increasing trend is clear. The 1000-page barrier, 
first pierced in 1960, has been broken more and more often as time goes on. New 
highs on the calculus-page index are made almost yearly. Where will it all end? We 
can get an indication. The magic of modern statistics packages produces the 
least-squares line: Pages = 2.94 (Year) — 5180, showing that in the middle of the 
next millennium, the average calculus book will have 2,270 pages and the longest 
one, just published, will have 3,783 pages exclusive of index. 

Why do we need 1000 pages to do what L’Hospital did in 234, Loomis in 309, 
Thompson in 301, and the text I learned calculus from, used exclusively for four 
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whole semesters, 14 semester-hours in all, in 416? There are several reasons. One, of 
course, is the large number of reviewers of prospective texts. No more can an editor 
make up his mind about the merits of a text, it has to go out to fifteen different 
people for opinions. And if one of them writes that the author has left out the 
tan(x/2) substitution in the section on techniques of integration, how can he or she 
do that, we won’t be able to integrate 3/(4 + 5sin6x), how can anyone claim to 
know calculus who can’t do that; isn’t the easiest response to include the tan(x/2) 
substitution? Of course it is, in it goes, and in goes everything else that is in every 
other 1000-page text. It is impossible to escape 


CONCLUSION #4: CALCULUS BOOKS ARE TOO LONG. 


Another reason for the length is the current mania for Applications. If you go to 
Books in Print and look in the subject index under “Calculus” what you see is 


The Usefulness of Calculus for the Behavioral, Life, and Managerial Sciences 
Essentials of Calculus for Business, Economics, Life Sciences, and Social 
Sciences 


and many, many similar titles. Now authors have to explain, with examples, what 
marginal revenue is, and consumer surplus, and what tracheae are whereas in the 
old days, all their readers knew what a cone was. A third reason is the supposed 
need to be rigorous. Now we See statements of L’Hospital’s Rule that take up half a 
page and proofs of it that go on for three pages. My 416-page calculus book never 
even mentioned L’Hospital’s Rule, and I never felt the lack. Its author never proved 
that the derivative of x* was ex°~', but I was willing to believe it. Trying to include 
everything and trying to prove everything makes for long books. Everything gets 
longer. Prefaces used to be short, a page or less. Now they are five and six pages, 
hard sells for the incredible virtues of the text that follows, full of thanks to 
reviewers, to five or six editors, to wives, to students, even to cats. 

Let me return to “applications.” There aren*t many, you know. In the 85 calculus 
books I examined, almost all of them had the Norman window problem—the 
rectangle surmounted by a semicircle, fixed perimeter, maximize the area. The 
semicircle always “‘surmounts.” This is the sole surviving use of “surmounted” in 
the English language, except for the silo, a cylinder surmounted by a hemisphere. 
Only one author had the courage to say that the window was a semicircle on top of 
a rectangle. All the books had the box made by cutting the corners out of a flat 
Sheet, all have the ladder sliding down the wall, all had the conical tank with 
changing height of water, all had the tin can with fixed surface area and maximum 
volume, all had the V-shaped trough, all had the field to fence, with or without a 
river flowing (in a dead straight line) along one side, all had the wire—usually wire, 
but sometimes string—cut into two pieces to be formed into a circle and a square, 
though some daring authors made circles and equilateral triangles. There are only 
finitely many calculus problems, and their number is very finite. 


“Applications” are so phony. Ladders do not slide down walls with the base 
moving away from the wall at a constant rate. Authors know the applications are 
phony. One book has the base of the ladder sliding away from the wall at a rate of 2 
feet per minute. At that rate, you could finish up your painting with time to spare 
and easily step off the ladder when it was a foot from the ground. Another author 
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has the old run-and-swim problem—you know, minimize the time to get somewhere 
on the other side of the river—with the person able to run 25 feet per second and 
swim 20 feet per second. That’s not bad for running (it’s a 3:31.2 mile), but it is 
super swimming, 100 yards in 15 seconds, a new world’s record by far. There are no 
conical reservoirs outside of calculus books. Real reservoirs are cylindrical, or 
perhaps rectangular. The reason for this is found in the texts: in the problems, the 
conical reservoirs usually have a leak at the bottom. Tin cans are not made to 
minimize surface area. I could give any number of examples of absurd applications 
in which businessmen “observe” the price of their product decreasing at the rate of 
$1 per month, or where the S. D. S. (remember them?) “find” that staging x 
demonstrations costs $250x°. Why will authors not be honest and say that these 
artificial problems provide valuable practice in translating from English into 
mathematics and that is all they are for? Surely they cannot disagree with 


CONCLUSION #5: FIRST-SEMESTER CALCULUS HAS NO 
APPLICATIONS. 


Before getting to my next conclusion, here is my favorite “application.” 


A cow has 90 feet of fence to make a rectangular pasture. She has the use of a 
cliff for one side. She decides to leave a 10 foot gap in the fence in case the 
grass should get greener on the other side. Find ... . 


Hardly any authors dare to do that. Calculus books are Serious. The text from 
which that problem came was titled Calculus Without Analytic Geometry and it is no 
surprise that it did not catch on. 

The existence of all those calculus books with “Applications” in their titles 
implies a market for them. There must be students out there who are being forced to 
undergo a semester of calculus before they can complete their major in botany and 
take over the family flower shop. I cannot believe that any more than a tiny fraction 
of them will ever see a derivative again, or need one. Calculus is a splendid screen 
for screening out dummies, but it also screens out perfectly intelligent people who 
find it difficult to deal with quantities. I don’t know about you, but I long ago 
concluded 


CONCLUSION #6: NOT EVERYONE NEEDS TO LEARN CALCULUS. 


The book by Simmons is a fine one. It was written with care and intelligence. It 
has good problems, and the historical material is almost a course in the history of 
mathematics. It is nicely printed, well bound, and expensive. Future historians of 
mathematics will look back on it and say, “Yes, that is an excellent example of a 
late twentieth-century calculus book.” This leads to my last conclusion 


CONCLUSION #7: THAT'S ENOUGH ABOUT CALCULUS BOOKS. 
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Geometry I. By Marcel Berger. Translated from the French by M. Cole and S. 
Levy. Springer-Verlag, New York, Berlin, Heidelberg, 1987, xiii + 427 pp. 


Geometry IT. By Marcel Berger. Translated from the French by M. Cole and S. 
Levy. Springer-Verlag, New York, Berlin, Heidelberg, 1987, x + 405 pp. 


F. A. SHERK 
Department of Mathematics, University of Toronto, Toronto, Ontario, Canada M5S 1A1 


It would be an interesting study to follow the fortunes of geometry as a discipline 
within the broader framework of all mathematics. There was a time when geometry 
was dominant: every person with pretensions to being educated had to know 
geometry, and Euclid was king. This was true also in the mathematical world, where 
great attention was paid to Euclid. Attempts over many centuries to prove the 
Parallel Postulate led in the nineteenth century to the development of non-Euclidean 
geometry, while at about the same time a closer, more sophisticated critique of the 
foundations of Euclidean geometry laid the basis for the development of projective 
and affine geometry. On the latter, all the geometries of relevance to 
physics— Riemannian, Minkowski, and the like—are based. Spherical and inversive 
geometries were also developed, the result of all this activity being that geometry, if 
not supreme, was at least still a coruler in mathematics. 

But as the present century proceeded, interest in geometry fell off; it began no 
longer to be classified in the “mainstream” of mathematics. Today, in North 
America at least, an undergraduate student majoring in mathematics will be lucky 
to have even one geometry course included in his or her program. High school has 
provided the student with the rudiments of Euclidean geometry—synthetic and 
analytic—and there may be some offerings of differential geometry in college. But it 
is rare to find other college undergraduate geometry courses, and still more rare to 
find substantial enrollments in those that do exist. The geometer on the staff, if 
indeed there is one, is swamped by analysts, topologists, and algebraists. 

Yet we optimistic geometers sense a revival of interest in the subject. Certainly an 
impressive amount of geometry has been produced in the twentieth century, and 
continues to appear. Work on geometric transformations, for example, begun a 
century ago, has been developed beyond the wildest expectations of Klein’s 
“Erlangen Programme,” and well beyond the bounds that it intrinsically set. Not 
only has this given impetus to the abstract study of groups, by providing examples 
of many of the most important, but it has at times shown the way to better and 
shorter proofs of purely group-theoretic theorems. For example, the proof that the 
group |p, q]*= (R, S|R? = S4 = (RS)? = 1) is finite if and only if (p — 2)(q — 2) 
< 4(p, q being positive integers) is long and tedious if dealt with abstractly. But 
Coxeter gave a fine proof in a few lines, by identifying [ p, qg]* with the rotational 
symmetry group of the regular tessellation { p, g}. Again, Coxeter’s important work 
on groups generated by reflections has not only given clarity and precision to 
polytope theory, but it has provided a classification of an important family of 
discrete groups and given an excellent notation for them (the “Coxeter-Dynkin 
diagram’). 

The study of non-Desarguesian geometry, almost entirely a twentieth-century 
subject, is very interesting for its own sake. But it has also made many contributions 
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to the better understanding of nonassociative algebra, group theory, and combina- 
torial theory. 

While the geometry studied today is often generalized to the point that it is based 
on any field (or even on a more general algebraic system, as in the case of 
non-Desarguesian geometry) there is still much to do in geometry over the real field. 
Convexity problems, extremal problems (such as close-packing of spheres), and 
tilings come readily to mind as examples. And, of course, there are many aspects of 
classical differential geometry that still need exploration. In all of these problems, 
modern analysis and topology become indispensable tools, both aiding the develop- 
ment of geometry and in turn being enhanced and clarified by their application to 
geometry. 


One reason for optimism on the future of geometry is the appearance of good 
expository books on the subject. The two-volume work under review here is one of 
these. To quote from the Introduction, the author has three main objects in view: 


—to emphasize the visual, or “artistic,” aspect of geometry, by using figures in 
abundance; 

—to accompany each new notion with as interesting a result as possible, 
preferably one with a simple statement but a non-obvious proof; 

—finally, to show that this simple-looking mathematics does not belong in a 
museum, that it is an everyday tool in advanced mathematical research, and 
that occasionally one encounters unsolved problems at even the most 
elementary level. 


In pursuing these objectives, the author naturally confines his attention to geometry 
over a field, with heavy emphasis on the field of real numbers. In that sense, the 
book deals with classical geometry, although many of the results dealt with are quite 
recent. It is a book which can be read by anybody with a college mathematical 
background, and could be used as a text for a number of different geometry courses. 
To the extent that that is possible, it need not be read in order from first chapter to 
last; rather, one can choose portions from various places in the book, with a 
reasonable chance of understanding the material. Some of the topics dealt with in 
the twenty chapters are geometric transformations and tilings, projective, affine, and 
Euclidean spaces, convex sets, polytopes, quadratic forms and orthogonality, spheres, 
elliptic and hyperbolic geometry. 

There are many features of the content and style that are admirable, and one or 
two that are not. The selection of material—it being impossible to include every- 
thing—is consistent with the objectives quoted above, thus adding real excitement 
to the content without unduly skewing its systematic theoretical development. 
Definitfons are immediately followed by several examples, and there are many 
exercises given to close off each topic. Each volume has an extensive bibliography, 
the contents of which are constantly referred to in the text. This feature, of course, 
has dual advantages of putting the topic under discussion into a historical context, 
and providing the serious student with easy access to further research. 

But the most admirable feature of all, in this reviewer’s opinion, is the astounding 
number of illustrations. These range from simple diagrams, put in exactly the right 
place relative to the text, to reproductions of art, pictures of mechanical gadgets, 
charts and maps. The ratio of illustrations to total pages of text is in fact slightly 
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greater than one. If this signals a trend to the return of pictures to help in the 
understanding of abstruse mathematical points, we should welcome it with becom- 
ing enthusiasm. 

On the negative side, the literary style of the book leaves much to be desired. 
Particularly noticeable is the rather annoying occasional insertion of colloquial, 
folksy expressions (“If you don’t know any exterior algebra...” ; ““The moral of the 
Story is...”; “Here’s a simpler way...’’). This may, of course, be an indiscretion of 
the translators rather than the author. In any case, some mathematicians regard it as 
desirable to communicate in this light-hearted way; given that mathematics is 
serious business, requiring much concentration, it is not very appropriate in this 
reviewer's opinion. It is bad enough to have this style common in the classroom 
today; perhaps we should try to use something more appropriate in our serious 
textbooks. 

The good features of the book, however, far outweigh the questionable. The work 
deserves a long run of popularity. And it will certainly help, rather than impede, the 
fortunes of geometry in its long trail back to the recognition it so richly deserves. 


Pi and the AGM. By Jonathan M. Borwein and Peter B. Borwein. John Wiley & 
Sons, Inc., New York, Toronto, 1986, xv + 414 pp. 


RICHARD ASKEY 
Department of Mathematics, University of Wisconsin, Madison, WI 53706 


The Mathematical Association of America, The American Mathematical Society 
and the Society for Industrial and Applied Mathematics have joined to try to get 
better media coverage of mathematics. One result of this is stories that say that 
computers have calculated 10 million, or now 130 million digits of 7. When asked to 
comment on this by friends, most mathematicians wince and say they have no idea 
why anyone would want to do these calculations. Privately, they start to have 
serious doubts about more publicity for mathematics. If the computation of 7 to 
millions of places is to be the image of a mathematician, then it might be better to 
keep mathematics out of the newspaper. However, ten million is so much larger then 
the 707 digits that was claimed to be known when I first heard of this problem that 
something interesting must lie behind these calculations. Part of the improvement 
comes from larger and faster machines, but much more comes from increased 
mathematical knowledge. Part of this is specific to certain numbers, and part 1s 
general and of wide applicability. All of it is very interesting. 

Calculations of 7 (really approximations to 7, but I will be short and sloppy) 
really started with Archimedes. He used inscribed and circumscribed regular 6 - 2”- 
gons and was able to show that 332 < @ < 35 from n = 4. By the year 1600 or so 
this method had led to 34 digits. The next major advance was the arctangent series. 
While this was first discovered in India it was Gregory’s work, which was used by 
Sharp under Halley’s direction to obtain 71 digits. Machin used 


~ = 4tan~} tan~? 

4 0 \5}. «239 
to get 100 digits in 1706. W. Shanks eventually extended this type of calculation to 
707 digits, 526 of which were correct. This calculation took years and his major 
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publication was in 1853. Electronic computers were used to reach one million digits 
in 1973, and the series used was similar to Machin’s formula. In 1976 the mathemat- 
ics used jumped to late 18th- and early 19th-century mathematics with the indepen- 
dent discovery by Salamin and Brent that the arithmetic-geometric mean algorithm 
of Lagrange and Gauss could be combined with an identity of Legendre to lead to a 
quadratically convergent algorithm. 

Archimedes’ algorithm can be stated as 


2a,b, 5 
a+ b ? Dad — (a,,41b,)°” (A) 


n n 


Gn+1 — 
a) = 2V3, by = 3. 
The arithmetic-geometric mean algorithm is 


a, +b 
n n 2 
any. = 9) ’ Ba+t — (a,,b,)'” . (G) 


If you have never seen algorithm (G) take out your pocket calculator and try both of 
these algorithms for a, = 1, by = 2. The results for n = 3 should be a real surprise. 
They were for Gauss, and he was led to an integral representation for the limit in 
(G), and from this to elliptic functions. Lagrange had the integral first and 
discovered the arithmetic mean as a method of evaluating this integral. Since this 
solved the question he had, he did not push it further. This is an important point to 
remember, living as we do in a period when mathematicians are being encouraged to 
solve practical problems. Lagrange solved a practical problem, and quit after solving 
it. Gauss was playing around with numbers and recognized a number that arose as 
one he had calculated by another method when solving a practical problem. This 
problem was to evaluate {(7/*(a’cos*@ + b*sin?@)~1/* d@. Gauss was struck by the 
surprising result that there was a way of representing the limit of (G) in terms of 
this integral, and by the incredible rapidity of the convergence of (G), and so 
examined his discovery in more detail. What he then discovered was much more 
important than the solution of how to evaluate the above integral in a rapid way. It 
was elliptic functions, as opposed to elliptic integrals, and he also discovered theta 
functions and the start of modular forms. (See the expository paper by Cox [1] for 
more on this.) I am not suggesting we not solve practical problems, but spend all 
our time experimenting with numbers; however, some time spent playing games is 
appropriate. Also, do not quit looking at a discovery after solving a problem. This 
discovery may have done much more than just solve the problem at hand. 

In the fall of 1985 a contemporary computer wizard, Bill Gosper, decided to get 
in the competition for the largest calculation of 7. He had two reasons. One was a 
desire to change the problem. The natural, invariant way to represent a real number 
is by a continued fraction rather than by a decimal or binary expansion. He hoped 
to convince others of this. The second was a request from his employers at 
Symbolics that he show that their relatively small LISP machine, a general computer 
that was not especially built for numerical work, could be used to compete with 
giant machines in the area for which large machines were designed—large scale 
numerical computations. Gosper was successful in getting 17 million digits, a record 
for a short while, and also the corresponding continued fraction. As frequently 
happens, he discovered a subtle design error in the computer. This is one reason 
massive calculations that can be checked are important. 
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The people using large machines were using nineteenth-century mathematics. 
Gosper introduced twentieth-century mathematics when he used the following result 
of Semanwyan: 


(4) (3) (4), 1 yer 
— = 2/2 x aay ay (1103 + 263900) | 55 5| (R) 
nao (1) nd) an 
with (a), = a + 1)::-(at+n-—- 1), n=1,2,...,(a@) 9 = 1. Gosper asked if I 
knew how to prove (R), and I had to admit I did not. Ramanujan had not given a 
proof. While the cortvergence in (R) is linear, it is so rapid that the crossover point 
when the quadratic methods become better is surprisingly high. Gosper was unsuc- 
cessful in getting others to compute continued fractions, so while 132 million digits 
of a are now available, the largest continued fraction calculation is still Gosper’s. 

In the nineteenth-century calculations of 7 the arithmetic was done without any 
sophisticated methods. It is now necessary to be very sophisticated and use 
something like the fast Fourier transform to do multiplication. 

If this subject begitis to sound more interesting than it did in the last newspaper 
article on 130 million digits of 7, I have partly succeeded. To succeed completely I 
will have gotten you interested enough to read the delightful and important book by 
the Borweins with the strange title given above. Start with the last chapter, which 
gives a history of the calculations of 7, then includes some exercises that give the 
irrationality of e and a and also their transcendence. Enough hints are given to 
make it possible for many people to carry out the details. Finally, irrationality 
measures for 7, {(2) = 77/6, and (3) are given using Beukers’ version of Apéry’s 
proof of the irrationality of §(3). Then start at the beginning and work your way 
through a portion of such topics as the arithmetic-geometric mean, elliptic integrals, 
theta functions, elliptic modular functions, modular equations, hypergeometric 
functions, computational complexity, the fast Fourier transform, other means and 
their iteration, lattice point sums, and other topics. As one might expect there are 
more than twice as many references to Ramanujan than to anyone else, since 
Ramanujan’s work on modular equations was significantly deeper than that of 
others. It is fitting in the year of Ramanujan’s centenary to realize that he was really 
a very modern mathematician. Surprisingly, the work of his that is mentioned most 
in this book comes from a paper he wrote in India before coming to England. The 
paper was rewritten after Ramanujan arrived in England so the appropriate refer- 
ences could be added, but Hardy wrote that all the work had been done while 
Ramanujan was in India. We are only now realizing what a remarkable paper this is. 

If you are curious about how to prove (R), an outline is given in Chapter 5. Also, 
the next time you read a newspaper story about mathematics, read it the way 
political news needs to be read in many parts of the world, that is, read between the 
lines to get the story that is really there. It is probably very interesting, unlike the 
story that appeared in the paper. The problem of educating a few reporters so that 
they appreciate and understand some mathematics can probably be solved, but the 
problem of educating the editors to realize that not only does mathematics matter, 
but they should treat it seriously is much harder. However, an existence theorem 
exists at the Los Angeles Times, both for a reporter and an editor. 
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A New Approach to Bernoulli Polynomials 


D. H. LEHMER 


Dr. Lehmer needs no introduction to readers of the MONTHLY. It is a 
pleasure to have a contribution from him in these pages. Ed. 


Abstract. Beginning with Jacob Bernoulli’s discovery before 1705 of the polynomials that bear his name, 
there have been five approaches to the theory of Bernoulli polynomials. These can be associated with the 
names of Bernoulli [1}, Euler [2], Lucas [3], Appell [4], and Hiirwitz [5]. Each mathematician chose to 
define the Bernoulli polynomials in a different way, and from his definition derived as theorems one or more 
of the four other definitions. The present article introduces a sixth definition from which the other five are 
derived. 


Introduction. For the convenience of the reader the first seven Bernoulli poly- 
nomials are listed below. 


B(x) =1 
1 
B(x)=x- 5 
1 
B(x) =x*—x+% 
3 x 
B,(x) =x? - ze t 5 
4 3,2! 
B,(x) =x*- 2x? +x — 39 
; Ox" 5x? x 
B(x) = x —z tz 7<= 
xt x? 
B(x) =x 3x + S--> +e 


A good account of the formulas involving B(x) together with a table of their 
coefficients up to n = 15 will be found in Abramowitz and Stegun [6]. Excellent 
graphs of B(x) on the unit interval are given in Davis [7]. 

Five different approaches to the theory of Bernoulli polynomials have been 
made. 

Bernoulli [1] (1705?): Sums of powers of the first natural numbers 


m~-1 


Yk! = —(B,(m) ~ B,(0)). (1 


k=0 


905 


906 D. H. LEHMER [December 


Euler [2] (1738): Generating Functions 


0 B(x)t" te*" 
——_—_— = —— t| < 27. 2 
Lucas [3] (1891): Umbral Calculus 
B,(x) = (B+ x)". (3) 


Appell [4] (1832): Appell Sequences 


1 
B,, (x) =~“ n(X). (4) 


Hiirwitz [5] (1890): Fourier Series 


—n! (oe) 
B,(x)=——aGee "eM (0 <x <1). (5) 
(27) k=~— oo 
In 1851 Raabe [8] discovered the multiplication theorem 
1“! k 
— >) B, x+— = m™~"B (mx). (6) 
M = 


It is this identity that we have chosen for a new approach. We begin with two 
lemmas. 


LEMMA 1 (existence). Let n and m be positive integers. Then there are polynomials 
of degree n in x satisfying the functional equation 


1 m—1 


m cay! 


Proof. If m =1, then (7) becomes f(x) = f(x), so that we can assume that 
m > 1. Let us substitute 


P(x) = box” + bx" 1 +--+ +b, (b, # 0) (8) 


=m "f(mx). (7) 


k 
x+— 
m 


into (7), where the b’s are indeterminate coefficients. If f is any solution of (7) so is 
cf, for any constant c. Hence we may choose b, = 1. 
If we substitute (8) into the left-hand side of (7) we get 


1 m—l1 
LS a(x A) = Sa, Ferm" "im, 
M K=0 v=0 =0 
where 
m~-1 
S,(m) = » k* 
k=0 
Collecting the coefficients of x”~” we have 
1 m—1 k n 7 n n-»? 
m LP, x + m 7 dx" r 8(* — ,} S,— »(m) (9) 
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Substituting P(x) on the nght-hand side of (7) we obtain 


m~"P (mx) = ), x"~"bm™. (10) 
r=0 


Identifying the coefficients of x”~” in (9) and (10) we see that the b’s are subject 
to the condition 


(m’— 1)8, = ~ 5 6( = $)m »~1S,_,(m). (11) 


In particular when r = 1, since S,(m) = m(m — 1)/2, we have 
b, = —n/2. 
In general if 
b,, b5,..-, 5.4 


have been determined, (11) serves to determine b, because m > 1. This proves the 
lemma. 


LEMMA 2 (uniqueness). For a given integer n there is only one monic polynomial of 
degree n satisfying 
J mal 


—~ Lf 


k=0 


k 
xt — 


=m "f(mx)  (m> 1). (12) 


Proof. If possible, let P,(x) and Q,(x) be two different monic polynomials of 
degree n satisfying (12). Suppose that 


P(x) _ Q(x) = A, (x) = A,x4 + A,x47} +... 
where d < n and A, # 0. Subtracting in (12) we get 


Identifying the coefficients of x“ on both sides we have 
A, = m*""Ao, 


But this contradicts our stipulations that A, # 0, d < n and m > 1. This proves the 
lemma. 


We make the following definition: 
DEFINITION. The nth Bernoulli polynomjal B,(x) is defined as the unique monic 
polynomial of degree n which satisfies, for all integers m > 0, 


] mal 


— > B 


M (20 


k 
x + —] =m~"B (mx). (13) 


THEOREM 1. The Bernoulli polynomials form an Appell set: 


By a(x) == B(x) (n> 0) (14) 
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Proof. If we differentiate the defining relation (13) we get 
J mal 


— ) B, 


M 20 


= m~"*1B’ (mx). 


k 
x+— 
m 


This exhibits the monic polynomial (1/n)B/(x) as satisfying the functional equa- 
tion reserved for B,_,(x). The theorem follows from Lemma 2. 


THEOREM 2. Let B, = B(0) be the nth Bernoulli number. Then B,(x) = (B + x)", 
where, after expanding this binomial, the exponents on B are degraded to subscripts. 


Proof. If we continue to differentiate (14) we get 


[£00 = m1( 2), (0. (15) 


By the Maclaurin expansion of B(x) 


- nm xk? g k 
B = ——|{—| B(0 
= F(z] 
-y xT) By a4 =(B+x)". 
k=0 
This proves (3). 
THEOREM 3. If n > 0 
B(x +1) — B(x) = nx"™?. (16) 


Proof. The theorem is true for n = 0. Suppose the theorem is true for n = k so 
that 


B,(x +1) — B(x) = kx*®7}. (17) 
By (14) 
(K+ 1) f Bat) dt = Besi() ~ Bes(0). 
Integrating (17) we get 
1 * k-1 k 
ay (Beri +) ~ Beare) = kf dt = x". 


So the theorem is true for n = k + 1 and, therefore, it is true in general. The 
identity (1) now follows by summing both sides of (16) over the integers x from 0 to 
m—1., 


THEOREM 4. If n > 0, 


—n! 00 ermirx 
B(x) =—— y 7 (r # 0, 0<x<1). (18) 
(277) pero 7 
Proof. For n = 1 the theorem becomes 
1 —] 1 —-1 & sin2anx 


_ 7 27irx 


x- 5 = VY etre yp 
T 2] n 
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This is a well-known fact about Fourier series whose proof is sketched as follows: 
Let 27x =t +a so that x — 1/2 =t/27. When x ranges from 0 to 1 then f 
ranges from —7 to 7. Then the identity 


(o,@) 
f(t) =a,+ > (a, cos,, + 6, sin nt), 
n=] 


where 
1 pa 
a, = —f f(t )cos nt dt 
T ¢ —a 
and 
1 pa 
b= —f f(t)sin nt dt 
WT! —a 
becomes when f(t) = t/27 


n 


1 7 
a= 543 J tos nt dt = 0 


lo opr n-l 
b, = ae sin ntdt = (—1)" “/nz. 


Thus, 


(-1)"7- —1 & sin2anx 
sin nt = — oo 
vil 


n=] n 


If we denote the right-hand side of (18) by ¢,(x), then 
ld 
n Fa onl) _ ,—1(x). 


Therefore, since 


1 a" 1 
Tr gunat on(X) = (4) =x - 5, 


$, (x) is a monic polynomial of the nth degree. Now 


1 m~-l1 —y! e2mirx m—1 
__ ») o x + = —-— y _ > e2tirk/rn 
M Koo m(2mi)° ao 0" k=0 


The inner sum is m when r is a multiple of m and 0 otherwise. Thus 


1 ™=1 k —n! 1 
__ > oy[ + | = —-—— y ne27ixmh 
mM K=0 m m(2mi)” yao (mh) 


= m~"$, (mx). 
By the uniqueness Lemma 2 
$n(x) = B,(x). 
This proves (5) and Theorem 4. Finally we prove (2). 
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THEOREM 5. 
y B(x)t"/n! = te*/(e' — 1). 
n=(0 
Proof. Let us expand the function 
F(x, t) = te*‘/(e’ — 1) 
in powers of x and ¢ and collect the coefficients of t”/n! as a polynomial— V(x) 
of degree n in x: 


F(x,t) = y WV (x)t"/nt. (19) 


Suppose 
W(x) = AQP x” + AVYXM TE + oes FAM, 
If we replace x by 1/y and t-by ty in (19) we get 
(o@) 
F(1/y, ty) = tye'/(e” —1) = Ly", (1/y)e"/nt. 
n=0 


Letting y tend to zero we obtain 
(o@) 


ef’ = SAME" /n! 


n=Q 


Hence, A\” = 1 and, hence, V,,(x) is monic. 
If in (19) we replace x by x + k/m and sum over k and divide the result by m 


we get 


> > W(x + k/m)t"/n! Fs + k/m, t) 


iM 


tex! m1 (t/m)e* 
kt/m _ 
y* ef om (20) 


If, instead, we replace in (19) x by mx and t by t/m we obtain 


o 4] 
F(mx,t/m) = ¥ —n¥(mx)t"/n! = (t/m)e**/(e'/™ — 1). (21) 
n=o 
Identifying coefficients of ¢”/n! in (21) and (20) we conclude that V,(x) satisfies 


the functional equation 
] m-l 


—)yy, 


M 420 


Since WV, (x) 1s monic we have V,(x) = B,(x) and (2) is proved. 
There are more identities involving Bernoulli polynomials which may be derived 
from the above. For example, the integral equation 


AG. + t) dt = x” 
0 


k 
x+ — =m "¥ (mx). 


follows immediately from (6) by letting m tend to infinity. 
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The functional equation 
B,(1 — x) = (-1)°B,(x) 


is a Simple consequence of (5). 
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A ddenda, errata, etcetera for 1988 


1. The Note by Yoram Sagher, What Pythagoras knew, in the February, 1988 issue, has drawn several 
comments to the effect that the points made in the Note had been made before. Professor Nathan J. 
Fine, in particular, called our attention to his earlier Note in Math. Mag. 49, Nov.-Dec. 1976, 249, where 
virtually the same observations were made. Dr. Fine stated there that Dedekind had already published 
the idea in 1901. 


2. Concerning his article Angle trisection, the heptagon, and the triskaidecagon, March, 1988, pp. 
185-194, Dr. Andrew Gleason advises us that the following references, all authored by T. I. Zamfirescu, 
should be mentioned: 


(a) Geometric constructions with ruler, compass, and trisector (Romanian), Stud. Cerc. Mat., 15 
(1964) 405-411. 

(b) Constructibility with ruler, compass, and trisector (Romanian), Gaz. Mat. Ser. A, 70 (1965) 
204-213. 


(c) Constructions with ruler, compass, and trisector (Romanian), Gaz. Mat. Ser. A, 71 (1966) 9-18. 


The first of the above papers contains a proof of the theorem that describes the integers n for which a 
regular nm-gon can be constructed, using ruler, compass, and trisector. 


3. Dr. Doug Hensley, of Texas A & M, has noted an error in On continued fractions and a certain example 
of a sequence of continuous functions, by J. Fabrykowski, on page 539 of the June-July, 1988 issue. 
Dr. Hensley observes that if 0 < x < 1 is irrational then x has convergents. These are rationals in [0, 1], 
so they must occur among the 7. For this infinite sequence of values of j one cannot assert (as claimed in 
the paper) that |x —r|>1 /(207). The conclusion is true, nonetheless. 


4. Dr. Paul Kumpel, of SUNY Stony Brook, points out that R. Dybvik’s letter (95, p. 533) should read 
‘OFPD is a rhombus...’ instead of ‘OFPD is a parallelogram... .’ 


Finding Saddlepoints of Two-Person, Zero Sum Games 


DONNA CRYSTAL LLEWELLYN! AND CRAIG TOVEY?, 
School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta 


MICHAEL TRICK’, Carnegie-Mellon University 


DONNA CRYSTAL LLEWELLYN: During my undergraduate years at Swarth- 
more College, I became interested in combinatorics and graph theory, and 
their applications. Following these interests, I went to Stanford Univesity 
and then Cornell University, receiving my doctorate in operations research in 
1984 under the direction of L. E. Trotter, Jr. I had the honor of being 
awarded a National Science Foundation Mathematical Sciences Postdoctor- 
ate Fellowship, which I used to pursue my research interests at the Univer- 
sity of Bonn, West Germany. Currently, I am an assistant professor at 
Georgia Institute of Technology in the Department of Industrial and Sys- 
tems Engineering. 


CRAIG Tovey: As an undergraduate in applied mathematics at Harvard 
College, I wrote a thesis under the direction of Christos Papadimitriou. Since 
then my principal research interests have remained in the area of design and 
analysis of algorithms, particularly probabilistic analysis of combinatorial 
algorithms and computational complexity. I received my Ph.D. in operations 
research from Stanford University in 1981, supervised by George Dantzig. 
Since then I have taught and done research at Georgia Tech, where I am now 
an associate professor. 


MICHAEL TRICK: I received my Ph.D. in Industrial and Systems Engineering 
from Georgia Tech in 1987. I spent the 1987-88 academic year at the 
Institute for Mathematics and Its Applications in Minneapolis as a postdoc- 
toral fellow. My research interests include integer programming, design and 
analysis of algorithms, and applications of operations research in the biologi- 
cal and social sciences. I am now affiliated with the Graduate School of 
Industrial Administration, Carnegie-Mellon University, Pittsburgh. 


1. Introduction. A two-player, zero sum game, as defined by von Neumann and 
Morgenstern in their classical work [13], is completely specified by its payoff matrix. 
Let player I have m possible “moves” or “plays” (called pure strategies) and player 
II have n pure strategies. Suppose that player I pays player II a, j if player I chooses 
his ith strategy and player II chooses her jth. Then the payoff matrix of the game is 
the m Xn matrix A = [a;|]. 


‘Supported by NSF Postdoctoral Fellowship DMS84-14104 
*Supported by NSF Grant ECS-8451032 
‘Supported by ONR Grant N00014-86-K-0173 
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A saddlepoint (SP) of a matrix A is an entry a 
in row i and the smallest in column /; that is 


;j, Where a;, is the largest element 


a;,; >a, forallk 


a,,;<a,; forall k. 


If both inequalities are strict, then we call a;, a strict saddlepoint (strict SP). 

If there exists a strict saddlepoint of the payoff matrix, then it is the unique 
optimum solution to the game. A saddlepoint of the payoff matrix (if it exists) 
corresponds to a (not necessarily unique) optimum solution. Notice that if all of the 
entries of the payoff matrix are distinct, then a saddlepoint is a strict saddlepoint. 
(In general, finding the optimal solution vectors for the two players is equivalent to 
solving an m X n linear programming problem [3].) 

While these properties of saddlepoints are well known, it appears that the 
question of how to find these solutions efficiently has not been carefully studied. 
For instance, in [4] it is stated that “a [saddlepoint] is easy to find, even in a very 
large matrix.” Other standard texts, such as [1], [2], [5]—[10], [12] and [14]-[17] also 
mention that saddlepoints are easy to locate but fail to discuss finding an efficient 
algorithm. 

The purpose of this article is to show that a strict saddlepoint, when it exists, can 
be found very quickly. In fact, we give an algorithm that requires looking at only 
= n°>8m entries of the payoff matrix (where m > n). Hence as the number of 
strategies increases, the proportion of the payoff matrix examined approaches zero. 
This algorithm also determines when a matrix has no strict SP. Further, we will 
prove that finding a (nonstrict) saddlepoint requires examination of the entire 
payoff matrix. 

Given a matrix, we refer to the problem of finding a (strict) saddlepoint or 
determining that one does not exist as the (strict) SP problem. For ease of notation, 
we will denote log,x by lg x. 

We distinguish between two different types of computations. One kind is the 
ordinary arithmetic operation (i.e., comparison, addition, multiplication and divi- 
sion by integers). The other kind is the identification of the value of a matrix entry; 
this might involve a lengthy function evaluation. Therefore, we adopt the following 
model of computation. Assume that each matrix query contributes a units to the 
running time and each arithmetic operation requires 8 time; where a and £ are 
some constants. 


II. Finding Strict Saddlepoints. We motivate our algorithm for finding a strict 
saddlepoint with a couple of observations. 
Observation 1. If two entries of A, a;, and_a,, are known, then at least one of them 
can be eliminated as a possible strict SP with a single matrix look-up and a constant 
number of arithmetic operations. 


Proof. Clearly this follows if either i = k or 7 = /. Hence assume that neither of 
these holds. Without loss of generality let a;; < a,,. Look up a,,. Clearly, if a;; is a 
strict SP then a,; > a;,. Further, if a,, is a strict SP then aj, > a,,. Since a;; < ay), 
it is clear we can not have a,,; > a,, > a,,. Hence, knowing a,, will eliminate either 
a;, Or a,, Or both from consideration. 


14 D. C. LLEWELLYN, C. TOVEY, AND M. TRICK [December 


Notice that in the proof of observation 1 it is assumed that a,, < a,;. Hence, the 
entry a,, cannot be a strict SP since a,;; < a,, implies that it is impossible to have 
a;; > a,; > @,,. This leads to our second observation. 


Observation 2. If the diagonal terms of a square matrix, A, are known and are such 
that a,, < a,,< -:: <a,,, then no term below the diagonal can be a strict SP. 


nn? 


Note that to apply observation 2 we will first have to rearrange the rows and 
columns of A so that the diagonal is ordered (which will not affect the existence or 
nonexistence of a strict SP). Our algorithm uses observations 1 and 2 together with 
proper manipulation of rearrangement pointers. The idea of the algorithm is 
illustrated in FIGURES 1 and 2. 


WN 
> 


> Ra 
N 


The main diagonal is looked up and then sorted to eliminate the subdiagonal 
portion of the matrix from consideration. The remaining region (i.e., the upper 
triangle) is divided into two triangles and one square which are studied recursively 
(see FIGURE 1 for the first step of this recursion and FIGURE 2 for the next two 
steps). These pieces are then put back together using observation 1. In the diagrams, 
the shaded regions are the parts of the matrix that have been ruled out by our 
observations and hence do not have to be searched. 

We now state the algorithm precisely. 


FIG. 1 FIG. 2 


Procedure Square 
Input: A, n, r(1),..., r(n), cQ),..., c(n) 
Output: candidate 


This procedure takes as input a matrix A, and a subset of n of its rows and a subset 
of n of its columns. These rows and columns define an n X n submatrix A of A. 
The procedure then returns candidate, a location in A which might be a strict SP of 
A. No other element of A is a strict SP of A. 


Step 0: If n = 1, return candidate = (r(1), c(1)) 
Step 1: Look up and sort A(r(i), c(i)), i= 1,..., m in ascending order of size. 
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Suppose o is the permutation that gives this sorting, 1.e., suppose that the set 
{ A(r(o(i))), c(o(i))}7_] 1S an increasing sequence. 


Set 
r(i) = r(o(i)) 
c(i) =c(o(i)) fori=1,...,n. 


Step 2: Call Triangle (A, n, r(1),..., r(”), c(1),..., c(n); candidate) 
Step 3: If candidate = @, return @. Otherwise, take o~' (candidate) and return it. 


Procedure Triangle 
Input: A,n,r(1),..., r(n), c(1),..., c(”) 
Output: candidate 


This procedure takes as input a matrix A and a subset of of its rows and a subset 
of n of its columns. These rows and columns define an upper triangle submatrix T 
of A. The procedure then returns a location in JT which might be a strict SP of A. 
No other element of 7 1s a strict SP of A. 


Step 0: If n = 1, return candidate = (r(1), c(1)) 

Step 1: Call Triangle (4,|4], r(),..., 7d ¢]), c(),..., cd 4]); candidatel1) 

Step 2: Call Square (A4,[4], 7(1),..., 73), c($),..., c(v); candidate2) 

Step 3: Call Triangle (4,|4]), 7(. + [$],..., r(7), ce + [4)),..., e(); candidate3) 
Step 4: Using Observation 1, discard at least two of the candidates. If candidatei 
remains (i = 1, 2 or 3), let candidate equal candidatei. Otherwise, candidate = ©. 
Step 5: Return candidate. 


Program Strict Saddlepoint 

This program solves the strict saddlepoint problem on an n X n matrix A. 

Step 1: Call Square (n,1,...,7,1,..., n; candidate) 

Step 2: If candidate # @, check to see if candidate is a strict SP; if so, then report 
the strict SP. Otherwise, then conclude there does not exist a strict SP. 


PROPOSITION. Program Strict Saddlepoint solves the strict saddlepoint problem for 
a square matrix. 


Proof. Observation 2 implies that candidatel, candidate2, and candidate3 are the 
only possible positions of a strict saddlepoint in the upper triangle matrix spanned 
by the row and column indices which. are passed to Procedure Triangle. Let 
candidate be the output from Procedure Square. Then by the above remark and 
Step 3 of Procedure Square, this position is the only possible position in the square 
matrix defined by the indices passed to this procedure by the central program. Thus 
all that is left for the Program Saddlepoint to do is to check if this position is indeed 
a Strict saddlepoint; which it does in Step 2. 


THEOREM. Program Strict Saddlepoint terminates in time O(n'®°). 


Proof. Let F(n) be the time required to run Procedure Square 
(n, r(1),..., 7(”), c(1),..., c(n); candidate) and let G(n) be the analogous time for 
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procedure Triangle (n, r(1),..., r(n), c(1),..., c(m); candidate). Hence, the work 
required to run Program Saddlepoint is F(n) + O( Bn); (due to the last check 
required to see if candidate is really a strict SP in step 2 of Program Saddlepoint). 
Notice that G(n) = F(n) — c,Bn log( Bn) for some constant, c,, because the only 
difference in the two algorithms is that the sorting step of Procedure Square does 
not occur in Procedure Triangle. By analyzing the steps of Program Saddlepoint, 


F(n) = c,Bnlg( Bn) + F(5] + 26 (5) + c,B + c,Bn, 


where the c,8n term is the cost of passing the row and column pointers and c, is 
the cost of making the final decision between candidates; that is, step 4 of the 
triangle procedure (using observation 1). 

So, 
n 


c, Bn |g( Bn) + 3F| 5 | + ¢,B + c3Bn — 2eb(>)ie{ S") 


F(n) 5 


1 
eyngn + 3F(5) + eB + 6(Bn) ~ ex(Bn)la(Bn) + cx( Brel = 


n 
= 3F[ 5) + ca +(e ~ a)(Bn). 
Iterating this expression yields 
Ign—1 3 i 
F(n) = 38°F) + [dep + (3) (- «)(4)} 
i=0 


But, F(1) = a, so after simplifying this geometric series we get 


F(n) = s(a + 2B | + (3) ~ ¢)(Bn) - [eb + Uc, - es)Bn | 


= a( a + ~exp + =(¢ ~ cB — [eab + 2(¢; - cs)Bn), 
So, 
F(n) = 0(3!8"). 
But, 32” = n'83, and hence 
F(n) = O(n'83). 
Hence, by the remark at the beginning of the proof, the Program Strict Saddlepoint 
runs in time O(n'8?) as claimed. 


Note that this analysis shows that the program saddlepoint requires both order 
n'°8 matrix look-ups and order n'°* total running time. That is, even if a matrix 
look-up takes significantly more time than an arithmetic operation, that is, a > B, 
the algorithm will remain O(n'>*). 

The reader familiar with fractals will note that the unshaded region in FIGURE 2, 


corresponding to the portion of the matrix that must be searched, is an illustration 
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of a fractal of dimension lg 3; and hence has area n'®°. This provides a geometrically 
intuitive argument for the running time of Program Strict Saddlepoint [11, pp. 
141-142], 


COROLLARY. The strict SP problem on an m Xn matrix, with m >n, can be 
solved in O(()n'®?) time. 


Proof. Divide the matrix into [”] possibly overlapping n X n square matrices, 
Aj,..-, Atm nj Apply the theorem to each A; to get at most [%'] candidate strict 
SP’s, from which the real strict SP (if it exists) can be found in work O(7 + m + n). 
The total work is thus bounded by 


[= Jo(n's?) + O(m) = o({— Jn) 
as claimed. 


Ill. Saddlepoints. Note that if we are seeking a SP then observation 1 holds only 
if a,; > ay, > ++: >a,, and similarly observation 2 will fail unless the inequality 
is strict. Thus it is not clear whether the search for a SP can be sped up in a similar 
way. In fact we show next that the SP problem cannot be solved in time better than 
cn’. 

Let C be the class of all algorithms that are made up entirely of table look-ups 
and arithmetic operations. 


THEOREM. There does not exist an algorithm for the SP problem in the class C that 
can run faster than cn. Further, there exists an algorithm in C that does run that fast. 


Proof. We can put a check mark in each matrix cell which is the maximum in its 
row in time cn’, and then do the same for the cells which are the minimum in their 
columns in cn? time while checking for a cell with two check marks which would 
clearly be an SP. Hence, there surely is a quadratic time algorithm in C for this 
problem. 

We now show that no faster algorithm is possible. We do this by playing the 
adversary against an arbitrary algorithm in C. We argue that the algorithm must 
look up every entry in A. 

Our strategy: For the first n* — 1 queries of the values of a, j we answer 0, unless 
all the other entries in row i have been queried, in which case we answer 1. If we 
answer .5 to the last (n*) query, then either that cell is the unique SP or there does 
not exist an SP. This follows since every 0 entry has either a 1 or a .5 in its row and 
every 1 has either a 0 or a .5 in its column. If instead, we answer —1, then the cell is 
not a SP, and in fact any zero entry in that last row will be a SP. Hence no 
algorithm in C can know the answer to the SP problem'without n? queries. 


In conclusion, we have shown that the strict SP problem can be solved in O(n'>®) 
time, but that a similar improvement from O(n’) for the SP problem cannot be 
found. Note that our algorithm for the strict SP problem uses O(n'>°) matrix 
queries and has an O(n'*) total running time. We conjecture that it may be 
possible to solve this problem with less than O(n'>*) matrix queries but at the cost 
of a longer running time. 
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Dehn’s Algorithm and the Complexity of Word Problems* 


RONALD V. BOOK 
Department of Mathematics, University of California, Santa Barbara, CA 93106 


1. Given a collection of objects such as strings or graphs or trees or... (data 
structures) and a system of transformation rules that apply to the objects in that 
collection, there is the following problem: 


Instance: two objects from the collection; 

Question: can one of the objects be transformed into the other (or does there 
exist a third object such that both of the given objects be transformed 
into the third) in a finite number of steps by applying rules from the 
system? 


This problem is called the “word problem” for the given collection. Thue [27], 
[28] described this problem for strings, that is, concatenations of letters from a finite 
alphabet. In this case the transformation rules are ordered pairs (u, v) of strings and 
the idea of a transformation is that any occurrence of uw as a substring may be 
replaced by v, and vice versa. Thus, the formal version of Thue’s problem 1s the 
word problem for finitely presented monoids. Thue was also interested in the word 
problem for other types of combinatorial objects such as graphs and trees (i.e., for 
data structures). Dehn [8] also described this problem, in this case for finitely 
presented groups, and the work of Dehn became fundamental for combinatorial 
group theory [20]. Word problems have been studied in many contexts in mathe- 
matics and in computer science. In any given setting one would like to have an 
algorithm to solve the word problem. In general, such an algorithm cannot exist 
since the word problem for finitely presented monoids is undecidable [21], [25] as is 
the word problem for finitely presented groups [4], [23]. (For a general discussion of 
the existence and nonexistence of algorithms for such problems, see the book by 
Davis [7].) 

In cases where the word problem is decidable, one wishes to know the intrinsic 
computational complexity of the problem. Frequently, the computational complex- 
ity of a problem is measured in terms of the “running time” for some fastest 
algorithm that solves the problem. The running time is the number of steps in the 
computation of the algorithm on a given input. By considering all inputs of size n 
and taking the maximum running time over this set of inputs, one obtains an upper 
bound as a function of n. This function depends on the precise definition of these 
variables but its asymptotic behavior is invariant under most reasonable definitions. 
A problem is considered to be tractable if and only if it can be solved in time 
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bounded by some polynomial in the size of the input. (For a discussion of 
tractability, see the book by Garey and Johnson [11].) 

In the present paper we restrict attention to a class of finitely presented monoids 
whose presentations are very well behaved. The property of presentations that is of 
interest is the property of being “Church-Rosser on the congruence class of the 
identity,” a property that is enjoyed by group presentations to which Dehn’s 
Algorithm applies. We will consider this property from the standpoint of rewriting 
systems and present a simple result that has as a corollary the recent observation of 
Domanski and Anshel [10] that the word problem for groups with Dehn’s Algorithm 
can be solved in linear-time. First we review some of the facts regarding Dehn’s 
Algorithm. 

Dehn [8] considered presentations of fundamental groups of closed orientable 
surfaces and showed the appropriate word problems to be decidable. From the 
standpoint of rewriting systems, Dehn’s strategy may be described in the following 
way. 


(1) Suppose that a presentation of a group G, consisting of a finite set 2 of 
generators and a finite set R, = {(7,,1)|1 < i < mp} of relators, has the following 
property: every freely reduced nontrivial word w that is equal to the identity 1 in G 
has a factorization xyz where for some i, r, has a factorization yt with y being 
longer than ¢. Then R, can be transformed into a finite set R, of relations for 
which (u,v) € R, implies that the string u is longer than the string v (by placing 
(y,t~') in R, if (yt,1) is in Ry). Thus, R, permits xyz to be transformed in one 
step to xt~ +z, which is shorter. Moreover, (2, R,) presents G so that it is sufficient 
to solve the word problem modulo the group presentation (2, R,). 

(ii) Consider “reduction rules.” The reduction rules in (2, R,) are “u reduces to 
v” for (u, v) © R,. In addition, the “free reductions” are allowed: “oo~* reduces to 
1” and “o~'o reduces to 1” for o € =. What is known as “Dehn’s Algorithm” is the 
notion of applying the reduction rules in R, and the free reductions until no such 
rule is applicable. For the appropriate group presentations, it is the case that for any 
w equal to 1 in G, one can apply the reduction rules in R, and the free reductions 
arbitrarily until no such rule is applicable, and regardless of the order of application 
of these rules the final result is 1. Thus, to determine whether w, is equal to w, in G, 
it is sufficient to determine whether w,w,' reduces to 1 by applying some finite 
sequence of the reduction rules. 

In the language of rewriting systems, the presentation (2; R, U {(oo0~*, 1), 
(o~ ‘o,1)|o © &}) is “Church-Rosser on the congruence class of the identity” (this 
notion will be defined in Section 3). Here it is shown that the fact that the word 
problem for groups with Dehn’s Algorithm can be solved in linear-time is a special 
case of a‘more general phenomenon: If 7 1s a-finite Thue system and w is a string 
such that T is Church-Rosser (with respect to length) on the congruence class [w] of 
w (mod T), then there is a linear-time algorithm to determine for an arbitrary string 
z whether z is congruent to w; if, in addition, T presents a group, then the word 
problem for that group is solvable in linear-time. The fact that Dehn’s Algorithm 
applies to a finitely presented group G implies that there is a finite Thue system T 
that presents G (as a monoid) and has the property that T is Church-Rosser on the 
congruence class of the identity 1, and the observation of Domanski and Anshel 
follows immediately. 
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For a description of Dehn’s Algorithm from the point of view of combinatorial 
group theory, see Lyndon and Schupp [18] or Zieschang, Vogt, and Coldewey [30], 
and for a description from the point of view of rewriting systems, see Le Chenadec 
[16]. 


2. Here we provide formal definitions of Thue systems and related notions. 

An alphabet > is a finite set whose members are called /etters. The set of words 
over = is denoted =*, and 1 denotes the empty word. Thus, =* is the free monoid 
generated by > under the operation of concatenation, with the empty word as 
identity. If w © 2*, then the length of w, denoted |w|, is defined as follows: 
|1] = 0, Ja] =1 for ae X, and |wa| = |w| +1 for we &* and ae X. The 
identity of words is written as = , and the concatenation of words u, v is written as 
uv. 

A Thue system T on 2 is a subset of =* xX X*. The Thue congruence <— * > 
generated by T is the transitive, reflexive closure of the relation < defined as 
follows: for (u,v) © T and x, y © 2*, xuy © xvy and xvy © xuy. Two strings 
w, z © >* are congruent (mod T) if x — * — y, and the congruence class of w is 
[w] = {z © 2*|z — * > w}. The monoid presented by T is defined as follows: (4) 
the elements are [x], x © &*, (ii) the multiplication is [x] -[y] = [xy], x, y © 2*, 
and (iii) the identity is [1]. This monoid is denoted by =*/< * — . If both 2 and T 
are finite, then =*/< * — 1s finitely presented. 

Our main interest here is the word problem for the monoid 2*/< * — for any 
given Thue system. Given T on &, this can be expressed as follows: 


Instance: two strings x and y in 3*; 
Question: are x and y congruent (mod T)? (i.e., [x] = ?[ y]). 


In general, this problem is undecidable [25], that is, there exists a Thue system 7 
such that no algorithm will decide the word problem for T. We are interested in the 
situation where the monoid presented by a Thue systems is, in fact, a group and 
where it is decidable. 

Recall that if there is an algorithm to solve the membership problem for a set, 
then the set is recursive. Thus, the word problem for a Thue system T is decidable if 
and only if for every string z, the congruence class of z modulo T is a recursive set. 


3. It is sometimes the case that one wishes to place an ordering on the strings 
making up each rule of a Thue system. This gives rise to the notion of a “rewriting 
system.” (Rewriting systems are important for the study of automated deduction. 
See [14] and [29].) 

A rewriting system R on & is a subset of &* X X*. An element (u,v) © R is a 
rewriting rule. The reduction relation generated by R is the transitive reflexive closure 
=*=> of the relation = defined as follows: for (u,v) €@ R and x, y € 3*, 
xuy => xvy. A string w is irreducible if there is no y such that w => y. 

For a rewriting system R, denote by R~' the rewriting system {(v, u)|(u, v) © 
R}. Then RU R™! is a Thue system and the Thue congruence generated by 
RUR7™! is precisely the transitive reflexive closure of the binary relation 
(= 7!U =). By the word problem for R, we mean the word problem for the 
monoid presented by R U R71. 

Let R be a rewriting system on 2. We say that R is noetherian if there is no 
infinite chain x; > x,> °°: 
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Notice that if R is finite and noetherian, then for x € X* one can compute an 
irreducible element x such that x =* => x: If x is not irreducible, then it can be 
reduced; this process can be iterated only finitely many times since R is noetherian. 
For a string z © 2*, R is Church-Rosser on [z] if [z] contains at most one 
irreducible element. If R is Church-Rosser on [z] for every z € 2*, then R is 
Church-Rosser. If R is both noetherian and Church-Rosser, then R is complete. 

It is easy to see that if R is a finite rewriting system that is complete, then the 
word problem for R is decidable. Furthermore, if R is a finite rewriting system that 
is noetherian and for some string z, R is Church-Rosser on [z], then [z] is recursive 
so that the word problem for [z] is decidable. From Lemma 2 notice if R 1s a finite 
noetherian rewriting system on = such that the monoid presented by R U R7™! isa 
group and such that there is a string z such that R is Church-Rosser on [z], then the 
word problem for R is decidable. 

Now we consider some special systems. Let T be a Thue system on alphabet 2 
such that 7 has no rules that preserve length, i.e., (u, v) € T implies that |u| # |v]. 
Let R be the rewriting system defined by R = {(u, v)|(u, v) © T or (v, u) & T, and 
|u| > |v|}. Thus the notion of reduction is based on decreasing the length of the 
string by applying a rewriting rule. In this context we will use — * — to denote the 
reduction relation =* =. Clearly, R is noetherian. By assuming that (u,v) © T 
implies |u| > |v|, we refer to T instead of to R. Throughout the rest of this paper 
we will restrict attention to Thue systems with no length-preserving rules and with 
the notion of reduction being based on decreasing the length of the string by 
applying a rewriting rule. 

The following lemma is very helpful in this context. 


LEMMA 1 [2]. Let T be a finite Thue system on a finite alphabet =. There is a 
linear-time algorithm that on input a string w © X* will compute an irreducible string z 
such thatw —- *—> z. 


Proof. Proceed by scanning w from left to right and applying reductions whenever 
possible. Each time a reduction is applied one must consider the possibility that a 
portion of the previously scanned string is a prefix of the left-hand side of a 
reduction and the corresponding suffix of the left-hand side of that reduction is the 
next portion to be scanned. Thus, backtracking may be needed but the amount of 
backtracking necessary after a reduction is at most the length of the longest 
left-hand side of the rules in 7. This strategy can be implemented with two 
pushdown stores, with the first pushdown store being initially empty and the second 
initially containing w, where the leftmost symbol of w is on the top of the store. 
When the computation halts, the second pushdown store is empty and the first 
pushdown store contains an irreducible string z with the rightmost symbol of z on 
the top of the store. Clearly, at most |w| such reductions can be applied. Hence, the 
number of steps in the computation is linear in |w| where the constant of linearity 
depends on m and the number of rules in T. 


COROLLARY. Let T be a finite Thue system on finite alphabet =. Suppose that z is a 
string such that T is Church-Rosser on [z]. Then there is a linear-time algorithm to 
decide the membership problem in {z], i.e., to decide whether a given string is 
congruent to Z. 
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COROLLARY [2]. Let T be a finite Thue system on finite alphabet X. Suppose that T 
is Church-Rosser. Then there is a linear-time algorithm to solve the word problem 
for T. 


The local property of being Church-Rosser on the congruence class of a given 
string is generally nontrivial since Otto [24] has shown that the following question is 
undecidable: 


Instance: a finite Thue system 7 and a string z: 
Question: is T Church-Rosser on [z]? 


On the other hand, it is known [3] that there is a polynomial time algorithm to 
solve the global property of a finite Thue system being Church-Rosser. 

It is clear that if the restriction that no rules are length preserving is omitted and 
reduction is interpreted as “not increasing the length,” then the Church-Rosser 
property still guarantees that the word problem is decidable. But now a linear time 
bound cannot be guaranteed; in fact, there are cases where the word problem is 
complete for polynomial space. 


4. Now we turn to the situation where the monoid presented by a finite Thue 
system is a group. We are interested in the complexity of the word problem for that 
group in a special case where the word problem is decidable. 

Suppose that T is a finite Thue system on alphabet = such that =*/< * > isa 
group. It may be that the inverses of the elements of 2 are not explicitly given; 
however, the fact that those inverses exist (since 2*/< * — is a group) along with 
the finiteness of T and 2 allows one to compute those inverses. This can be seen as 
follows. Assume some effective enumeration of the set 2*: wo, w,,.... For each 
o © 2 and each i, let y, , denote the string ow,. Since T is finite, for every string x 
there are only finitely many strings z that are reachable from x by just one 
application of a rewriting rule. For each i one can effectively enumerate the set of 
strings z such that y, ; <— * — z (say by first enumerating all the strings reachable 
from y, by means of only one application of a rewriting rule, then enumerating all 
the strings reachable from y, , by means of two applications of rewriting rules, etc.). 
By “dovetailing” these enumerations, we can begin to compute a list of all strings 
congruent to y, 9, all strings congruent to y,j,.... Each time such a string is 
generated, we check to see if it is identically 1; if y, , is a string such that 
Yo, ;(= ow,) 1s congruent to 1, then w, is congruent to o~', so that this computation 
can halt. But by hypothesis, =*/< * > is a group, so that o~' has an inverse. 
This means that eventually some y, ; and z will be enumerated where y, ; is 
congruent to z and z is identically 1. 

Recalling that % is finite, let k be the maximum over all o € 2 of the running 
times of this procedure on o. Recall that each w € 2* has a unique factorization as 
a product of elements of 2, say w = a,...a,, so that a7’... aj is congruent to 
w'. Hence, a string that is congruent to w~’ can be computed in time O(k|w]) = 
O(|w|). 

This leads to the following fact. 

LEMMA 2. Let T be a finite Thue system on a finite alphabet %. Suppose that 


2*/< * — is a group. Then there is a linear-time algorithm that on input a string 
w € &* will compute a string z such that z is congruent to w™', i.e., [z] = [w]71. 
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The reader may note that since >*/< * — is a group, one can assume for each 
element o € > there is a string y such that y is congruent to o~'. Hence, one 
knows that Lemma 1 holds simply by knowing that the inverses of the elements of 
>= exist, that is, those inverses can be treated as constants in the algorithm. 


LEMMA 3. Let T be a finite Thue system on a finite alphabet = such that 
=* /<— * > is a group. Suppose that there is a string z such that |z] is a recursive set, 
that is, there is an algorithm to decide the membership problem for |z]|. Then the word 
problem for X*/<— * — is decidable. 


Proof. Given x and y in 2*, one wishes to determine whether x and y are 
congruent, i.e., whether [x] = [y]. This is accomplished by computing y~' and then 
determining whether zxy~' is congruent to z. By hypothesis, this question is 
decidable. Since =*/< * — is a group, cancellation applies, so that zxy~! is 
congruent to z if and only if xy~! is congruent to 1 if and only if x is congruent 


to y. 


Lemma 3 shows that if =*/< *— is a group and for some z € 2*, [z] is a 
recursive set, then for all z € =*, [z] is a recursive set. 

Now we consider the situation described in Lemma 1 when the monoid presented 
by the Thue system is a group. We have the following result. 


THEOREM. Let T be a finite Thue system on alphabet X such that the monoid 
presented by T is a group. Suppose that z is a string in &* and that T is Church-Rosser 
on [z]. Then there is a linear-time algorithm to solve the word problem for this group. 


Proof. As in the proof of Lemma 3, given x and y, one can compute zxy~! and 
determine whether this string is congruent to z. By Lemmas 1 and 2 this can be 
determined in time linear in |zxy~*|. But T and z are given, so that in general this 
can be determined in time O(|xy~*|) = O(|x| + |y]). 


As a corollary we have the result of Domanski and Anshel. 


COROLLARY [10]. Let G be a finitely presented group such that Dehn’s Algorithm 
solves the word problem for G. Then there is a linear-time algorithm to solve the word 
problem for G. 


One should note that there are other interesting studies of groups with Dehn’s 
Algorithm from the standpoint of rewriting systems. In particular, Greendlinger 
(12], [13] analyzed Dehn’s Algorithm from the standpoint of rewriting systems for 
free groups. Biicken [5] showed that if a finitely presented group G has small 
cancellation, then there is a finite Thue system that presents G and is Church-Rosser 
on the congruence class of 1. Le Chenadec [15] developed a characterization of 
small cancellation groups by considering a hybrid of rewriting techniques and 
geometric tools. To see the flavor of current research in the theory of rewriting 
systems applied to groups and other algebraic structures, see [16], [17]. 

The study of groups with Church-Rosser presentations is of considerable interest. 
Muller and Schupp [22] characterized the class of groups with context-free word 
problems. Diekert [9] and Madlener and Otto [19] studied these groups from the 
standpoint of Church-Rosser presentations, while Autebert, Boasson, and 
Senizergues [1] studied the type of context-free languages that characterize the 
congruence class of the identity for such groups. 
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Squier and Otto [26] describe new results on canonical rewriting systems for 
monoids. Cannon [6] describes the system CAYLEY which is designed to solve 
certain computational problems about finitely presented groups. 
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UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if 
any are known to the author) and by a brief description of known partial results. Manuscripts should be sent 
to Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, 
Canada T2N IN4. 


Four Problems on Prime Power Divisibility 


A. GARDINER 
Department of Mathematics, University of Birmingham, Birmingham B15 2TT, UK 


We all know about binomial coefficients, about the sums Lr~! and Lr~’, and 
about congruences modulo p’, where p is a prime. Yet combining these simple 
ingredients, and stirring in a few elementary calculations can still have some 
surprising results. Our first three problems deal with these very familiar objects. 


(A) For which primes p, if any, does the congruence (2? = 2 hold modulo p*? 

(B) For which primes p, if any, does the congruence )? ~1~l = 0 hold modulo 

39 

p*? 

(C) For which primes p, if any, does the congruence 1?" 'r~ 
p*? 

Elementary calculations suffice to establish that each of these congruences holds 
modulo any smaller power of p (§§ 2,3), and that the three questions stated are 
equivalent (§ 4). Moreover, we shall show that questions (A)—(C) are equivalent to a 
fourth question (D) (§ 5) which has an interesting history of its own. 


2 


0 hold modulo 


(D) For which primes p, if any, does p divide the numerator of the Bernoulli 
number B,_,? 


At present only one such prime is known (see § 1). 


1. Irregular primes and Fermat’s Last Theorem. We begin by outlining some 
background to question (D). In 1850 Kummer proved that Fermat’s Last Theorem 
is true for the prime exponent p provided that p is regular; that is, provided that p 
does not divide the numerator of any of the Bernoulli numbers B,, B,, Bg,..., B,_3. 
This focussed attention on the irregular primes p, which do divide the numerator of 
one or more of these Bernoulli numbers. 

Is there anything special about irregular primes p which divide the numerator of 
B,_3? Selfridge and Pollack [5] listed the irregular primes p < 25000 and verified 
Fermat’s Last Theorem in each case. Johnson [4] extended (and corrected) this work 
to cover primes p < 30000. He made the explicit observation that, while almost 40% 
(1273) of the primes p in this range are irregular, only one—namely p = 
16843—divides the numerator of the /ast relevant Bernoulli number B,_,. Samuel 
S. Wagstaff of Purdue University has recently extended this analysis to p < 150000, 


but p = 16843 remains the only known irregular prime with this property. Since our 
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four questions (A)—(D) will be shown to be equivalent, p = 16843 is also the only 
prime < 150000 which satisfies the congruences in (A)-—(C). 


2. Congruence modulo p. We show first that the congruences in (A), (B), (C) 
hold modulo p. 
(A) We give two proofs. 


(i) The binomial coefficient (2? counts the number of p-element subsets of the 
set {1,2,...,2p} of size 2p. The permutation 


(12--+ p)((p + 1)\(p + 2) +++ 2p) 


leaves exactly two of these p-subsets invariant, namely {1,2,..., p} and { p + 1, 
p+ 2,...,2p}, and cycles all the other p-subsets in orbits of length p. Hence 


(2?) = 2 + (multiple of p) = 2 (mod p). 
(ii) The same congruence modulo p also follows from the algebraic definition of 
the binomial coefficient 


(27) — (py! | », Cea 1)(2p — 2)---(p +1) 
Pp p\p! (p— 1)! 

Since the denominator ( p — 1)! is not a multiple of p, it is invertible modulo p. 
Moreover (2 p — 1)((2p — 2)-:-(p+1)=(p— 1)! (mod p), so 


(2?) =2-(p-1(p- 1) *=2 (mod p). " 


(B) The congruence in (B) is false modulo 2, so let p be an odd prime. Distinct 
non-zero residues modulo p have distinct inverses modulo p. Thus as r runs 
through the residues from 1 to p — 1 modulo p, so also does r~ +. Hence 


p-1 p-l (p-—1)/2 


Yrit= viz 3s (i+ (p—i))=0 (mod p). a 


1 1 
(C) The congruence in (C) is false both modulo 2 and modulo 3, so let p be a 
prime > 5. As, runs through the nonzero residues, so also does r~!. Hence 


p-l p-l 


Ports x (1)? = Yi (modp). 


1 1 
Since 6 is invertible modulo p when p > 5, the familiar formula 


y i* = =p(p — 1)(2p —- 1) 


yields 


p-l 


y° r-?=0 (mod p). 
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3. Congruence modulo p’ (r > 2). 
(B) We begin by showing that the congruence in (B) holds modulo p* provided 
p > 5 [2]. (The congruence is clearly false modulo 3.) The identity 
p-ly (p-~)/2/ 4 1 (p—1)/2 1 1 p-l 1 
= — + = . = —p . —— 
an | p 1 r(p-r) 2 1 r(p-r) 


1 7 1 
works just as well modulo p?, so 


p-l p-l 
Drts2t-p- Yr p-r) (mod p?) 
1] 1] 


yep. -'s (mod p2) 


1 


0. (mod p’). eo 


(A) The congruence in (A) holds modulo p* when p = 3, and modulo p? when 
p > 5. (Dickson attributes this to Wolstenholme (1862) and to Cunningham (1907) 
[1].) This is the special case “s = 0, k = 2” of the following theorem. 


THEOREM 1. Let p be any prime > 5. If the positive integer k is a multiple of p’*, 
then 


(1 | =k (mod p?5*?), 


Proof. Let k = mp*, and put t= s+ 1. Then kp = mp’. Partition a set of size 
mp’ into m parts of size p’. Then the number of p-subsets of the whole set satisfies 


y= ETE) (2) 


t 
The m terms with exactly one i, # 0 are all equal to C ). The terms with three or 


more of the i; # 0 are all divisible by p*. Hence 


Byam )+E (2, (moar. 


P i p-i 
We consider the two terms on the right-hand side separately: 


(2 p'(p' — 1) - +: (p= (p- V) 


P p! 


1 P-!(p-1)! 
— SL _ _ t. eT * 2t 
=p (pi)! C 1)! —p 2 ; + (multiple of p ) ; 


If we interpret this identity modulo p***? and use the fact that, since p > 5, we 


know that the congruence in (B) holds modulo p?, then we obtain 


2" =p’ (mod p®**?). (2) 
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Expanding the summands in the second term of (1) in the same way we get 


ral p' | _ Pet = Ys (= = 0) 
p 


p'(p'-1)-++(p'- (p-i-1)) 
(pi)! 


—p*-i-"(p—i) © (mod p*). 


= (?)[,” | = —p". ae —i)~" (mod p*) 


1 


p-l 


p”! . y j~2 (mod pt") 
1 


=0 (mod p*‘*'), (3) 
since £?~'i~* = 0 (mod p). Combining (1), (2), and (3) gives the result. 


4. The unsolved problems (A)-(C). We show first that (A) and (C) are equiv- 
alent. We then show that (B) and (C) are equivalent. 

The fact that (A) and (C) are equivalent follows from a natural attempt to extend 
Theorem 1. When does the congruence in Theorem 1 hold modulo a higher power of 
the prime p? It certainly does if k = 0 (mod p‘*'). But it can also happen in other 
ways: for example («5 = 142506 = 6 (mod 5*). This suggests the question 


(A’) For which primes p, and which integers m # 0 (mod p) does the congruence 
(7) = m hold modulo p**+4? 
When s = 0 a Straightforward expansion of the binomial coefficient gives 


THEOREM 2. Let p be any prime > 5. If m is not a multiple of p, then (” =m 
(mod p*) holds if and only if either (a) m=1 (mod p), or (b) L?-'r-* =0 
(mod p’). 


COROLLARY. Questions (A) and (C) have the same answer. 


We now ask when the congruence in (B) holds modulo p°. The calculation used 
in § 2 to prove this congruence modulo p” already contained a strong hint that this 
is closely related to question (C). 


THEOREM 3. Let p be any prime > 3. Then L?~'r~* = 0 (mod p*) holds if and 
only if X?~*r~? = 0 (mod p’). 


Proof. 
p-l p-1 4 
2p- der *=2p- di (r*) 
1 1 
p-l 


= -2p- r (pr) (mod p*). (4) 
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When 1 <r < p — 1 the identity 
2p 1 1 


+ 
p-r? p-r ptr 


has a natural analogue modulo p’. If we use this, then (4) becomes 


—2p: y r z= y (p- ry? + y (p+ ry? (mod p?) 


p-1 1 


Erte E(p+r)} (mod p?). (5) 


Since 1 <r < p — 1 we have the following alternative expression for (p +r)~' 
(ptr) =r'-pr?+ pr? (mod p?). 
Substituting this into (5) now gives 
p-l p-1 p-l p-1 
—2p- Yr’?=2: Yrt—-p- Vrvrstp?- Yr? (mod p?). (6) 
1 1 1 1 
Finally, the formula for the sum of the first p — 1 cubes shows that 


p-l p~l —~1)\? 
y ro 3 = y i? = od = 0 (mod p*) 
1 1 2 
p~-l p-1l 
—p-: Yor 7=2- yr! (mod p?). 
1 1 


The theorem follows. 
COROLLARY 1. Questions (B) and (C) have the same answer. 


COROLLARY 2. Let p be any prime >5. If m #0,1 (mod p), then the con- 
gruences (a) ("")= m (mod p*), (b) LP~'r~' =0 (mod p’), (c) LP * = 
(mod p*), are all equivalent. 


5. Bernoulli numbers. We show that questions (C) and (D) are equivalent. It 
follows that all four of our questions (A)—(D) have the same answer. 
Jacob Bernoulli posed the problem of expressing the sum 


S,(n) =1"+ 2% +4 +--+ +(n— 1)" 


as a polynomial in n. He gave an inductive method for finding S,,(n) for any given 
m [3, § 15.1]. The constant term of this polynomial is —1 when m = 0 and zero 
when m > 1. The coefficient of m in S,(n) is defined to be B,, the mth Bernoulli 
number. Thus B, = 1, B, = —1/2, B, = 1/6, B,; = 0, and so on. 

Let B, = U,,/V,, where U,, and V,, are relatively prime integers. If p is a prime 
>5, then S,_,(p)=090 (mod p) [3, p. 235], and V,_,-S,_,(p) = U,_3°p 
(mod p”) [3, Proposition 15.2.2]. Hence p divides the numerator U,-3 of B,_, if 


1988] UNSOLVED PROBLEMS 931 


and only if S,_3,( p) = 0 (mod p*). A simple calculation then shows that (for p > 5) 


S,-3(p) = x r-* (mod p?). 


Hence questions (C) and (D) have the same answer. 


Acknowledgement. I am grateful to D. R. Heath-Brown for directing my attention to question (D). 
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NOTES 


EDITED BY DAVID J. HALLENBECK, DENNIS DETURCK, AND ANITA E. SOLOW 


Another Brief Proof of the Sylvester Theorem 


X. B. LIN 
Mathematics Department, Michigan State University, East Lansing, MI 48824 


A finite set of points, S, in a projective or affine space such that no lines 
intersects S' in exactly two points is known as a Sylvester-Gallai (SG) configuration. 
In real space it is well known that there are no nonlinear SG’s. There are several 
simple proofs of this fact (see [1]—[4]) and we offer here still another, as far as we 
know somewhat different from the others. The result in E” follows from that in E* 
by projection. 


THEOREM. If S is a finite set of points in E? such that no line intersects S in 
precisely two points, the S is a subset of a line. 


Proof. We use Hilbert’s definition of angle as the union of two noncollinear rays 
with a common end point. If A, B, and C are in § in S, then the angle defined by rays AB 
and AC is denoted BAC or CAB. If AB U AC contains a fourth point of S, the 
angle is called an admissible angle (relative to S). Suppose now that S is not linear. 
Then the angles at the vertices of the convex hull of S are certainly admissible so the 
set of such angles is not empty. Suppose BAC is the largest _su such angle with angle 
measure a < 180 and that D is a fourth point of S on ray AC. There is no loss of 
generality in assuming that C is between A and D. 

Now BCD is larger than BAC so it is not an_ admissible angle. Thus ray CB 
cannot contain a third point of S. But the line CB, by assumption, must contain 
another point E of S so E must be on the ray opposite to CB. 

But now the line AE presents us with a contradiction, since it must contain a 
third point F of S, and if F is on ray AE, then BAE is admissible and larger than 
BAC; while if F is on the ray opposite to AE, then FAC is admissible and greater 
than BAC. 

This contradiction shows that S must be linear. 
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A > B > 0 Ensures ( BA’B)'” > B?— 
Solution to a Conjecture on Operator Inequalities 


TAKAYUKI FURUTA 
Department of Mathematics, Faculty of Science, Hirosaki University, Bunkyo-cho 3, Hirosaki 
036, Aomori, Japan 


DEDICATED TO PROFESSOR ZIRO TAKEDA WITH RESPECT AND AFFECTION 


In an issue of this MONTHLY [1, p. 539], the following conjecture is stated. 


CONJECTURE. Let A and B be hermitian matrices on a finite dimensional Euclidean 
space. If A > B = 0, then 


(BAB) > B? 
and 
A? > (AB?A)””. 
In this short note, we shall prove this conjecture in a more general form. 


THEOREM 1. If A and B are positive bounded hermitian linear operators on a 
Hilbert space such that A => B = 0, then 


(BA2B)'’” > B? (1) 
and 

A? > (AB?A)”. (2) 
We prove the following Lemma needed for Theorem 1. 


LemMMA. If A and B are positive bounded hermitian linear operators on a Hilbert 
space such that A = B = 0, then 


(BY/243B1/2)'/* > B? (i) 
and 
(BY/242B1/)'” > B. (ii) 


We quote the following result to show the Lemma. 


THEOREM A ([2]). Let X and Y be bounded linear operators on a Hilbert space H. 
We suppose that X = 0 and ||Y|| < 1. If f is an operator monotone function defined on 
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[0, oof, then 
f(Y#XY) = Y*s(X)Y. 


Proof of the Lemma. If A= B20, then A+e>B+e2€ for any e > 0, so 
B+ eand A + « are both invertible and since for any bounded linear operator 7, 
(T + ©)? converges to JT? in norm as e > O for each p> 0, therefore we may 
assume that A and B are invertible. If A > B > 0, then B’”*47'Bl”* < J, hence 
|A7*7*B 41 < 1. Put Y=A7~'/*B'/*. By Theorem A, we have f(Y*XY) = 
Y*f(X)Y for an operator monotone function f defined on [0, oof and for X = 0 
since |/Y|| < 1. Take X¥ = A4 and f(t) = t’” for real ¢ > 0, then f(t) is an operator 
monotone function ({3]) and X = 0. By Theorem A we have 


(BM24-1/2444-1/2Bl2y'/? > BMV2471/244/24-1/2B1/ 
> B)/24B 4 > B'/7BB)2 _— B2 


that is, (i) of the Lemma is shown. Similarly, take X = A? and f(t) = t'”°, then we 
have (ii) in the Lemma. 


Proof of Theorem 1. Put C = (B'/74’B'/*)'”. Then C > B = 0 by (ii) in the 
Lemma, so that applying (i) in the Lemma to C and B, we have 


(B?2C3B V2)" > B 
that is, 
(BIB Y/242BV/2Bi2) 7 > B?, 
whence 
(BAB) > B? 
so that (1) is shown. (2) is an easy consequence of (1) taking inverses. This is shown 
in [1] and we reproduce it for the sake of convenience. We may assume the existence 
of B~! without loss of generality. By hypothesis, B~! > A~! > 0. (1) in Theorem 1 
ensures (4 ~!B~*A7~')!/2 > A~?. Taking the inverse gives (2). 


Remark. Further extensions of Theorem 1 will appear in my forthcoming paper 
together with some related results. 


Addendum. We give an elementary proof of Theorem 1. Since A => B = O ensures 
A“ > B*% for any a & [0.1] ({3]), for any r € [0,1/2] we have 


1/2 1/2 


> (BYAl~ B24) "Bry? > B’A!~"B’ > Bit. 

(*) 
Put r = 1/2 in (*), so C = (BY’7A’Bl’’)'? > B3/” = D > 0. Then applying (*) to 
C>D2>0 and put r= 1/3, we have (D!°C?D!7)!” > D4/?, that is, (BAB)? 
> B*. We remark that (B’A?B")'” > B!*” holds for any r > 0 by the same method 
stated above. 


(B’A?B")°” = ( B’A'~'A?"A)'B’) 
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The Evaluation of a Putnam Integral 


S. J. BERNAU 


Department of Mathematical Sciences, The University of Texas, El Paso, TX 79968 


One of the problems on the 1985 Putnam Competition [1] was the evaluation of 
fect 177e~ 1985+") at, with the value yz given for {%,e~* dx. The suggested 
solution sets a = 1985 and considers 


I(x) _ | a dt 
5 . . 


Differentiation under the integral sign, and changes of variable lead to a first order 
differential equation for /(x). The solution of this gives 


I(x) = 1(0)e7 a, 


with J(0) obtainable from the given integral as /7/a. 

The purpose of this note is to point out a simpler method which does not use the 
auxiliary variable x, nor the differentiation under the integral sign. 

Suppose a > 0, and let 


J(a) = [oer de, 
0 
We have 
1 ~1 oo ~1 
J(a) = i] to /2e7 att) gy + | tte at et) ap 
0 1 
Substitute 1/t for ¢ in the first integral to conclude 


(o,@) _ 
J(a) = i (17172 4 4-3/2) eae) dy, 


Now make the substitution u = a!/*(1'/? — t7~'/*) to obtain 


oe) 5 7 
J(a) = 2a? [ e7~24 dy = | — e774, 
0 a 


Our method is also strong enough to recapture the two parameter integral above. 
In the integral for J(x) set v = (a/x)'/*t to conclude 


I(x) 


(x/a)'/4F((ax)'”’) = (x/a) a'/*( ax) 7/7 e7 Man” 


[a ; 
~ /2 
— _ e 2(ax) 5 
a 


as in [1]. 


REFERENCE 


1. Leonard F. Klosinski, G. L. Alexanderson, and Loren C. Larson, The William Lowell Putnam 
Mathematical Competition, The Amer. Math. Monthly, 93 (1986) 620-626. 


936 STEVEN LANDY [December 


A Generalization of Ceva’s Theorem to Higher Dimensions 


STEVEN LANDY 
1702 Arley Dr.. Indianapolis, IN 46229 


Ceva’s theorem in plane geometry gives a necessary and sufficient condition for 
the concurrence of three cevians in a triangle [1, p. 4]. A cevian is defined to be a 
segment from a vertex to the side opposite it. With reference to FIGURE 1, Ceva’s 
theorem states that AX, BY, and CZ are concurrent in point P if and only if 


AZ BX CY 
—— . — . —- =], (1) 
BZ CX AY 

Cc 

X 
Y 
A 7 B 
Fic. 1 


A generalization of (1) has been found for pyramids in higher dimensional 
spaces. N points in an N — 1 dimensional Euclidean space determine an N — 1 
dimensional pyramid. Define a “cevian” as a segment which joins a vertex of that 
pyramid to a point (hereafter called a “base point’) in the N — 2 dimensional cell 
opposite it. A general Ceva theorem may then be stated as follows: 


N cevians of an N — 1 dimensional pyramid are concurrent if and only if a vertex 
mass assignment exists such that each base point is at the center of mass of its 
surrounding cell. 


Proof. In one direction the theorem is obvious. If each vertex may be assigned a 
mass such that the base points are all cell cm’s, then each of the cevians must pass 
through the cm of all N vertices. So all the cevians will be concurrent at that point. 
Therefore, a mass assignment is sufficient for a “Ceva configuration.” 


Now assume a Ceva configuration exists for an N — 1 dimensional pyramid. Let 
A and B be two of the vertices and X and Y their opposite base points (see FIGURE 
2). AX and BY intersect in P (the “Ceva point’) and so determine a plane Q. Q 
intersects the N — 3 dimensional pyramid, R, generated by the N — 2 remaining 
vertices C, D,... FE. AY and BX extended, being in Q, and being not parallel, 
intersect in a point F lying in R. We now recall that the vector to any point within 
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Fic. 2 


a k-dimensional pyramid may be expressed as a linear combination of the k + 1 
vectors to the pyramid vertices. The expansion coefficients are made unique by 
requiring that they sum to one. Since A, Y, F, and B, X, F are collinear triples, we 
have 


X=X,B+X,F (2) 
Y=Y,4+ Y,F (3) 
X,+Xp= Y¥,+ Yr= (4) 
And since F lies in R, we have 
F=F.C+F,D+---+F,E (5) 
Fat Fy +--+: +F, = 1. (6) 


Now using (2), (3), and (5) we see 


(7) 


Equation (7) says that in the resolution of base points into linear combinations of 
cell vertices, where 2 vertices enter the calculation of 2 base points, their “relative 
weights” must be equal. Choose masses M., Mp, M;, so that F is the cm of cell 
C, D, «.. E. Then choose M, so that X is the cm of cell B,C, D, ... E, and choose 
M, so that Y is the cm of cell B,C, D, ... E. Now all the masses are assigned. Let 
Z be the base point opposite C in cell A, B, D, ... E. From (7) and the choice of Y 
as its cells cm, we have 


M, 
nw wt (8) 
D 


which generalizes to 
Z4:Z3:Zp °°: Zp = My: My: My ---: Me (9) 
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P;(x3, Y3) 


Pi(%45 i) P (x3, Y2) 


FIG. 3 


Thus Z is also its cell’s cm. Hence the sought for mass assignment exists and is 
necessary. 

It is possible to determine the vertex masses for any Ceva configuration. 
Consider first the plane triangle (see FIGURE 3). Let (x, y) be the Ceva (cm) point. 
Suppose the total mass is 1. Then 


x = M,x, + Myx, + M;x; (10) 
y= My, + Myy, + M3); (11) 
1=M,+M,+ Mg. (12) 
We may Solve this system by determinants to get 
M,: M,: M; = (13) 


X X, Xz] |x, xX X3| |X, xX, x 

Y Yo Vsis[" Y Ys]. Ye Y| = APP,P,: APP|P,: APP,P, 

1 212 d471 +12 Tyyl 1 di 
This result may be generalized by stating that in any N-dimensional Ceva pyramid 
the mass assignment is such that the mass at any vertex is proportional to the 


“volume” of the opposite N-dimensional pyramid determined by the remaining N 
vertices and the Ceva point. In TABLE 1 and TABLE 2 are shown the proper mass 


TABLE 1 
medians i1: 
altitudes tan A: tan B: tanC 


angle bisectors sin A: sin B: sinC 
= a:b: c (sides) 


to incircle tangency pts tan A/2:tan B/2:tanC/2 


thru circumcenter sin2A:sin2B:sin2C 
(. bisectors meet here) = tan B+ tanC:tan A + 
tanC:tan A + tan B 
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TABLE 2 
Pyramid Cevians M,: Mg: Mc: Mp 


medians 1:1:1:1 


angle bisectors a:b: c:d (face areas) 
(meet at center of 
insphere) 


assignments for some of the classic Ceva configurations. These may be used to 
prove theorems in a straightforward manner. For example, by observing that 
collinear Ceva points are produced by linearly related mass assignments, the 
collinearity of the centroid, circumcenter, and orthocenter of any triangle on the 
Euler line is immediate. Another demonstration made easy using the cm-concur- 
rence viewpoint shows the necessary and sufficient condition for the concurrence of 
all the altitudes of an N-dimensional pyramid. That condition is that nonadjacent 
edges be orthogonal. 
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A Sequence of Coin Toss Variables for which the 
Strong Law Fails 


S. RAMAKRISHNAN 
Department of Mathematics, University of Miami, Coral Gables, FL 33124 


W. D. SUDDERTH* 
Department of Theoretical Statistics, University of Minnesota, Minneapolis, MN 55455 


Introduction. Let X = { X,, X,,...} be a sequence of {0,1}-valued random 
variables defined on a probability space (Q, A, P). Call X a coin toss sequence if 


P[X, = i, Xo = in,..., X, =i,] = 274 (1) 


for every finite sequence i,,...,7, of 0’s and 1’s. It follows easily from (1) that 
X,, X5,... are independent and identically distributed. 

Define S, = X, + --- +X, for n= 1,2,.... The weak and the strong laws of 
large numbers for the sequence X are, respectively, 


lim P[|S,/n —1/2| <e] =1 for every e > 0. (2) 
and 
P| lim S,/n=1/2| =1. (3) 
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Both (2) and (3) are classical results if (2, A, P) is a conventional countably 
additive probability space in which ¥ is a o-field of subsets of 2 and P is a 
countably additive probability measure on F¥. 

Suppose instead that (Q, ¥, P) is a finitely additive probability space in which F 
is a field of subsets of 2 and P is a finitely additive probability measure on ¥. The 
weak law remains true. Indeed the usual proof that (1) implies (2) does not rely on 
countable additivity. However, it is well known to students of finite additivity that 
the strong law need not hold. 

At least two recent papers have given examples in which the strong law fails 
(Dubins and Freedman [2], Kumar and Fine [5]). The examples presented in these 
papers are nonconstructive and rely on more axioms than those of ZF (ZF is the 
Zermelo-Fraenkel set theory without the axiom of choice). The example presented 
in the next section is completely natural and constructive. It has the property that 
the convergence set C = [lim, _,,,S,/n = 1/2] is empty so that no extension of P is 
necessary to see that (3) fails. A slight refinement gives a constructive example for 
which the set D =[S,/n converges] is empty. Thus finitely additive spaces can be 
used to model bounded, stationary sequences for which averages fail to converge. 
Real world examples of such phenomena have been reported by Kumar and Fine [5] 
who suggest nonadditive models. 


The Example. Let Q = {1,2,...} be the set of positive integers and let F be the 
collection of all finite unions of congruence classes of the form 
A(n,r) = {w € Q|w has remainder r when divided by n}, 


where n = 1,2,...; r=0,1,...,” — 1. It is easy to check that F is an algebra. 
It is also straightforward to verify that there is a unique finitely additive 
probability measure P on ¥ satisfying 


P(A(n,r)) =1/n for every set A(n,r). 

The space (2, .#%, P) is familiar to number theorists who consider certain 
extensions of P. It was also mentioned by Dubins and Savage [3] in their seminal 
book on finite additivity and gambling. 

To define the coin toss sequence X = { X,, X,,...}, write w © Q in its unique 
binary expansion 

w=a,X2%+a,x2+--- 
and set 
X,(w) = ay. 


For a sequence i,,..., 7, of 0’s and 1’s let 
Ips dy = Lh x 2° + oes +1, x Qk-1 
It can then be verified that 
{ w|X{(w) = i,,..., X,(w) =i, } = A(2*,i,...4). (5) 
(For example, [X, = 0] is the set of even numbers.) Property (1) now follows from 
(4) and (5). Thus X is a coin toss sequence. 


Now, for every w € &, there is a positive integer k, such that a, = X,(w) = 0 
for k => ky. Hence, S,(w)/n converges to 0 for every w. 
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Next we will construct a coin toss sequence Y = {Y,, Y,...} such that (Y,(w) 


+ +--+ +Y (w))/n fails to converge, for every w. To define Y, let j,, j.,... be an 
infinite sequence of 0’s and 1’s such that (j, + --- +j,)/n does not converge. Let 
Y,= X, if j, = 0, 

=1-¥X, if j,=1. 


It is easily checked that Y is a coin toss sequence. Also, for every w, Y,(w) is 
eventually equal to 7, because X,(w) is eventually equal to 0. 

By the way, an example quite similar to the above can be constructed by taking 
to be the rational numbers in [0,1], # to be all finite unions of intervals of 
rationals and P to be the unique probability on F such that P(J) =“length” of J 
for every interval J of rationals. The random variables X,, X,,... correspond to a 
binary expansion as before. 


Remarks. We have shown this example to a number of our friends who are 
conventional, countably additive probabilists. Most of them are amused by the 
example and consider it to be evidence of the perversities of finite additivity. 
However, we view it as evidence of the arbitrariness of the assumption of countable 
additivity since we know of no simpler or more elementary model of a coin toss 
sequence. It seems to us a mistake to ban such an example from consideration. (A 
recent editor of the Annals of Probability ruled that “finitely additive probability is 
not probability.’’) 

There is a natural class of finitely additive probability measures which includes 
the countably additive measures and for which the classical strong laws of probabil- 
ity theory do hold. (See Purves and Sudderth [6], Chen [1], Ramakrishnan [7], and 
Karandikar [4].) 
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A Simple Test for the nth term of a Series to Approach Zero 


JONATHAN LEWIN 
Department of Mathematics, Kennesaw College, Marietta, GA 30061 


MYRTLE LEWIN 
Department of Mathematics, Agnes Scott College, Decatur, GA 30030 


Using Stirling’s formula, one may see at once that if a, = (2n)!/4"(n!)’, then a, 
is of the order of 1/yn, and one may conclude from the alternating series test that 
the series ).(—1)"a,, is conditionally convergent. At an elementary level, however, 
the convergence of the latter series may be a little more difficult to obtain. Since 
An41/4, = (2n + 1)/(2n + 2) < 1 for each n, it is clear that the sequence (a,,) is 
decreasing, but it is not immediately obvious within the environment of a typical 
calculus course that a, > 0 as n-— oo. For this purpose, one might use the 
following simple result which takes a leaf out of the theory of infinite products: 


THEOREM. Suppose (a,,) is a decreasing sequence of positive numbers and for each 
natural number n, define b, = 1 — a,,,.,/a,. Then the sequence (a,,) converges to zero 
if and only if the series Ub, diverges. 


Proof. We note first that unless b, — 0 as n — oo, both of the series Lb, and 
»~ log(1 — b,,) diverge. On the other hand, if b, — 0, then b,/(—log(1 — 5,)) > 1 as 
n — oo, and it follows from the limit comparison test that Lb, diverges if and only 
if © log(1 — b,) diverges. We note also that since 0 < b, < 1, we have log(1 — b,) < 
O for each n. 

Now since 1— b,=a,.,,/a, for every n, it is clear that a, = a,(1 — },) 
(1 — b,)1 — b3)---(Q— 6,_,) for each n > 2, and we therefore conclude that 
a, > 0 iff loga, ~ —oo iff loga, + X?_, log — b,) ~ —oo iff Llog(1 — b,) 
diverges iff Ub, diverges. 


Returning now to the above example, we see that b, = 1/(2n + 2) for each n, 
and the obvious divergence of Ub, implies that a, — 0. The same technique gives 
an easy proof of the convergence of such series as L((—1)"n"/e"n!), and the series 
»(*) of binomial coefficients with a > —1. 


Universal Topological Spaces 


K. D. MAGILL, JR. 
Department of Mathematics, SUNY, Buffalo, NY 14214 


Let U = {a, b,c} and let FJ, = {U, >, {a}}. It has been known for a long time 
that U with the topology .7, is a universal topological space in the sense that any 
topological space whatsoever is homeomorphic to a subspace of some topological 


942 


THE TEACHING OF MATHEMATICS 943 


power U“% of U. Indeed, this is an easy consequence of the well-known Embedding 
Lemma of [1, p. 116] (which appears in [4] as Cor. 8.15). The main problem we 
consider here is that of finding all universal topological spaces. We present two 
characterizations whose proofs involve only elementary notions and techniques. 
Thus this problem is appropriate for the students in a beginning topology course 
after they have been presented with the Embedding Lemma. We now State a version 
of the Lemma which will be convenient for our purposes here. 


EMBEDDING LEMMA. Let X be any topological space whatsoever and let {Y,\ven 
be any collection of topological spaces. For each a © A, let f, be a continuous function 
from X to Y, and suppose the collection of functions { f,}.<, Separates points and also 
separates points and closed sets. Then the evaluation map e from X into Il{Y,}aen 
defined by e(x), =f,(x) for each a € A is an embedding. 


Traditional applications of the Embedding Lemma include characterizing com- 
pletely regular Hausdorff spaces as subspaces of compact Hausdorff spaces and, 
indeed, producing the Stone-Cech compactification of a completely regular Haus- 
dorff space as well as deducing the metrization theorems. As we mentioned 
previously, another very simple application of the Embedding Lemma is to prove 
that U is a universal space. One need only verify that for any space X, the family 
C(X, U) of all continuous maps from X into U separates points and also separates 
points and closed sets. For distinct points x, y € X, define f(x) = b and f(z) =c 
for z # x. For a closed subset F C X and w € F, define g(z) = b for z © F and 
g(z) =a for z € F. Both f and g are continuous and g(w) € clg[F]. Thus, maps 
of the type f separate points while those of the type g separate points and closed 
Sets. 

It is not too much to ask students in a beginning topology course to produce such 
maps. Finding a universal space is more difficult but my experience has been that 
the better students can also do that although they often produce considerably more 
complicated spaces than is necessary. Some students may eventually need hints. 
Nevertheless, the exercise is an informative one in that it underscores, at an early 
Stage, just how complicated things can get by taking products and subspaces. 

To my knowledge, the statement that U is a universal space first appeared in 
print in [3] but it doesn’t seem to be as well known as one as one might expect for 
such a surprising result. Incidentally, a nice historical discussion of various embed- 
ding results may be found in [3]. 


THEOREM 1. X is a universal space if and only if it is not T) and it contains two 
points x and y such that some open set contains x but not y while every open subset that 
contains y also contains x. 


Proof. Suppose first that X is a universal space. Then X cannot be 7) since 
products of J) spaces are 7) and subspaces of 7, spaces are also 7). Now let 
W = {a,b} and let W have the subspace topology induced by U. Then some 
topological power X* of X must contain a copy Y of W. That is, Y = { p, gq} where 
the topology induced on Y is {@, Y, { p}}. Thus there exists an open subset G of X* 
containing p but not g and, furthermore, every open subset of X% that contains gq 
also contains p. Then 


P © ta, |Ga,| M Ta, | Ga, | Moe Ota" | Gay | S G 
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and 


[Ga] 0 ta," |G, | 9 --- Am [G,, ], 


where for each a,, 7, 1s the corresponding projection map and G, is an open subset 
of X, = X. Evidently, for some a,, 9, & G, (where q, is the a; th coordinate of q) 
while’ Pa, © G,,. Moreover, every open subset of X that contains J, Must also 
contain p, since every open subset of X“% containing g must also contain p. 

For the converse, suppose that X is not 7, and that it has two points x and y as 
described in the statement of the theorem. Since X is not 7), it also contains two 
points a and b such that a # b but both a and b are contained in precisely the 
same open subsets of X. Next, let Z = {(a, y),(b, y),(a, x)} and let Z have the 
subspace topology induced by X X X. Let G be an open subset of X containing x 
but not y. Then ZM (X X G) = {(a, x)} is open in Z. Now suppose V is any 
open subset of X X X and (a, y) € ZO V. Then 


(a,y)EZN(G,XG,)CZAYD, 


where G, and G, are open subsets of X. Since a and b belong to precisely the same 
open subsets of X, (b, y) = ZMV. Moreover, y © G, implies x € G, so that 
(a,x)& ZO V. Similarly, (b, y) © ZO V implies that both (a, y) and (a, x) 
belong to Z M V as well. Evidently the topology induced by Z is { 2, Z, {(a, x)}}. 
That is, X X X contains a copy of U which is a universal space as we verified 
previously. It is now immediate that X itself must be a universal space. 


The next two corollaries follow easily from Theorem 1 and we omit their proofs. 
COROLLARY 2. If X is a universal space then X must have at least three points. 


COROLLARY 3. Up to homeomorphism there are exactly two universal topological 
spaces with exactly three points and each has precisely one nonempty, proper, open 
subset. Consequently, in the one case it is a singleton and in the other, a pair of points. 


We proved the second half of Theorem 1 by showing that if the space X satisfies 
certain conditions then X X X contains a copy of (U, .7,) where 7, = {2,U,{a}}. 
We have also observed in Corollary 3 that (U, .7,) is a universal space where 
J, = {6,U, {a, b}}. One might be tempted to try to show that if X satisfies those 
conditions then X itself must contain a copy of either (U, .7,) or (U, Y,) but as the 
following example shows, this is not possible. 


Example. Let V = {a, b,c, da} and 


F, = {2,V,{a}, {a,b}, {c,d}, {a,c, d}). 

It follows immediately from Theorem 1 that (V, .7,) is a universal space and one 
can use that theorem to check that no proper subspace of (V, .7;) is a universal 
space. 


In a sense, then, the three spaces (U, 7,), (U, 7), and (V, 7) are all minimal 
universal spaces. Of course, any space that contains a copy of any one of these 
spaces is also a universal space. Our next theorem shows that the converse is true as 
well and this results in another quite different characterization of universal spaces. 

For a subspace Y of a topological space X, the symbol 7; will denote the 
topology induced on Y. 
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THEOREM 4. A topological space X is a universal space if and only if it contains a 
copy of one of the spaces (U, 7,), (U, F,), or (V, F;). 


Proof. We have already discussed the fact that the condition is sufficient so we 
prove only necessity. By Theorem 1, X contains two distinct points a and b that 
belong to precisely the same open sets and also two points c and d such that some 
open set contains c but not d while every open subset containing d also contains c. 


Case 1. 
c=a or c=b. 


We may assume that c = a. Then d # a, b. Let G be the open set containing a but 
not d and let Y = {a, b,d}. Then Y N G = {a, b} is open in Y and moreover is 
the only proper open subset since every open set containing d also contains a and 
hence b as well. Thus 7, = {Y, @,{a,b}} and (Y, 7) is homeomorphic to 
(U, 7). 


Case 2. 
d=a or d=b. 
We may assume d= a. Then c # a, b. Let Y = {a, b,c} and let G be an open 
subset of X that contains c but not d= a. Then b € G and GN Y = {c}. Every 
open subset of X that contains either a or b must contain the other as well and 


hence also c since d = a. In this case WY) = {Y, @, {c}} and (Y, Zy) is homeomor- 
phic to (U, 7,). 


Case 3. 
c#a,b and d#a,b. 


In this case a, b,c and d are all distinct and we let Y = {a, b,c, d} with induced 
topology .7,. It follows that a and b are contained in precisely the same open 
subsets of Y and there exists an open subset of Y containing c but not d while 
every open subset containing d contains c as well. Now let E be the smallest open 
subset of Y containing a and 5, let G be the smallest open subset of Y containing c, 
and let H be the smallest open subset of Y containing d. There are three 
possibilities for EF, two for G, and two for H. They are as follows: 


(Z£1) E=Y (Gl) G= {a,b,c} (Hl) H=Y 

(E2) E = {a,b,c} (G2) G= {c} (H2) H = {c,d} 

(£3) E = {a,b} 
Now (£1) together with (G1) is contradictory as is (G1) and (H2) together. In the 
latter instance, for example, if {a,b,c},{c,d} G7, then {c}@©7, which 
contradicts the fact that {a, b,c} is the smallest open set containing c. We now list 
all the other possibilities and simply observe that for each of these, (Y, Z,) and 
hence the space X contains a subspace homeomorphic to either (U, 7), (U, F,), or 


(V, 7,). We express the fact that (Y, 7,) is homeomorphic to (Z, 7;) by simply 
writing (Y, 7;) = (Z, J). 


(El) &(G2)&(H1) Let Z=Y- {d}. Then (Z, Ty) = (U, 7F;). 
(E1) & (G2) &(H2) Let Z=Y-{c). Then (Z, Z;) = (U, J). 
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(E2) & (Gl) &(H1) Let Z=Y-{a). Then (Z, 7) = (U, J). 
(E2) &(G2)&(H1) Let Z=Y-{d}. Then (Z, Zz) = (U, J;). 
(E2) & (G2) &(H2) Let Z=Y-{d}. Then (Z, Z;) = (U, J;). 
(E3) & (G1) &(H1) Let Z=Y-{d}. Then (Z, Zz) = (U, J). 
(E3) & (G2) &(H1) Let Z=Y —- {c}. Then (Z, Zz) = (U, J). 


(E3) & (G2) &(H2) Then (Y, Z,) = (V, J). 


This concludes the proof of the theorem. 


Some Further Exercises 

One can have students investigate related problems by defining a space X to be 
C-universal (where C is a prescribed class of topological spaces) if X € C and each 
Y € Cis homeomorphic to a subspace of some topological power X* of X. Let Tz 
denote the class of all 7) spaces, Ty the class of all Tychonoff spaces, and Zd the 
class of all 0-dimensional Hausdorff spaces. A 0-dimensional space here is one that 
has a basis of sets that are both open and closed. Exercises 1, 2 and 3 are 
straightforward applications of the Embedding Lemma and the techniques used in 
this paper. 


Exercise 1. Prove that the following statements about a topological space X are 
equivalent. 
(1.1) X is Tz-universal. 
(1.2) X is J) and not 7}. 
(1.3) X is 7) and contains a copy of the space (Y, 7) where Y = {a,b} and 
T= {Y,8,{a}}. 


Exercise 2. Prove that a topological space is Ty-universal if and only if it is a 
Tychonoff space and contains an arc. 


Exercise 3. Prove that a topological space is Zd-universal if and only if it is a 
0-dimensional Hausdorff space and contains more than one point. 


Of course, Exercises 2 or 3 may be inappropriate at this particular point, 
depending upon what topics have been covered previously. One might then assign 
the following problem. 


Exercise 4. Is there a result about 7, spaces similar in nature to the result of 
Exercise 1? 


If, after a time, students show no success in dealing with Exercise 4 it can be 
suggested that they check [2]. 
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A New Algorithm for Computing the Rank of a Matrix 


LARRY J. GERSTEIN 
Department of Mathematics, University of California, Santa Barbara, CA 93106 


To determine the rank of a matrix A and a basis for its row space, students are 
usually instructed to reduce A to echelon form by using elementary row operations. 
Then the number of nonzero rows of the echelon matrix is the rank of A, and the 
rows of the echelon matrix constitute a basis for the row space of A. 

While this is quite correct, in practice the computations can become unwieldy. 
For example, if A is an integer matrix there are two standard ways to proceed. In 
the first approach we start by using elementary row operations (based on the 
division algorithm) to eliminate all but one entry in the first column of A. In the 
second approach we first divide the top row by the left-hand entry a,, (assuming 
a,, # 0), then subtract suitable multiples of the new top row from all the others; in 
either case the computation reduces matters to considering an (m — 1) X (n — 1) 
matrix, whereupon the process is repeated. 

The first approach has the advantage that all computations are performed with 
integers, but a great many row operations are likely to be needed, and there may be 
some uncertainty as to exactly which row operations should be used in order to 
minimize computation. In the second approach denominators will immediately 
appear on the scene (unless A has been specially designed to make computations 
easy, say by having lots of rows with leading entry 1 involved in the calculations), 
and the potential for error then increases dramatically. Indeed, many texts make the 
appearance of nontrivial denominators especially likely by including the require- 
ment that each nonzero row of an “echelon” matrix have 1 as its first nonzero entry. 
That requirement is irrelevant for rank and row space determinations. (Of course, a 
leading 1 is important if the reduction is being done in order to solve a system of 
linear equations.) 

Our purpose here is to present a rank algorithm in which division is never 
needed. Like the usual procedures, this one yields an echelon matrix (not necessarily 
with a leading 1 in its nonzero rows) that is row-equivalent to the original, and 
whose nonzero rows constitute a basis for the row space of A. But here there will be 
no uncertainty about how to proceed, and’denominators will not arise in the 
computations. 

First we fix some notation. Let F be a field, and let M,,.,( F) denote the set of 
m Xn matrices over F. For any matrix A = (a,;;) © M,,.,(F) and any pair of 
indices i, j satisfying 2 <i<m and 2 <j <n, we define the 2 X 2 subdetermi- 
nant 


= 4114; ; ~ GQ) ;- 
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The following result provides the key to our procedure. 


THEOREM. Let A = (a;;) © M,,y,(F), and suppose a,, # 0. Then 


Ay, —_ Aon 
rank A =1+rank| - 
Am? Te Amn 
Proof. Write ~ for row equivalence of matrices. We first multiply rows 2 
through m by a,,, obtaining 
Qj @12 _ Gn 

41399) 4402 1 Fan 
Q4m = %114m2 0 11 Amn 


Now we take this new matrix and, for each row index i satisfying 2 < i < m, we 
subtract a,, times row 1 from row i. This yields the row equivalence 


44, 42°" Ay 
A~ ° dy a Ayn 
0 Am? Amn 


Because a,, # 0, the first row of this last matrix is not a linear combination of the 
other rows. The conclusion follows. O 


An algorithm for rank computation. First note that if a given nonzero matrix 
A = (a,,) has a leading column of Zeros, then the rank is unchanged when that 
column is deleted; thus we may assume the first column to be nonzero. Moreover, 
by using an elementary row operation (if necessary) we can assume without loss of 
generality that a,, # 0. Making use of these observations as needed, we now apply 
the theorem recursively until we obtain a matrix with only one row or column, and 
then the solution is evident. (In practice, the rank of a matrix with only two rows or 
two columns is usually evident.) 


Examples. 
3 -8 7 18 8 
(i) rank}5 ~—4 9]/=1+ rank| 35 5) = 2 + rank(312) = 3. 
2 3 6 
oe S -10 30 28 
(ii) rank 35 10 «5 | ht tank} 11-33 2 
22 -4 2 a 76 A 


Remarks. (a) The given matrix A is row-equivalent to the echelon matrix con- 
structed by stacking up the top rows of the matrices appearing in the rank 
computations, with the understanding that leading zeros need to be prefixed to the 
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rows that have fewer than n entries. For example, the matrix A in Example (1) is 
row-equivalent to the echelon matrix 


3-8 7 
0 28 —8 D) 
0 O 312 


and the rows of this matrix constitute a basis of the row space of A. (Of course in 
this example A is nonsingular, so its own rows are also a basis for the row space.) 
Similarly, in Example (ii) the nonzero rows of the echelon matrix 


4 3. —5 6 
O -10 30 —28 
0 0 O 288 
0 0 0 0 


constitute a basis for the row space of A. In building up the echelon matrices in this 
way, one must be careful to reinsert any columns of zeros that were deleted during 
the rank computations. 

(b) Let A be an n Xn matrix. Each reduction from a k X k matrix to a 
(k — 1) X (k — 1) matrix requires 2(k — 1)? multiplications. Adding these numbers 
for k = 2,3,...,n, gives a total of (2n? — 3n* + n)/3 = 2n°/3 multiplications, 
which has the same order of magnitude as the number required to compute the 
determinant of A (see [1, pp. 479—480)]). 
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ELEMENTARY PROBLEMS 


FE 3295. Proposed by A. W. Goodman, University of South Florida, Tampa, FL. 


A function f on the open unit disc E = {z: |z| < 1} is said to be univalent if it is 
analytic on EF and takes no value more than once. A function f on E is said to be 
infinite-valent if it is analytic on EF and takes some value infinitely often. 

(a) Is there a univalent function f on E such that f’ is nonconstant and 
infinite-valent on FE? 

(b) Is there an infinite-valent function g on E such that g’ is univalent on E? 


E 3296. Proposed by Robert M. Young, Oberlin College, Oberlin, OH. 


Given four arbitrary points in the plane, describe a procedure for constructing 
the thinnest annulus containing all four points. (An infinite strip is regarded as an 
extreme case of an annulus.) The thickness of an annulus is the difference between 
the two radii (or the width of the infinite strip). 


E 3297. Proposed by Gunnar Blom, University of Lund and Lund Institute of 
Technology, Sweden. 


If m and k are positive integers and m + 2k = pr, where p > k is an odd prime 
and r is a positive integer, prove that 


k-1 . 

(m+ 

gmtl yy? (-1)/(" ’ = 2’"!(mod p). 
j=0 


953 
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E 3298. Proposed by Marvin Marcus and Claire Pesce, University of California, 
Santa Barbara, CA. 

If A and B are square matrices, we define the Kronecker product A ® B as the 
matrix 


a,,B ayB a,,B 
Se we ee oe ee ee eee eee , 
a,B a,2B a,,B 
where 
Qi, 42 Qiy 
Ax] cece ccc eee eceuen 
ani an2 ann 


Let A,, A,,..., A,, be square matrices over a field and let K = A, ® A, ® -:: @A,. 
Prove that K is lower triangular if and only if the following two conditions are 
satisfied: (a) A, is lower triangular; (b) if there exists p < m such that A,,, is not 
lower triangular but all of A,, A,,..., A, are lower triangular, then at least one of 
the matrices A,, A3,..., A p has zero main diagonal. 


E 3299. Proposed by Jihad Yamout, Ohio University, Athens, OH. 


_Suppose ABCD is a plane quadrilateral with no two sides parallel. Put E = ABO 
CD and F = AD aay Bc. If. M, N, P are the midpoints of AC, BD, EF, respectively, 
and AE = aAB, AF= bAD, where a and b are nonzero real numbers, prove that 
MP= abMN. 


E 3300. Proposed by H. Das Gupta, Kanchrapara, West Bengal, India. 


Let N be the set of natural numbers. Prove that there is exactly one mapping f 
from N X N to N having the following three properties for x, y € N: 
(i) f(x, y) = fy, x), 
(il) f(x, x) = x, 
(ili) (y — x) f(x, y) = f(x, y — x)if y > x. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Palindromes with Two Letters 


E 3156 [1986, 482]. Proposed by Raphael M. Robinson, University of California, 
Berkeley. 


Suppose that r,s, t are integers with r > 0, s > 0, t =r+-s > 2. Is there a word 
W of length ¢ in the alphabet {a, b} such that W = AB = Cab, where A, B,C are 
palindromés, and the lengths of A and B are rand s? Show that such a word W 
exists if and only if r + 2 is prime to s — 2, and that in this case it is unique. 


Solution by Allan Pedersen, Soborg, Denmark. Given a word VJ, let V denote the 
reverse of V, and let |V| denote the length of V. We need several statements about 
the solutions to word equations. 


LEMMA 1. Jf |A| =|B| + 1, the equation A = Ba has a unique solution in which A 
and B are palindromes, namely A = a'4| and B = a!®|, Also, if |A| =|B| + 2, the 
equation A = Bab has a solution in which A and B are palindromes if and only if |B| is 
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odd, in which case the solution is unique and is A =(ba)"4I-)/b and B= 
(ba) {3I-D/2p, 


Proof. We prove only the second claim, since the proof of the first is similar. The 
proof is by induction, with the claims holding by inspection for |B| < 1. Suppose 
|B] > 2. Since A and B are palindromes, we have Bab = A = baB. Hence A = 
baB’ab, where B’ = B’ and B = B’ab. Let A’ = B. Then we must solve A’ = B’ab, 
and the desired result follows from the induction hypothesis. 


LEMMA 2. The equation C = AabB has a solution in which A, B, C are palindromes 
and |A| = r, |B| = s if and only if gcd(r + 2, 5 + 2) = 1, in which case the solution is 
unique. 


Proof. Since C is a palindrome, we may assume r > s. If r = s there can be no 
solution, since ab # ba. If r=s +1, then A =A = DB; by Lemma 1, there is a 
unique solution. If s = 0, Lemma 1 shows there is a solution if and only if r is odd, 
in which case it is unique. For the remaining cases we have r—s > 2 and s > 0, 
and we proceed by induction on r + s, reducing to the cases above. 

Since C is a palindrome, reversing C and deleting |C| — |A| = s + 2 letters from 
the end yields A = BbaA’, with |A’|}=r—s—2>0. By substitution, C = 
BbaA’abB, so A’ = A’. Thus C = AabB has a solution if and only if A = BbaA’ has 
a solution in palindromes. In this equation we have |A|] =r, |B| =s and |A’| = 
r—s-—2. If s<r—s — 2 wecan reverse the equation. By induction, the equation 
has a solution if and only if gcd(s + 2, r —.s) = 1, which is true if and only if 
gcd(s + 2,r + 2) = 1. Furthermore, if the shorter equation has a solution, it is 
unique. Since it yields a solution to the longer equation in a unique way, solutions to 
the longer equation are also unique. Thus Lemma 2 is proved. 

Now we are ready to prove the claim of the problem. We proceed by induction 
on s. If s=0 or s = 1, then the claim (including the primality condition) follows 
from Lemma 1. If s = 2, then B = ab, which is not a palindrome. Hence there is no 
solution, corresponding to the fact that gcd(r + 2,0) =r + 2. If s = 3, then B = bab 
since B is a palindrome. Now Ab = C, and by Lemma 1 there is always a unique 
solution, corresponding to the fact that gcd(r + 2,1) = 1. Finally, suppose s > 4. 
Now B = baB’ab where B’ = B’. Deleting two letters from the end of the equated 
words, we have AbaB’ = C. By Lemma 2, we have a solution if and only if 
gcd(r + 2,5 — 44+ 2) = gcd(r + 2, s — 2) = 1, in which case the solution is unique. 


Editorial Comment. Most solutions were inductive. J. M. Cohen provided a 
solution using group theory, and R. B. Eggleton and P. J. Zwier provided solutions 
using graph theory. 


Also solved by I. C. Bivens, J. M. Cohen, R. B. Eggleton (Australia), J. Fukuta (Japan), O. P. Lossers 
(Fhe Netherlands), P. J. Zwier, and the proposer. 


Some Jordan Curves and Convex Curves 


E 3158 [1986, 482]. Proposed by I. J. Schoenberg, Madison, WI. 


Consider the image C, of the circle z = e“(0 < t <2) under the function 
f(z) = z+ az", where a is a real constant. Show that: 

(a) If n > 2, then C, is a closed Jordan curve if and only if |a| < 1/n. 

(b) If n > 2, then C, is a closed convex curve if and only if |a| < 1/n’. 
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Editorial Remark. The condition n > 2 was inadvertently omitted from (a) in the 
published statement of the problem. 


Solution by Allan Pedersen, Soborg, Denmark. (a) If C, has multiple points, 1.e., 
if there exists a pair of distinct numbers z and w of modulus 1 such that 
z+az"=w+ aw”, then 


—1l=aw" {1+ z/wt (z/w)? ++: +(z/w)"""}, 
IZj=|wJ=1, z#w (*) 
and, hence, 1 < |a|n. Suppose, conversely, that |a| > 1/n. Since |sin n@/sin @| 


ranges between 0 and n for real 0, we can find a number e = e7", 0 < @ < a, such 
that 


lteter? +--+ te” "|= |(1 — e”)/(1 — €)| = 1/al, le] = 1, e# 1. 
Next, determine w so that 
w'(1+ete7+--- +e" !) = -lL/a 


and choose z = ew. Then (*) will be fulfilled and hence z + az” =w+4+ aw”. 
(b) By part (a) we may assume that |a| < 1/n. The curve C, may be parametrized 
by 
x = cost + acosnt, y=smt+asinat. 


Since 
(x’)? + (y’)? =1 4 a?n? + 2ancos(n — 1)t > (1 — |a|n)’, 
we first consider the case |a| < 1/n. Now the curvature of C, at the point f(e’’) is 
given by 
xy" — y'x” 1 + a’*n? + an(n + 1)cos(n — 1)t 


\2 \2) 372 n\2 \2) 372 
{(x’)" + (v’)"} {(x’)" + (v’)"} 
The necessary and sufficient condition for C, to be convex is that the curvature 
K(t) does not change sign. But this is equivalent to the nonnegativity of 


min {1+ a*n? + an(n + 1)cos(n — 1)r} 


O0<t<2a 
= 1+ an? —|aln(n + 1) = (1 — nfal)(1 — n*Ia]), 
i.e., to the inequality |a| < 1/n*. In the remaining case a = +1/n, the curve C, is 
the (nm — 1)-cusped epicycloid 
| 1 . ° 1 . 
x = cost + —cosnt, y=sint+ —sin at, 
n n 


with cusps at the m — 1 points where cos(n — 1)t = +1; accordingly, convexity 
does not hold in this case. 


Editorial Comment. The proposer remarked that the condition that a be real is 
unnecessary, since, if, say, a/|a| = e"~)"*, then 


elf (z) = (ze") + Jal(ze"*) 
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Several readers noted that when |a| < 1/n, the curve C,, is actually starlike, 1.e., 
arg f(e'') increases with ¢. 

A necessary and sufficient condition for the convexity of the image of the unit 
circle z = e“(0 < t < 27) under the function f is that 1 + {Re zf’(z)/f(z)} > 0 
for all z with |z| = 1. [Cf. Problem 108 of Part III of Pélya-Szegé]. This inequality 
is equivalent to the nonnegativity of the curvature as used above. 

O. P. Lossers remarked that (a) occurs as Problem 239 in L. I. Volkovyskii, G. L. 
Lunts, and I. G. Aramanovich, A Collection of Problems in Complex Analysis, 
Pergamon Press, 1965. In that book it is mentioned that C,, is the trajectory of a 
point on a spoke of a circular wheel of radius 1/n which moves on the outside of a 
fixed circle of radius (n — 1)/n, where the point on the spoke is at distance |a| from 
the center of the wheel. This remark enables one to give a geometric argument for 
the result of (a). 


Solved also by I. C. Bivens, L. R. King, and B. G. Klein (jointly), M. R. Gopal, H. O. Kim (Korea), 
P, L. Hon and T. S. Kin (jointly, Hong Kong), W. A. Newcomb, E. S. Rosenthal, V. Schindler, and the 
proposer. 


Concurrent Conics 


E 3172 [1986, 733]. Proposed by Jordi Dou, Barcelona, Spain. 


Let A’, B’, C’ be the feet of the altitudes from the vertices A, B, C respectively 
of a triangle ABC. Let H be its orthocenter and M be an arbitrary point of the 
plane. Prove that the conics MABA’B’, MBCB’C’, MCAC'’A’, MHAC’'B’, 
MHAB'C’ and MHBC’A’ have a point other than M in common. 


Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The 
Netherlands. There are six lines in the configuration, viz. the three edges and the 
three altitudes of the triangle, which are pairwise intersecting in one of the given 
seven points A, B,C, A’, B’, C’, H. Introduce homogeneous, projective coordinates 
(x), X5, x3) such that A’ = (1,0,0), B’ = (0,1,0), C’ = (0,0,1) and A’ = (1,1, 1). 
Then A = (—1,1,1), B=, -1,1), C=(1,1, —1), and the six lines have the 
equations x,+x,=0, for 1 <i<j <3. Let M=(p,,M2,¢3) be a point not 
collinear with any pair of these seven points, ie. wj(m; + uM; — By) #0 for 
distinct i, j, k. 

Instead of investigating the six conics, we consider an arbitrary cubic through the 
seven points and M. Its general form is 


F(x, X2, x3) = Ayx,(x2 — x3) +°Apx2(x? — x?) + A3x3(x? — x2) =0, 


where the real numbers A,, A,, A; must be chosen so that F(p,, Ho, M3) = 0. 
Because F(my*, 7", 31) = —(Hy Mas M3)” FCM, M2, M3), we see that also N = 
(uz *,w>', wz") is on this cubic. In particular, this is true if the cubic degenerates 
into one of the six lines with its corresponding conic, so M and N are both common 
points of the six conics. In the original situation, M and WN are isogonally conjugate 
with respect to the triangle A’B’C’. 


Also solved by H. Demir and C. Tezer (Turkey), E. Morgantini (Italy), and the proposer. 
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Zero-free Multiples 


E 3196 [1987, 300]. Proposed by I. Martin Isaacs, University of Wisconsin, 
Madison, and Dave Witte, University of California, Berkeley. 


Prove that every natural number n not divisible by 10 has a multiple whose 
decimal representation involves no 0’s. (For example, 6144 is a multiple of 1024.) 


Solution by the University of South Alabama Problem Group. If (n,10) = 1, then 
10%) — 1 suffices, since 10°” = 1 mod n. Hence we may assume n is a multiple of 
2 or 5 (but not both). 

If m > 1, then there is an m-digit number x = x,,_,...Xq such that 2”|x and 
each x, is 1 or 2. To construct x inductively let x) = 2, and if x,,_,...X%9 = ¢2”, 
then let x,, be 1 if c is odd and 2 if c is even. This works because x,,X,,-1--- Xo = 
(c+ x,57)2™. A similar argument constructs an m-digit integer y=y,_1..- M% 
such that 5”|y and each y, € {1, 2,3, 4,5}. 

Now write n = p”a, where p-is 2 or 5 and (a,10) = 1. A standard pigeonhole 
argument using partial sums of 10°,10”,10?",... shows that a has a multiple of 
the form 10°" + 1067)" 4... +1061", which in turn yields R = 1+ 10” 
+--+. +10’" as a multiple of a. Lett M=R-(x,_|...X9) or M=R- 
(Yn-1 °°* Yo), depending on whether p is 2 or 5. Now M has digits in {1, 2, 3, 4,5} 
and is a multiple of both p” and a, so M is a multiple of zn. 


Editorial comment. Eugene Levine pointed out that the digits of the zero-free 
multiple can be restricted to any set of five digits that form a complete residue 
system mod 5 and include at least one even digit. 


Also solved by B. Gordon and J. M. Rojas (student), E. Levine, O. P. Lossers (The Netherlands), and 
the proposers. Partially solved by M. W. Ecker and R. A. Simon (Chile). 


Birth of a Regular Hexagon 


E 3198 [1987, 301]. Proposed by I. J. Schoenberg, Madison, Wisconsin. 


Let IT. = (X09, X1, X2, X3, X4, X5) = (x,), 0 < k < 5(mod6) be an arbitrary 
closed skew hexagon in R°. Form a new hexagon II y = (y,) by 


1 


Ve = Xt 5% — X43) = 7 Xk — 7 Xk-3° 


Finally, define a third hexagon II, = (w,) by 


Ww, = 3% + Xp_y + X p44). 


Show that the hexagon II,, lies in a plane @ and is in 7 an affine image of a regular 
hexagon. 


Solution by Jan van de Craats, Koninklijke Militaire Academie, Breda, The 
Netherlands. Since w, = ¢(2X,41 + 3x, + 2x,_; — xX,_3) for all k, we have 
Wra3 — Wy = 2(Wy42 — Wea 1) for all &k. Any hexagon satisfying this property is 
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planar, since 

(a) if three consecutive vertices coincide, then all vertices coincide, 

(b) if three consecutive vertices are collinear, then all vertices are collinear, and 

(c) if no three consecutive vertices are collinear, then all vertices lie in the plane 
determined by the first three. 

Finally, let IL, = (ao,..., a5) be an arbitrary regular hexagon. Then a(a,) = w, 
for i = 0,1,2 determines an affine map. Applying a, 3 — a, = 2(@,4. — 4,4 ) for 
k = 0,1,2 yields a(a;) = w, for i = 3,4,5, which completes the proof. 

Note that the proof generalizes to hexagons in affine spaces of dimension > 2 
over an arbitrary commutative field K with char(K) # 2,3. Note also that the 
degenerate cases may occur. If II, is the non-planar ((3, 0,0), (— 2, —2, 1), (0, 3, 0), 
(1, —2, —2), (0,0, 3), (—2,1, —2)), then each vertex of II, is (0,0,0); if Il, = 
((0, 0,0), (1,0, 0), (0,1, 0), (0,0, 0), (1,0,0), (0,1,1)), then II, = (2, 2,2), (2, 2,0), 
(2,2,— 1), (2,2,0), (2,2, 2), (2,2,3)). Further material on affinely regular polygons 
appears in J. van de Craats and J. Simonis, Affinely regular polygons, Nieuw Archief 
voor Wiskunde IV, 4(1986) 225-240. 


Editorial comment. There are several approaches to this problem. O. P. Lossers 
computed v, = Ww, — (2x;)/6 = (X,21 + 2X, + Xpa1 T Xe42 — 2X e4+3 — Xe4+4)/6, 
from which v, = —0v,,, and v, + 0,45 + 0,44 = 0 follow immediately and imply 
that II, is the plane affine image of a regular hexagon. The proposer began by 
expressing the x, as the values of a fifth-degree polynomial at the sixth roots of 
unity. 

Also solved by P. L. Hon (Hong Kong), O. P. Lossers (The Netherlands), W. A. Newcomb, V. C. and 
D. J. Williams, and the proposer. 


A Superlogarithmic Density 


E 3200 [1987, 301]. Proposed by Paul Erdés, Hungarian Academy of Science, 
Budapest. 


Suppose a,, a,,... are real numbers such that 0 < a, <a, < ---,a,— ©, 
and(a, + a, + --: +a,)/a, > 0. IfN(x) = 2X, <,1, prove that N(x)/log x — 00 
aS X > 00. 

Solution I by Bernt Lindstrém, University of Stockholm, Sweden. Put S, = a, + a, 
+--+ +a,. Since log x < x — 1 for x > 1, we have 

log S, — log S,_1 < (S, — S,-1)/S,-1 <@ 
when n > N = N(e), and log S, — log Sy < e(n — N) when n > N. Hence S, < ce*” 
fot n> N, where c = c(e). Given x > ce®%, determine n_ such 
that ce" <x < ce™"*), Then n> (1/e)log(x/c) —1 and N(x) > 2, since 
Q,,@,...,a, <S,< x. Therefore N(x)/logx > 1/(2e) when x > X(e). Hence 
N(x)/log x > 00 as x > 00. 


Solution II and generalization by Amram Meir, University of Alberta, Edmonton. 
Let N(t) be nondecreasing on [0, oo), and let 


A(x) = ~ fran). 
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We claim 


lim, ,A(x) < lim 
To prove (1), we have 


dx = fran f’S = Pavol - “) =N(y)-A(y). (2) 


If A(x) >A for all x > xo, then {jA(x) dx/x > Xr log(y/x,) for y > xo. By (2), 
this yields N(y) > A log y + O(1) as y > oo. This proves the left-most inequality 
in (1). To prove the right-most inequality, we may assume that lim A(x) < oo. If 
A(x) < p for all x > x,, then (2) implies N(y) > p log( y/x,) + O(1) as y > ©, 
which completes the proof. 

To obtain the proposed result as a special case, set N(t) = k for a, <t<a,,, 
and k = 0,1,.... Then A(t) = (a, + a, +--+ +a,)/t. The condition of the prob- 
lem is that A(a,) — oo. Since’ 


Aa, +a,t+:': ta, 


vy A(x) 
J 


0 Xx 


A(a,4,)-—1= 
AK+1 
a, tc: +a a,+::: ta 
< A(t) = ~—___—* S ='_____—. = A(a,), 
t a, 


it follows that A(t) — oo. Using (1) we find that N(x)/log x —> oo. 


Editorial Comment. Grahame Bennett noted that the result holds without the 
hypothesis that the sequence is increasing, as long as a, — oo and 
(a, ta,+-+: +4a,)/a, > ©. 

Solved also by M. A. Arcones (Spain), G. Bennett, J. Ferrer (Spain), E. Hertz, K.-W. Lau (Hong 


Kong), O. P. Lossers (The Netherlands), L. E. Mattics, A. Riese, J. M. Rojas (student), J. H. Steelman, 
P. Tracy, and the proposer. 


Polynomials Whose Values Are Divisible by m But Whose Coefficients Are Not 


E 3203 [1987, 372] Proposed by Hugh Montgomery, University of Michigan, Ann 
Arbor. 


Let m and n be positive integers. Show that the following two assertions are 
equivalent. 
(i) Thére exists a polynomial 


f(x) = » a,x" 
k=0 


with integral coefficients such that gcd(ao, a,,..., a,, m) = 1 and f(/) is divisible 
by m for all integers /. 
(11) m|n!. 


Solution I by L. E. Mattics, University of South Alabama. Let Af(x) = f(x + 1) 
— f(x). If m|f(/) for all integers 7, then m|A‘f(/) for all nonnegative integers i 
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and all integers j. Suppose p is prime and p*|m for some a > 0, and let gq be the 
largest integer such that gcd(a,, p) = 1. Then p® divides 


(0) = ¥ (-"(2)/0 
= ELE Corel) 


_ n n-1 tee 
= q\(Sta, +S" a,_, +--+ +Sia,), 


where S/ are Stirling numbers of the second kind. We conclude that p*|q!, because 
the second factor in the above product is relatively prime to p. Since gq < n, p°|n!. 
Hence m|n!. 

To prove the converse, let f(x) = x(x + 1)---(x +n -— 1). It is well known 
that n!| f(7) for all integers i. Thus if m|n!, then m|f(7) for all 7. 


Solution II by O. P.’ Lossers, Eindhoven University of Technology, The Nether- 
lands. To see that (i1) implies (i), let m|n!. Then the polynomial f(x) = [17_,(x + g) 
has the desired properties, since it is monic and | f(/)| for integer j is either 0 or n! 
times a binomial coefficient. 

To prove that (1) implies (ii), let f(x) be as in assertion (i). Then f(x) can be 
expressed as 


n 


f(x) =bo+ Db -d) 


for a unique sequence of integers b,, k = 0,1,...,n. It can easily be seen that 
gcd(by,... 5,) = gcd(ay,...a,). That f(j) =0 (mod m) for the integers 7 = 
0,1,..., implies that 


jib, = O(mod m) for j = 0,1,..., 2. (+) 

Since gcd(bo,..., b,, m) = gcd(ao,..., a,, m) = 1, there exist integers Cy,...C,, D 
such that 1” _9C;b; + Dm = 1. Hence 

dL Cb,n! + Dmn! = n!. (* * ) 


j=0 
By (*), the left-hand side of (* *) is a multiple of m, hence so is the right-hand 
side; that is, m|n!. 


Editorial comment. The proposer notes that the equivalence of (i) and (ii) was 
established by G. Polya in his article Uber ganzwertige Polynome in algebraischen 
Zahlkdrpern, J. Reine Angew. Math., 149 (1919) 97-116. 


Also solved by M. Chellali (France), C. B. Khare (India), J. H. Steelman, and the proposer. 


A Formula for Power Sums 


E 3204 [1987, 372]. Proposed by Ira Gessel, Brandeis University. 
Let S.(n) = L7_)i”. Prove that 
S,,(n) = (2n + 1)p,(n(n+1))  (r>1), 
Sopri(m) =4,(n(n+1)) (r= 0), 
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where the polynomials p, and q, are given by the generating functions 


sinh(xv1 + 4¢ /2) 


_ | — x 
Wieasanxr 2° YP Oye 
cosh(xv1 + 4¢ /2) — cosh(x/2) _ yore 
— Qsinh(x/2) halt) (Qr +1)!" 


Cf. A. W. F. Edwards, A quick route to sums of powers, Amer. Math. Monthly, 
93 (1986) 451-455. 


Solution by the University of Squth Alabama Problem Group, Mobile, AL. Let 
t = n(n + 1) and replace x by ix in the generating functions above to obtain 


sata + ef _ ne * 4 y (-1)'@n + 1)p,(n(n + eat 
ee eee - ¥ (=1)"an(n + own yr 


Replace the factors involving sin and cos by their exponential expressions; this 
enables the left sides of the two equations above to be rewritten as (1/2) + 
v7 cos kx and L7_,sin kx, respectively. Use the power series expansions of cos kx 
and sin kx in these expressions and interchange the order of summation. The 
formulas for S. then appear by equating corresponding coefficients with the gener- 
ating functions above. 


Editorial comment. The proposer used the approach of forming the exponential 
generating function 


F(x) =3,S.(n)x’/r! =1+e%+--- +e™ 
_— [e(rtt/2)x _ e*/?) /[er/? _ e7*/?], 


The identities and substitutions used above then yield the desired formulas for the 
coefficients S.(n). The proposer also noted that differentiation with respect to f 


yields g/(t) = (27 + 1)p,(t). 


Also solved by K. Dilcher (Canada), W. Janous (Austria), W. A. Newcomb, A. Pedersen (Denmark), 
H.-F. Yeung (Australia), and the proposer. 


A Summation-Product Identity 


E 3211-{1987, 457]. Proposed by Laszlo Toth; Satu Mare, Romania. 


Let w(k) denote the number of distinct prime factors of the positive integer k 
and let (i, n) denote the greatest common divisor of the positive integers i and n. 
Express 

n 
y Joi, 2) 
i=1 


in terms of the prime factorization of n. 
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Solution by C. Georghiou, University of Patras, Greece. Denote the given sum by 
a(n); we show that a(n) =nll,,(1 + 1/p). Grouping the terms of the sum 
according to the value of (i, n), we obtain the convolution 


a(n) = )i2 ye 1 = 22° (n/a), 
d|n l<k<n/d d|n 
(k,n/d)=1 


where ¢ is the totient function. Since 2°”) and $(n) are multiplicative arithmetic 
functions, a(n) is also multiplicative. (Cf. Niven and Zuckerman, An Introduction 
to the Theory of Numbers, §4.4.) Therefore, it suffices to evaluate a( p”), which is 


p™(1 + 1/p). 


Editorial comment. Walther Janous and Hubert Kiechle both commented that 
replacing 2 by an arbitrary complex number z in the definition of a leads to 


a(n) = nfI1,),,(1 + (2 — 1)/P). 


Also solved by the proposer and 21 other readers. 


ADVANCED PROBLEMS 


6586. Proposed by Charles S. Allen, Drury College, Springfield, MO. 


In an infinite-dimensional normed linear space does there exist a convex set 
which does not contain a ray but whose closure does contain a ray? 


6587. Proposed by Ilan Vardi, Stanford University, CA. 


(a) The Kloosterman sum S(m, n, c) is defined for positive integers m,n, c by 


S(m, n, c) -_ y" e27i(mktnk)/e 
k=1 
where the asterisk indicates that the summation is restricted to integers k relatively 
prime to c and k is defined by kk = 1(mod c). If c is an odd prime, compute the 
determinant of the c — 1 by c — 1 matrix such that the element in the jth row and 
kth column is S(j, k,c) forl <j,k <c-— 1. 
(b) Generalize Part (a) to composite c and the more general Kloosterman sum 


S(m, n,X, c) _ y" x(k) e27ink tnk)/e 
k=1 


where x is a Dirichlet character modulo c, i.e., a map from the integers to the 
cofnplex numbers such that 

x(1) = 1, x(mn) =x(m)x(n), x(m +c) =x(m),x(m) =0 if (mc) >1 
for all m,n. 


6588. Proposed by Walther Janous, Ursulinengymnasium, Innsbruck, Austria. 


As in E 3246 put J(n) = n(n), where ¢ denotes the arithmetic function of 
Euler. For positive real x let F(x) denote the number of positive integers n for 
which y(n) < x. Prove that lim, _,.. F(x)/x exists and find its value. 


x CO 
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Applied Differential Geometry. By William L. Burke, Cambridge University Press, 
1985. xvii + 414 pp. 


DEANE YANG 
Department of Mathematics, Rice University, Houston, TX 77251 


Ideally, differential forms should be introduced as a natural part of multivariable 
calculus. One book that does this nicely is [E]. I hope that in time virtually all 
advanced calculus texts will adopt this approach. For the moment most of them do 
not, creating the need for books such as Applied Differential Geometry, which 
introduces the foundations of modern differential geometry, including manifolds, 
tangent and cotangent bundles, and differential forms. Most such texts, even those 
directed towards non-mathematicians, introduce differential forms by first discuss- 
ing exterior algebra and the definition of a manifold and its tangent bundle. I dislike 
this approach, because the student is forced to absorb a lot of abstract material 
without any idea what it is good for. I would prefer a more heuristic approach at the 
start, something like the following: 

I will assume that you are familiar with the multivariable calculus. First, I will 
introduce differential forms over a domain D C R”. Later, when we understand 
everything a little better, we will define what a manifold is and extend the ideas to 
the more general setting. 

You have already encountered differential forms back in basic integral calculus. 
Recall that the integrand of a single variable integral is not simply a function f(x) 
but f(x) dx, where the symbol dx indicates the variable of integration. The symbol 
dx is often called a differential, but no precise definition of what it is is ever given in 
an elementary course. One simply learns how to use it to change variables in the 
integral. There is only one rule: if y is a function of x, then 

dy 
dy ” dx. 
In fact, differentials can be given a precise mathematical definition, which we will 
indicate later. For now, however, let’s just view them as symbols which obey certain 
rules. 

Motivated by the notation of a differential as described above, we extend them to 

the general notion of a 1-form on a domain D C R” to be an expression of the form 


f(x) dx! + +++ +f,(x) dx", 


where x = (x',...,x”) are the standard coordinates of R” and f,,..., f, are 
smooth functions on D. Given 1-forms w = f, dx' + --- +f, dx” and 6 = g, dx 
+--+ +g, dx” and a smooth function »(x), we can add w and @ to get 


wtO0=(f,+38,) a+ “+ +(f, + g,) dx", 


964 
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and multiply ¢ times w to get 
pu = of, dx' + --- +f, dx”. 
Given a function f(x), we define the differential of f to be the 1-form 


af af 


df = —~dx' +--+ + dx". 1 
T= Ga & axn () 
Observe that the chain rule implies that this definition is consistent under a change 
of independent variables, since if we set x' = x'(u), where u = (u’,...,u"), and 
view dx' as the differential of the function x’, we obtain 
of Of 
= ——dx'+---4+ dx” 
df axl ax ay x 
n Of ax’ 
~ Dae Ou 
j, j= OX' Ou 
) Of 
Ou Ou” 


As observed before, the integrand of a 1-variable integral is a 1-form on an 
interval in R. The rules above and the change of variables theorem conspire to keep 
the value of the integral invariant under a change of independent variable. More 
generally, the integrand of a line integral along a curve in R” is a 1-form on R”. 
Given the 1-form w above and any curve y: [a, b] ~ R”, define the integral of w 
along y to be 


b dx} rad 
fo= POY Z + +H) S a 


where y(t) = (x(£),..., x"(t)). Observe that this definition is consistent with the 
definition of a 1-form on R, where x,..., x” are no longer viewed as independent 
variables but as functions of the new independent variable ¢. The value of this 
integral is invariant not only under changing the coordinate ¢ but also under 
changing the coordinates x',..., x”. 

In the traditional view of things, the ease of changing the variables is lost for a 
multiple integral. The differentials which still appear in the integrand no longer 
seem to help in computing the formula, which involves the determinant of the 
Jacobian of the old coordinates viewed as functions of the new. By introducing the 
notions of p-forms, p a positive integer, and of exterior multiplication, we can 
change variables of a multiple integral very easily without memorizing any com- 
plicated formulas. I make no claim that what follows is an obvious way to proceed. 
On the other hand, I do believe that it is better to introduce the basics of exterior 
algebra in a familiar context, where the student sees its utility, rather than in an 
abstract setting. 

The first hint of what to do is that the integrand of a multiple integral is always 
of the form f(x) dx' --- dx", where each dx' appears once and only once. Let’s 
change variables, setting x’ = x‘(u), where u’,...,u” are the new coordinates. 
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Proceeding formally, we replace each dx' by 


Ox! a Ax! 

_—___ rr 

au du” 
and multiply them all together. Yet somehow, when we do this, we should end up 
with 


ax! ; 
det} —— | du’ --- du”. 
du! 

First, given a positive integer k, we define a k-form on D to be an expression of 

the form 
y a,,...;, ax dx +++ dx'k, 
1°2 k 
l<ip< ++: <i,<n 

where a, ...;, is a smooth function of x € D. Such an expression looks just like a 
homogeneous polynomial in the “variables” dx',..., dx" of degree k, except that 
each variable appears only once in each term. Observe that due to this restriction, 
there are no k-forms for k > n. Also, an n-form is always of the form 


a(x) dx' --- dx”. 


For convenience we shall also view functions as 0-forms. 

Next, we want to say how to multiply differential forms together. The usual 
polynomial multiplication doesn’t give us what we want. Define a new type of 
multiplication, called exterior multiplication, which is still associative and distribu- 
tive (with respect to addition) but is no longer commutative. Given a p-form and a 
g-form, obtain a (p+ q)-form by multiplying formally and by rearranging the 
order of the factors in each term using the following rules: Given a 0-form f and 
1-forms 6, w, 

(1) 0-forms commute with everything; 

(2) dw = —w. 

Observe that (2) implies that 0? = 0-6 = —@-6=0. In particular, terms which 
contain more than one copy of dx‘, for some i, vanish. The rules (1) and (2) imply 
the more general property that given a p-form {2 and a q-form 9, 


20 = (-1)"70Q. 
The reason that differential forms are so useful for integration is given by the 
following: 
LEMMA. Given functions f',..., f" on a domain D C R", 


0 
df! df? .-- df"= sel fa’ dx? +++ dx". 


Ox! 
Without computing the left-hand side in detail, it is clear that 


df! --- df" = (polynomial in 0f'/dx/) dx! --+ dx". 


It is not hard to see that (2) implies that the polynomial is an alternating multilinear 
function of the rows of the Jacobian matrix and that it is equal to 1 when the 
Jacobian is the identity matrix. It follows that it is equal to the determinant. 
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Therefore, if we are given an integral 


[far + a” 


and want to change coordinates from (y',..., y”) to new coordinates (x!,..., x”), 
we obtain the correct integrand by simply viewing y!,..., y” as functions of x and 
applying the rules of exterior multiplication. 

There is, however, one not-so-slight catch. Since the change of variables formula 
uses the determinant of the Jacobian and not its absolute value, the integrals in this 
paper are oriented integrals over oriented domains. In practice, what this means is 
that when changing variables, one should always order the new coordinates so that 
the determinant of the Jacobian is positive. 

So far, it appears that the only forms on R” of real interest are 0-, 1-, and 
n-forms. On the other hand, the integrand of a surface integral is a 2-form on R° 
and in general, the integrand of an integral over a k-dimensional “surface” in R” is 
a differential k-form. 

After this, we should define the exterior derivative of a differential form and 
observe that, like the differential of a function, it 1s well-defined and independent of 
the choice of coordinates. Also, in R°*, the exterior derivative of 0-, 1-, and 2-forms 
encodes the familiar formulas for gradient, curl, and divergence. Finally, one recalls 
Stoke’s theorem and observes that it has a simple and elegant formulation using 
differential forms. 

By now, we hope that the student is convinced of the usefulness of this notation. 
It is time to give a precise mathematical definition of differential forms and give it a 
geometric interpretation. 

Begin with 1-forms on D C R”. The idea is to view dx',..., dx” as basis vectors 
of some vector space, call it V. A 1-form w = f; dx’ + --- +f, dx” is then a map w: 
D — V. We prefer to set T*D = D X V and view w as a map 


Ww: D— T*D 
x (x, fi(x) dxl + +++ +f,(x) dx”), 


Now what happens when we change coordinates? Let the new coordinates be 
u=(u',...,u”) and suppose that x' = x'(u). Then for each x € D, we get a new 
basis of V, denoted du',..., du”, which is related to the old basis dx',..., dx” by 
the equation 


i Ox! ap Ox? 
i— + eee + 
* du: 4 Ou” 


du”. 


Thus, for each choice of coordinates on D, we get a new basis of {x} x V. Using 
this observation, we want to identify 7*D a little better. 
To do this, we look at a different vector space. Let D be as before, let T be 


another n-dimensional vector space with basis 7,,...,7,, and let 7,D = D xX T. 
Abstractly, we want to view {x} X T as the vector space of all possible tangent 
vectors to curves passing through x. The coordinates x’,..., x” induce a natural 


basis of T by specifying that 7, is the tangent vector to the curve obtained by fixing 
x/, 7 # i, and setting x’ = ¢. 


968 DEANE YANG [December 


Let y: [—«e, e] — D bea differentiable curve. Then the derivative of y determines 
a map 


Yx: [—e,e] > T,D 
tr (y(t), y’(4)), 
where the tangent vector to y at y(t‘) is 
dx! dx” 


“(t)=——7aAt+-- + T° 
y’( ) dt 1 dt n 
As before, a new set of coordinates u',..., uw” induce a new basis of T, call it 
[1,-+++, Bb, Lhe two bases are related by the chain rule: 
' dx} dx” 
, t =-—>T + coe -+- T 
"( dt * dt" 
n Qx* dui dx” du' 
= — —-_T . + -~—— 7 
12, Ou’ dt : du’ dt " 
> du’ dx/ du" ax/ 
=) ——1 - +$——7, 
ja Ht Out? dt du” / 
From this we see that 
n Ox/ 
= —7T:. 
Lh; j=l Ou J 


The set 7,D is called the tangent bundle of D and an element (x,v) € 7,D 
consists of a point x and a tangent or velocity vector sitting at x. 
We can now say what V is. Note that the change of basis formula relating 


dx',..., dx" to du',..., du” is exactly the same formula for the change of basis for 
the dual bases to 7,,...,7, and py,...,p,. In other words, using the coordinates 
x',...,x", we can identify V = T* by viewing dx' as the linear function on T 


satisfying dx'(1;) = §,,. The formulas above then imply that this identification is 
fixed under changes of coordinates. Therefore, the space of 1-forms is, in some 
sense, naturally dual to the space of tangent vectors. For this reason, 7*D is called 
the cotangent bundle of D. 

What about k-forms in general? They are, roughly speaking, dual to the space of 
k-planes in 7, D, but we will not try to be more precise about this. 

Observe that the object with natural geometric meaning is the tangent bundle, 
since it is easy to draw pictures of velocity vectors and sums of such vectors. It is 
not so easy to draw corresponding pictures of-1-forms. Differential forms should 
not be viewed as natural geometric objects, meaning that one should not try too 
hard to visualize what they are. Rather they are natural linear algebraic objects 
obtained from the tangent space. For reasons we have no space to go into, it turns 
out that differential forms are much easier to manipulate than vector fields and 
therein lies their utility. 

By now the idea of studying calculus using a coordinate-free approach is well 
established. The definitions of a manifold, its tangent and cotangent bundles, 
differential forms, exterior derivative, and integration now follow easily. Of course, 
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a quick course on exterior algebra should be included somewhere in all of this, now 
that the student is better motivated to learn it. 

And what of the book Applied Differential Geometry? The emphasis is on 
explaining the ideas and the applications of the mathematics and not on a rigorous 
definition-theorem-proof exposition. I like that. I do, however, find the book too 
vague and sketchy for my taste. Pictures and heuristic discussions are essential, but 
they cannot replace precise definitions and theorems. The book does not contain 
much mathematics or physics. The author’s mistake, I think, was to include material 
on too many different topics. He would have done better to cover only a few of 
them in more depth. He mentions that he taught an entire graduate course on 
electromagnetism using differential forms. This should be a much more effective 
way of introducing the notion to physics students. 

The author draws a lot of pictures of what the mathematical notation represents. 
Although many of them are very instructive, most of the explanations are inade- 
quate. Also, he goes too far, I think, in trying to draw a picture of everything. It is 
not hard to draw pictures-of curves, surfaces, tangent vectors, and tangent planes. 
Adding vectors pictorially is easy. The same cannot be said for cotangent vectors or 
differential forms. Nevertheless, the author includes drawings which “explain” what 
differential forms are, how to add two k-forms, how the Hodge star operator works, 
etc. 

As a professional differential geometer, I use differential forms all the time and 
must be able to visualize what my calculations and formulas mean. This does not 
mean visualizing differential forms themselves. The formulas and calculations are 
meant to represent geometric assertions involving manifolds and their tangent 
bundles. That is what I draw pictures of. What is important to me is the ability to 
translate with ease between equations and computations involving differential forms 
and the equivalent formulation using tangent vectors and submanifolds. 

Books such as Applied Differential Geometry or [F] concentrate on the basics of 
differential forms and give a few simple applications in physics. Differential forms, 
however, are a powerful tool in pure mathematics, particularly geometry and 
topology. Elementary expositions can be found in [GP] and [O]. For advanced 
applications in topology, try [BT]; in differential geometry, [BCG] and [C]; and in 
algebraic geometry, [GH]. 
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A Topological Picturebook. By George K. Francis. Springer-Verlag, New York, 
1987. xv + 194 pp. 


JEFFREY R. WEEKS 
Mathematics and Computer Science Department, Ithaca College, Ithaca, NY 14850 


Perspective drawing is a great introduction to projective geometry. The experi- 
ence gained in tackling the concrete problems of perspective is not only valuable in 
itself, but leads students to a deeper appreciation of the abstract mathematics which 
follows. While some problems of perspective are easy enough to build confidence in 
the weaker students, others are quite challenging. Consider, for example, the 
problem of drawing railroad tracks stretching out across a prairie to the horizon. 
Drawing the rails is easy enough, but where is the vanishing point for the ties? And 
how should the ties be spaced? FIGURE 1, from the book under review, illustrates a 
method for spacing the ties. Pick any convenient focus point F distinct from the 
vanishing point V and draw the line FV. (Figure 1 illustrates two possible choices of 
F. In one case the line FV happens to coincide with the horizon. In the other it 
happens to be perpendicular to the horizon.) Pick an origin O and a unit point U on 
one of the rails. The distance OU is the unit relative to which other distances along 
the rail will be measured. Draw any line r parallel to FV, and project O and U 
through F to r. The line r now serves as a perspective ruler for measuring distances 
along the rail. In particular, a sequence of equally spaced points on the ruler 
projects back through the focus F to a correct perspective view of equally spaced 
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points on the rail. This solves the problem of how to space the railroad ties. The 
justification of the method starts the student on the road to an understanding and 
appreciation of the abstract and powerful ideas of projective geometry. (The 
method’s validity follows as a corollary of the projective invariance of the cross 
ratio. Yet proofs involving no mathematics beyond high school geometry are also 
possible. The reader is challenged to find one.) 

The preceding example illustrates how the concrete problems of perspective 
drawing prepare a student for the abstractions of projective geometry. The general- 
ization of this pedagogical principle pervades every aspect of the book under review: 
specifically, A Topological Picturebook is based on the principle that an understand- 
ing and appreciation of concrete examples enhances one’s understanding and 
appreciation of the abstract theorems they illustrate. The book’s very existence 
embodies the principle. Its stated purposes are “to encourage mathematicians to 
illustrate their work, and to help artists to understand the abstract ideas expressed 
by such drawings.” In other words, it is a book which presents examples and 
encourages others to do so. Its internal structure also reflects the principle. The 
author skillfully presents examples of a technique before discussing the technique 
explicitly. For example, the book opens with a delightful discussion of how to draw 
a saddle in a box and in a drum (the box and the drum are not part of the final 
drawings— they serve only to bound the region of space to which the drawing of the 
saddle is confined). Only after presenting these examples does the author explicitly 
remark that drawing a surface clipped to the boundaries of a familiar region (in this 
case a cube or cylinder) is more effective than drawing it in reference to coordinate 
axes, because in everyday life people orient themselves relative to boundaries such 
as the walls of a room, not relative to thin rods with tick marks. The reader readily 
appreciates the author’s remark because he has already seen the method in action. 

This same principle, that an appreciation of nontrivial, concrete examples should 
precede the introduction of more general, abstract theorems, deserves wider applica- 
tion in the classroom, particularly in our upper-level courses. Students should begin 
a new topic by tackling some problems and working with some examples. The 
problems and examples must be diverse and nontrivial to prepare the students for 
what follows. For instance, students who have seen only the integers and the 
integers mod 7 will not appreciate the need for an abstract definition of a group. 
Once the students are comfortable with the examples, then it’s time for a theorem. 
As far as possible let the students themselves search for the correct statement. How 
might we prove the theorem? Again, let the students discuss their own ideas. There 
is no need to say whether their original guess for the statement of the theorem was 
correct or not; let them work it out themselves. Their extensive experience with 
examples and problems will help them sort the grain from the chaff. The final step, 
once’ the theorem and the outline of a proof are understood, is rigor. Details are 
important: they come last because only a student who is comfortable with the big 
picture can fully appreciate them. 

As an example of the above idea, consider the teaching of topology. When I was 
an undergraduate my first topology course contained nothing but point-set topol- 
ogy. It was interesting stuff, but it dwelt upon transfinite sets, infinite products, and 
so on, and never got as far as what I considered real topology. My conception of 
real topology was pretty dim, but I felt it should have something to do with twisted 
surfaces and other interesting shapes. (I now know that what I was looking for is 
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called geometric topology, but unfortunately it wasn’t part of the curriculum.) My 
second course in topology began with a tedious exercise in subdividing simplices, 
followed by the definition—in one fell swoop—of an infinite series of groups. In 
retrospect it’s clear that the instructor was defining the homology groups, but at the 
time I could find no meaning in it at all and I dropped the course. The problem was 
not that the homology groups are intrinsically uninteresting or that the instructor’s 
presentation was unclear; the problem was that I was intellectually unprepared to 
appreciate them. I would have learned much more had I begun with some concrete 
examples and problems. I would have loved to have gotten my hands on some 
interesting manifolds. Eventually the subtler notions would arise. For example, the 
surface obtained by identifying opposite sides of a square is somehow the same as 
the surface obtained by identifying opposite sides of a hexagon. In what sense are 
they “the same’? Here’s a great place to talk about homeomorphism! Furthermore, 
if different constructions can yield the same surface, how can we ever be really sure 
that two surfaces or manifolds are different? We could look at the properties of 
loops contained in the manifolds. On a sphere every loop can be shrunk to a point, 
but on a torus some cannot; this observation provides an informal proof that the 
sphere and the torus are not homeomorphic. Students who have internalized these 
ideas—by seeing examples and working problems—are ready to appreciate the 
homology and homotopy groups. 

I should emphasize that I do not advocate less abstraction or less rigor. To the 
contrary, ’m disturbed by the watering down of our upper-level courses. ’m 
suggesting that more hands-on experience with examples and problems prepares 
students for the abstraction and the rigor. 

I'd now like to comment on A Topological Picturebook itself. 

The organization of the book is imaginative and effective. The drawing lessons 
are nominally to be found in the first three chapters (“Descriptive Topology,” 
“Methods and Media,” and “Pictures in Perspective”). The five remaining chapters 

e “picture stories”: each contains a series of pictures, accompanied by some 
explanatory text, which leads the reader to an understanding of some especially 
delightful examples from twentieth-century topology, such as sphere eversions or the 
fibration of the figure-eight knot complement. The division between the methods 
chapters and the picture story chapters is not strict. The methods chapters use a 
picture story format in which each section is a mini-picture-story illustrating the 
technical point under consideration. And the picture story chapters are sprinkled 
with hints on making the drawings, each appearing in a natural context. 

The book’s two purposes, as previously stated, are “to encourage mathematicians 
to illustrate their work and to help artists to understand the abstract ideas expressed 
by such drawings.” The book succeeds nicely at its first purpose. By carefully 
interweaying the drawing lessons with concrete and interesting examples, it lets 
mathematicians learn about drawing while reading about math. Unfortunately the 
book fails at its second purpose. It is too difficult and too incomplete for a 
nonmathematician. For example, Boy’s surface plays a central role in the chapter on 
Shadows from Higher Dimension, yet the reader is referred elsewhere for a picture 
of Boy’s surface and an introduction to its topology. 

The book achieves a third, unstated purpose which is every bit as valuable as the 
first. The picture stories present a few of the most appealing examples in topology, 
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and are well worth reading for their own sake. (The incompleteness mentioned in 
the preceding paragraph will occasionally annoy even a professional mathematician, 
but fortunately the author did his homework and provided complete bibliographic 
references.) All too often the examples which motivate an author’s thinking get lost 
amongst the formalism when he goes to publish his work. It’s nice to see these 
examples taking their place in the literature, for our pleasure as well as our 
understanding of topology. 
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General, L. What Game Is That? Kenneth 
Watson-Jones. Vantage Pr, 1987, xiii + 58 pp, $8.95. 
[ISBN: 0-533-06966-1] For high school or college stu- 
dents. Forty-seven puzzles, problems, and games of 
the brain-teaser kind. Almost all will be known to 
aficionados but amateurs will find challenges. Nicely 
organized. Reviewer’s copy was missing figure 40. 
Answer to number 12 is not complete. Statement 
of and answer to number 16 are wrong. Some nice 
philosophical touches in Foreward, Preface, and text 
itself. JK 


Elementary, S*(13-14). An Easy Course in 
Using the HP-28C. John W. Loux, Chris Coffin. 
Grapevine, 1987, 233 pp, $22 (P). (ISBN: 0-931011- 
17-5] A very gentle introduction to the HP-28C calcu- 
lator. The book is a programmed learner with prob- 
lems interspersed within explanations. Topics cov- 
ered include menues, editing, stack operations, post- 
fix notation, and programming. This book would be 
ideal for a high school or undergraduate mathematics 
major who desires a working knowledge of HP-28C 
fundamentals. AM 

Elementary, T(13). Intermediate Algebra, Fourth 
Edition. M.A. Munem, W. Tschirhart. Worth, 
1988, xii + 612 pp, $28.95. [ISBN: 0-87901-377-X] 
Many excellent problems. Problem sets for each sec- 
tion/chapter are graded for difficult. Examples in 
text are clear (without assumptions) and easy to 
tread. Page layout and type choices highlight im- 
portant information. Includes discussions of systems 
of linear equations, sequences, series, and geometry. 
Highlights calculator usage. Good section on trans- 
lating English expressions to mathematical expres- 
sions. (First Edition, TR, February 1973.) LB-E 


Mathematics Appreciation, T(13). Preregui- 
site Algebra. Charles P. McKeague. Harcourt Brace 
Jovanovich, 1988, xv + 640 pp, $26. [ISBN: 0-15- 
571093-1] Author’s stated goal is to make it possible 
in an intermediate algebra course to actually cover 
the latter topics (conic sections, logarithms, polyno- 
mial functions) by greatly condensing the review of 


elementary algebra. AWR 


Precalculus, T(13: 1). Precalculus. Jerome E. 
Kaufmann. Prindle, Weber & Schmidt, 1988, xv + 
569 pp, $29. [ISBN: 0-534-92007-1] Solid, complete- 
as-need-be, no-frills treatment of precalculus. Em- 
phasis on problem solving, graphing techniques, and 
notion of function, all with an eye on the calculus. 
Plenty of worked-out examples, graded exercise sets, 
and study aids. Nicely laid out and printed, but not 
all figures ring true, especially parabolas and hyper- 
bolas and cycloids (ordinary, hypo-, epi-). The cur- 
tate cycloidin Figure 8.24 is just plain wrong. Right- 
fully, the calculator is treated as the tool it is. JK 


Precalculus, T(13). Algebra with Trigonometry 
for College Students. Charles P. McKeague. Har- 
court Brace Jovanovich, 1988, xvii + 700 pp, $26. 
[ISBN: 0-15-502120-6] A well-written text covering 
algebra, conic sections, relations and functions, log- 
arithms, and trigonometry. The chapters are orga- 
nized into sections which can be presented in a 45-50 
minute class lecture. Each section contains numer- 
ous exercises, including some application problems. 
In addition, each chapter ends with a chapter sum- 
mary and a chapter test. Requires only a beginning 
algebra course as a prerequisite. RH 


Precalculus, T(13). Trigonometry. John Baley, 
Martin Holstege. Random House, 1987, ix + 452 pp, 
$18. [ISBN: 0-394-35461-3] Definitely conversational 
in tone, including nurherous inserts on using various 
calculators and little conversations by one who asks 
questions and one who answers them. Includes com- 
plex numbers and polar coordinates. AWR 


Finite Mathematics, T(13). Applied Mathemat- 
ics for Business, Economics, and the Social Sciences, 
Third Edition. Frank S. Budnick. McGraw-Hill, 
1988, xxi + 1041 pp, $39.95. [ISBN: 0-07-008876-4] 
A text suitable for a two-term course combining finite 
mathematics with calculus. Intended for students of 
business, economics, and, to a lesser extent, social 
sciences. Prerequisite: high school algebra. Many 
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applications and examples, 3500 exercises and helpful 
algebra reminders at strategic locations throughout 
the text. Answers given for odd-numbered problems. 
(First Edition, TR, November 1979.) SM 


Education, T(14-16). Mathematics for Elemen- 
tary Teachers: A Contemporary Approach. Gary L. 
Musser, William F. Burger. Macmillan, 1988, xix + 
874 pp. [ISBN: 0-02-384760-3] Development parallels 
typical sequence of topics in elementary school cur- 
riculum. Heavy emphasis on problem-solving strate- 
gies and many good problems, but claim to inte- 
grated problem-solving approach is belied by Course 
Options suggestion to de-emphasize or omit problem 
solving if time is short. Use of hands-on activities 
and bibliographies for each topic are confined to sup- 
plementary Student Resource Handbook. MW 


Education, S*(15-16). Middle Grades Mathe- 
matics Project. Addison-Wesley, 1986, (P). Mouse 
and Elephant: Measuring Growth. Janet Shroyer, 
William Fitzgerald, 128 pp [ISBN: 0-201-21474-1]; 
Factors and Multiples. William Fitzgerald, et al, 152 
pp (ISBN: 0-201-21475-X]; Similarity and Equivalent 
Fractions. Glenda Lappan, et al, 166 pp (ISBN: 0- 
201-21476-8]; Spatial Visualization. Mary Jean Win- 
ter, et al, 150 pp (ISBN: 0-201-21477-6]; Probability. 
Elizabeth Phillips, et al, 191 pp. [ISBN: 0-201-21478- 
4] Each book is a 2-3 mathematics unit for grades 5-8, 
emphasizing problem solving approach, use of con- 
crete manipulatives, and carefully integrated and se- 
quenced lessons. Detailed lesson plans include work- 
sheets, transparency masters, and scripts for teacher 
talks and action. Three phases of instruction: launch 
challenge, class exploration, summarize. Both ele- 
mentary and secondary methods students should see 
these. MW 


Education, S(15-18). Readings for Enrichment in 
Secondary School Mathematics. Ed: Max A. So- 
bel. NCTM, 1988, v + 258 pp, $10 (P). [ISBN: 0- 
87353-252-X] The classics from two previous NCTM 
publications: eight articles from the Twenty-eighth 
Yearbook and five from Topics for Mathematics 
Clubs. Also eight reprinted from the Mathemat- 
ics Teacher and three original articles. Many up- 
dated bibliographies. Suitable for independent study 
by talented students or as the focus of enrichment 
lessons. Excellent resource should be on the sup- 
plementary list for pre-service and in-service teach- 
ers. MW 


Education. Mirrors of Minds: Patterns of Ezpe- 
rience in Educational Computing. Ed: Roy D. Pea, 
Karen Sheingold. Ablex, 1987, xvii + 329 pp, $42.50; 
$19.95 (P). [ISBN: 0-89391-422-3] Collection of pa- 
pers from Bank Street College Center for Children 
and Technology. Highlights learner and educational 
process, not machines, as starting points for stud- 
ies of the role of computer technology in education. 
Examines the potential impact of computer-based 
technologies on educational innovation, the cognitive 
outcomes of learning LOGO programming, and the 
issues involved in designing software to promote crit- 
ical inquiry. MW 
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Education, P. Mathematics Tests Available in the 
United States and Canada. James S. Braswell, Alicia 
A. Dodd. NCTM, 1988, v + 34 pp, $3.50 (P). [ISBN: 
0-87353-253-8] A catalogue of standardized tests, ar- 
ranged by grade level and subject, with information 
on publisher, purpose, published descriptions, and 
other relevant details. Includes directory of publish- 
ers and distributors, and a bibliography of reviews 
and descriptions. LAS 


History, L*. Izratl M. Gelfand: Collected Papers, 
Volume I, Ed: S.G. Gindikin, et al. Springer-Verlag, 
1987, vi + 883 pp, $149.50. [ISBN: 0-387-13619-3] 
First of three volumes of Gelfand’s 450 research pa- 
pers, from Banach algebras (normed rings) to cell 
biology. This volume contains survey papers by and 
about Gelfand; papers on Banach algebras, differen- 
tial equations, and mathematical physics; a complete 
bibliography; and tentative contents of subsequent 
volumes. LAS 


History, P*, L*. The Probabilistic Revolution. Ed: 
Lorenz Kriiger, et al. MIT Pr, 1987, $60 set. Volume 
1: Ideas in History, xv + 449 pp (ISBN: 0-262-11118- 
7]; Volume 2: Ideas in the Sciences, xvii + 459 pp. 
[ISBN: 0-262-11119-5] 35 essays arising from a year- 
long study undertaken by an international interdis- 
ciplinary research group convened to investigate the 
historical roots of the revolution that supplanted de- 
terminism with probabilistic models as the paradigm 
of physical and social science. Volume 1 deals prin- 
cipally with the nineteenth century; Volume 2 with 
applications to twentieth century science (psychol- 
ogy, economics, biology, physics). LAS 


History, T(13-16), S, L. The Historical Roots of 
Elementary Mathematics. Lucas N.H. Bunt, Phillip 
S. Jones, Jack D. Bedient. Dover, 1988, xiii + 299 
pp, $7.95 (P). [ISBN: 0-486-25563-8] Unabridged re- 
publication of a book published by Prentice-Hall in 
1976 (TR, January 1977). Authors’ corrections have 
been included in this edition. RH 


History, L*. The Collected Papers of Albert Ein- 
stein, Volume 1: The Early Years, 1879-1902. Ed: 
John Stachel. Princeton U Pr, 1987, lxvi + 433 pp, 
$52.50 [ISBN: 0-691-08407-6]; English Translation. 
Transl: Anna Beck. Princeton U Pr, 1987, xxii + 
196 pp, $22.50 (P). [ISBN: 0-691-08475-0] First of 
an anticipated thirty volumes constituting the defini- 
tive, comprehensive edition of all Einstein papers, 
letters, and related documents. This volume contains 
letters, school notebooks, exams, and related mate- 
tials from Einstein’s birth until his appointment as 
a patent clerk. Includes 50 recently discovered let- 
ters between Einstein and Mileva Maric¢, his fellow 
student at ETH and first wife, and a brief account 
of his childhood written in 1924 by his sister Maja 
Winteler-Einstein. Each manuscript is untranslated, 
typeset in a manner that is faithful to the original 
German; all editor’s notes are in English. The ac- 
companying paperback contains typewritten English 
translations of all original documents in the official 
edition. LAS 


History, L. Memoir of the Life and Labours of the 
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Late Charles Babbage. H.W. Buxton. Ed: Anthony 
Hyman. Reprint Ser. for the History of Comput., 
V. 13. MIT Pr, 1988, xxi + 401 pp, $50. [ISBN: 0- 
262-02269-9] Thirteenth in a series on the history of 
computing. Buxton was a barrister (and self-taught 
in mathematics) whom Babbage entrusted to write 
his memoirs. The manuscript was not published in 
its time and remained in storage until recently. This 
edition provides a view of Babbage’s broad range of 
interests and his own thoughts, as conveyed to Bux- 
ton, on his Analytical Engines. MR 


History, P, L**. Archimedes. E.J. Dijksterhuis. 
Transl: C. Dikshoorn. Princeton U Pr, 1987, 457 
pp, $15 (P); $50. [ISBN: 0-691-02400-6; 0-691- 
08421-1] A classic study now augmented with Wilbur 
R. Knorr’s 23-page bibliographic essay “Archimedes 
after Dijksterhuis: A Guide to Recent Studies.” 
Knorr’s essay ends with a provocative postscript on 
“Who was Archimedes?” Knorr provides over 200 
references to recent (several as late as 1987) studies 
on or related to Archimedes. The’ paperback edition 
is a real bargain. JK 

History, T(15), S*, L*. Jdnos Bolyai Appendiz: 
The Theory of Space. Ed: Ferenc Karteszi. Math. 
Stud., V. 138. North-Holland (US Distr: Else- 
vier Science), 1987, 238 pp, $100 (P). [ISBN: 0-444- 
86528-4] Revision and translation of a memorial vol- 
ume first published in 1952. Bolyai’s Appendiz— 
containing his ideas of non-Euclidean geometry— 
appears here in its original Latin as well as in trans- 
lation. As the Appendiz is extremely terse, editor 
Karteszi has included a chapter of “explanatory, sup- 
plementary, orientative remarks.” In addition to 
this, Karteszi brackets the Appendiz with chapters 
on the evolution of the concept of space and on fur- 
ther evolution of Bolyai’s ideas. Beautifully written, 
this book illuminates the drama surrounding Janos 
Bolyai’s life and work. LW 


History, P*, L***. Pascal’s Arithmetical Triangle. 
A.W.F. Edwards. Oxford U Pr, 1987, xii + 174 pp, 
$37.50. [ISBN: 0-85264-283-0] A comprehensive his- 
tory of Pascal’s triangle, beginning with its distinct 
roots in recursion (the ‘figurate’ triangle), in combi- 
nations (the ‘combinatoric’ triangle), and in algebra 
(the ‘binomial’ triangle). Edwards, a reader in math- 
ematical biology at Cambridge, identifies particular 
interpretations of early examples (Greek, Chinese, 
Hindu, etc.), and then shows how Pascal and his 
successors (Wallis, Newton, Bernoulli) linked diverse 
interpretations into a powerful tool. LAS 


Logic, P. Methods and Applications of Mathematical 


Logic. Ed: Walter A. Carnielli, Luiz Paulo de Alcan- — 


tara. Contemp. Math., V. 69. AMS, 1988, xi + 250 
pp, $28 (P). [ISBN: 0-8218-5076-8] Proceedings of 
the Seventh Latin American Symposium on Mathe- 
matical Logic held July 29-August 2, 1985. Sixteen 
research papers on set theory, algebraic logic, philo- 
sophical aspects of logic, and interactions between 
logic, mathematics, and computer science. Includes 
summary of work of A.I. Arruda. KS 


Foundations, T*(14: 1), S*, L. Foundations of 
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Higher Mathematics. Peter Fletcher, C. Wayne 
Patty. Ser. in Math. Prindle, Weber & Schmidt, 
1987, xiii + 263 pp, $28.50. [ISBN: 0-87150-164- 
3] Introduces basic techniques of writing proofs and 
fundamental ideas in the foundations of mathemat- 
ics, logic, sets, induction, relations, functions, num- 
ber systems, cardinality, and introductory algebra 
and analysis. Readable, with lots of worked exam- 
ples, exercises, and illustrations. CEC 


Graph Theory, P. Recent Results in the Theory of 
Graph Spectra. Dragos M. Cvetkovic, et al. Annals 
of Disc. Math., V. 36. North-Holland (US Distr: 
Elsevier Science), 1988, xi + 306 pp, $97.25. (ISBN: 
0-444-70361-6] Results in the theory of graph spectra 
which appeared since 1978. Topics include graphs 
with least eigenvalue -2, distance regular graphs, 
graph polynomials, and spectra of infinite graphs. 
Most theorems stated with no proof, but over 700 
references given. LC 


Combinatorics, S(13-16), L**. Challenging 
Mathematical Programs with Elementary Solutions. 
A.M. Yaglom, I.M. Yaglom. Transl: James Mc- 
Cawley, Jr. Dover, 1987, $5.95 each (P). Volume I: 
Combinatorial Analysis and Probability Theory, ix + 
231 pp [ISBN: 0-486-65536-9]; Volume IJ: Problems 
From Various Branches of Mathematics, ix + 214 pp. 
[ISBN: 0-486-65537-7] Republication of the trans- 
lation of Non-Elementary Problems in an Elemen- 
tary Ezposition from the Russian. Volume J contains 
100 problems with hints and detailed solutions deal- 
ing with combinatorial analysis (prerequisite: stan- 
dard high school mathematics); problems concerning 
other areas are in Volume IJ. For problem-solving 
seminars, math clubs, etc., at the level of late high 
school/early college. RB 


Combinatorics, P. Combinatorics '86. Ed: A. 
Barlotti, M. Marchi, G. Tallini. Annals of Disc. 
Math., V. 37. North-Holland (US Distr: Elsevier 
Science), 1988, xvi + 502 pp, $126.25 (P). [ISBN: 
0-444-70369-1] This volume contains the proceedings 
of the conference held at the Centro di cultura of 
the Universita Cattolica of Milano, in Passo della 
Mendola from June 30 to July 5, 1986. The focus 
is on combinatorial and incidence geometry and its 
links with geometry, graph theory, algebraic struc- 
tures, and applications to coding theory and com- 
puter science. CEC 


Discrete Mathematics, T(14-15: 1, 2), L. Dis- 
crete Mathematics. Kenneth P. Bogart. DC Heath, 
1988, xvii + 834 pp, $29. [ISBN: 0-669-08665-7] 
More demanding of the student and more compre- 
hensive than most of the new books in discrete math- 
ematics, there is ample material for a year’s course. 
In addition to more details on the normal topics, at- 
tention is paid to logical quantifiers, growth rates, 
matrix algebra, probability, and abstract algebra, in- 
cluding coding theory. Good selection of exercises 
with answers and solutions, suggested readings, ap- 
pendix on generating functions. JS 


Discrete Mathematics, T(13-14: 1, 2). Intro- 
duction to Discrete Mathematics. James Bradley. 


978 


Addison-Wesley, 1988, xiv + 685 pp, $40.95. (ISBN: 
0-201-10628-0] Covers standard topics with algo- 
rithms as the unifying theme. Over 40 algorithms 
in Pascal pseudo-code are presented, with complex- 
ity discussed in many of them. Other topics include 
elementary number theory, probability models, and 
nearly 50 pages on elementary matrix theory (Gaus- 
sian elimination, inverses). LC 


Number Theory, P. Lecture Notes in Mathemat- 
tcs-1290: Diophantine Approzimation and Transcen- 
dence Theory. Ed: G. Wistholz. Springer-Verlag, 
1987, 243 pp, $21.20 (P). [ISBN: 0-387-18597-6] For- 
mulated in part during a 1985 workshop at Bonn, 
this volume contains articles on the Hasse principle 
for cubic surfaces, heights on Abelian varieties, large 
transcendence degree, linear forms in logarithms, and 
the Thue-Mahler equation. GG 


Number Theory, T(18: 1), S, P. Lectures on 
a Method in the Theory of Exponential Sums. M. 
Jutila. Springer-Verlag, 1987, viii + 131 pp, $12 
(P). [ISBN: 0-387-18366-3] Self-contained treatment 
of exponential sums arising in the study of the Rie- 
mann zeta function and, more generally, of Dirich- 
let series arising from modular forms. The first two 
of the book’s four chapters cover summation formu- 
las and exponential integrals. These are applied in 
the final two chapters to obtain general and specific 
transformation formulas, including many new results 
or new proofs. GG 


Number Theory, T(18: 1), S, P. Hilbert Modu- 
lar Surfaces. Gerard van der Geer. Ergebnisse der 
Math. und ihrer Grenzgebiete, B. 16. Springer- 
Verlag, 1988, ix + 291 pp, $69.50. [ISBN: 0-387- 
17601-2] To an ever-increasing extent, solutions to 
Diophantine problems rely on the power of algebraic 
geometry. Many times there is an explicit connection 
with automorphic forms. There are also conjectures 
relating automorphic forms to prime ideal decompo- 
sitions in number fields. A Hilbert modular surface 
is a quotient of the product of two upper half planes 
by a discrete group of two-by-two matrices over a 
real quadratic field. The book studies the geome- 
try of these surfaces and the automorphic forms on 
them. GG 


Number Theory, S(15-17), P, L**. Elementary 
Theory of Numbers. W. Sierpinski. Math. Lib., 
V. 31. North-Holland (US Distr: Elsevier Science), 
1988, xii + 513 pp, $105.25. [ISBN: 0-444-86662-0] 
Few changes, other than references, to the 1964 origi- 
nal in this second edition, edited by A. Schinzel. Sim- 
ilar to The Theory of Numbers by Hardy and Wright, 
this book considers primarily Diophantine and divis- 
ibility problems whose solutions lie within the ring of 
integers. A small sample of the range: Pell’s equa- 
tion, distribution of primes, congruences, Mersenne 
and Fermat numbers, and Waring’s problem. GG 


Number Theory, L**. The Book of Prime Num- 
ber Records. Paulo Ribenboim. Springer-Verlag, 
1988, xiii + 476 pp, $49.80. [ISBN: 0-387-96573- 
4] An interesting and truly delightful treatment of 
prime numbers. In spite of the title, this is much 
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more than a book of records. It contains up-to-date 
historical presentations of many important problems 
concerning prime numbers, including some proofs. 
Several topics are covered, for example the distribu- 
tion of primes and primality testing. Of course, the 
relevant records are also included. Written in a very 
readable manner which should be accessible and in- 
teresting to both specialists and nonspecialists. RH 


Number Theory, T*(14-16: 1), S, L*. Elemen- 
tary Number Theory and -Its Applications, Second 
Edition. Kenneth H. Rosen. Addison-Wesley, 1988, 
xiii + 466 pp, $37.95. [ISBN: 0-201-11958-7] New 
to this edition (First Edition, TR, January 1986) 
are sections on basic axioms for integers and mathe- 
matical induction, more routine exercises, simplified 
proofs, earlier introduction of Fibonacci numbers, ex- 
panded coverage of big-0 notation, sections on sum 
of squares and on factoring using continued fractions, 
recent discoveries of large primes, and biographies of 
mathematicians. RH 


Number Theory, T(17-18: 1), S, P, L. An Intro- 
duction to the Theory of the Riemann Zeta-Function. 
S.J. Patterson. Stud. in Adv. Math., V. 14. Cam- 
bridge U Pr, 1988, xiii + 156 pp, $34.50. [ISBN: 
0-521-33535-3] The Riemann zeta-function is a mero- 
morphic function which can be studied through com- 
plex analysis, but on the other hand yields interest- 
ing results concerning prime numbers. This book 
presents a classical introduction to the Riemann zeta- 
function, including an historical introduction and a 
lengthy discussion of the Riemann Hypothesis. Con- 
tains many exercises and includes appendices which 
summarize some of the background mathematics 
needed. RH 


Linear Algebra, T*(16-17: 1), L. A Second 
Course in Linear Algebra. William C. Brown. Wi- 
ley, 1988, x + 264 pp, $34.95. [ISBN: 0-471-62602-3] 
Preparation for graduate work in algebra, analysis, 
topology, applied mathematics. Presumes the usual 
one-semester undergraduate course. Studies famil- 
iar topics more deeply, then proceeds to multilinear 
algebra, canonical forms of matrices, normed linear 
spaces, inner-product spaces. Deliberately formal; 
excellent problem sets. DFA 


Linear Algebra, T(14: 1). Linear Algebra with 
Applications. Charles G. Cullen. Scott Fores- 
man, 1988, 393 pp, $26.50. [ISBN: 0-673-18570-2] 
Covers standard topics with emphasis on computa- 
tional aspects. Applications include linear program- 
ming, graph theory, least square approximations, and 
quadratic forms. LC 


Linear Algebra, T*(14: 1). A Plaid Linear Alge- 
bra. Robert Messer (Dept. of Math., Albion Coll., 
Albion, MI 49224), 1988, viii + 189 pp, $22 (P). An 
interesting textbook emphasizing general theory over 
computations. Covers the basics together with some 
applications. Specifically designed to help students 
develop the mathematical maturity needed for sub- 
sequent mathematics courses. AO 


Linear Algebra, T(14-15: 1). Linear Algebra. 
Carroll Wilde. Addison-Wesley, 1987, xvii + 492 


1988] 


pp, $39.75. [ISBN: 0-201-13089-0] Quite similar in 
content and organization to other recent linear alge- 
bra texts; begins with linear systems, matrix algebra, 
and determinants before considering vector spaces 
and transformations. Several distinguishing features 
include good graphics, review checklists, and use of 


the computer. Applications are treated in some de- 
tail. JS 


Linear Algebra, T(14: 1). Elementary Linear Al- 
gebra. Roland E. Larson, Bruce H. Edwards. DC 
Heath, 1988, xiii + 533 pp, $29. [ISBN: 0-669-14583- 
1] Chapters on systems of linear equations, matrices, 
and determinants precede coverage of vector spaces 
and linear transformations. Applications in areas 
such as computer graphics, cryptography, differen- 
tial equations, etc., are included in each chapter. A 
final chapter covers numerical methods. A diskette of 
software operating on an IBM-PC is available. JNC 


Group Theory, P. Whitehead Groups of Finite 
Groups. Robert Oliver. Math. Soc. Lect. Note 
Ser., V. 132. Cambridge U Pr, 1988, 349 pp, $34.50 
(P). [ISBN: 0-521-33646-5] The intent of the au- 
thor is to draw together a variety of results scattered 
through the literature and dealing with recent work 
on Whitehead groups. Part I is general theory, Part 
II treats group rings on p-groups, and Part III con- 
siders general finite groups. Bibliography, index. JS 


Group Theory, P. The Arcata Conference on Rep- 
resentations of Finite Groups. Ed: Paul Fong. Proc. 
of Symposia in Pure Math., V. 47. AMS, 1987, $122 
set (ISBN: 0-8218-1479-6]. Part 1, xi + 487 pp; Part 
2, vill + 552 pp. A collection of 29 expository pa- 
pers and 69 research papers presented at the Summer 
Research Institute held at Humboldt State Univer- 
sity in Arcata, California, July 7-25, 1986. Covers 
advances made in the representation theory of finite 
groups since the 1979 Santa Cruz Summer Institute 
on finite groups. RH 

Algebra, P. Commutative Algebra and Combina- 
torics. Ed: M. Nagata, H. Matsumura. Adv. Stud. 
in Pure Math., V. 11. North-Holland (US Distr: El- 
sevier Science), 1987, 359 pp, $163.25. [ISBN: 0-444- 
70314-4] A collection of 24 papers in those aspects 
of commutative algebra which are particularly use- 
ful in combinatorics. The papers cover such topics 
as Cohen- Macaulay rings, representations of classical 
groups, and non-commutative invariant theory. SG 


Algebra, T(17-18: 1), P. Model Theory and Mod- 
ules. Mike Prest. Math. Soc. Lect. Note Ser., V. 
130. Cambridge U Pr, 1988, xviii + 380 pp, $37.50 
(P). [ISBN: 0-521-34833-1] A monograph on the in- 


teraction between the two mathematical fields, con-_ 


cerned with applications of model theory to the study 
of modules (e.g., relationship between pp-types and 
pure-injective modulvs) and with use of modules as 
examples in stability theory. The book is intended 
to be accessible by graduate students and established 
workers in algebra or logic. RB 


Algebra, P. Triangulated Categories in the Rep- 
resentation Theory of Finite Dimensional Algebras. 
Dieter Happel. Math. Soc. Lect. Note Ser., V. 
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119. Cambridge U Pr, 1988, ix + 208 pp, $24.95 (P). 
[ISBN: 0-521-33922-7] Describes how triangulated 
categories can be used in studying modules over finite 
dimensional K-algebras, where K is a field which is 
usually assumed to be closed. Also shows that when 
recent methods of representation theory are applied, 


certain triangulated categories become more accessi- 
ble. RH 


Algebra, T(18: 2). Real Reductive Groups I. Nolan 
R. Wallach. Pure & Appl. Math., V. 132. Academic 
Pr, 1988, xix + 412 pp, $59.95. [ISBN: 0-12-732960- 
9] The representation theory of real reductive groups 
is almost completely the product of Harish-Chandra. 
The author attempts to present the algebraic aspects 
of the theory in this volume and the analytic aspects 
in the next. The heart of this volume is the theory of 
the real Jacquet module and the asymptotic behav- 
ior of matrix coefficients. Suitable for an advanced 
graduate course. No exercises. MR 


Algebra, T(16-17: 2), P. Modular Lie Algebras 
and Their Representations. Helmut Strade, Rolf 
Farnsteiner. Pure & Appl. Math., V. 116. Marcel 
Dekker, 1988, viii + 301 pp, $79.75. [ISBN: 0-8247- 
7594-5] Modular Lie algebras are Lie algebras over 
fields of positive characteristics. Although the last 
several decades have seen “vigorous development” 
in the theory of modular Lie algebras, these alge- 
bras have yet to be completely classified. This book 
first sketches the classical theory of Lie algebras over 
fields of characteristic 0, and then proceeds to in- 
troduce the structure and representation theory of 
modular Lie algebras. The only prerequisite for the 
first part of the book is a knowledge of linear algebra; 
for the chapters addressing representation theory, the 
reader needs some familiarity with the techniques 
from the theory of associative algebras. Although 
written quite formally, the book includes exercises 
and could be used as the text for a course. LW 


Calculus. Calculus with Analytic Geometry, Fourth 
Edition. Earl W. Swokowski. Prindle, Weber & 
Schmidt, 1988, xii + 914 pp, $39.50. [ISBN: 0- 
87150-007-8] Intended for a two-term freshman cal- 
culus course. Covers standard topics. Many exer- 
cises, both standard plug-n-chug and story problems. 
Applications from a wide variety of fields. Answers 
to odd numbered problems. Well illustrated. All 
major theorems proved. (First Edition, TR, Decem- 
ber 1975; Second Edition, TR, June-July 1979; Third 
Edition, TR, November 1984.) SM 


Calculus, T(13: 1). Brief Calculus for Manage- 
ment and the Life and Social Sciences. Donald L. 
Stancl, Mildred L. Stancl. Richard D Irwin, 1988, 
xv + 505 pp, $33.95. [ISBN: 0-256-06303-6] Use of 
functions to construct models, and calculus to ana- 
lyze models in management, economics, and sciences. 
Heavy emphasis on drawing and interpreting graphs 
of functions. Derivation, integration, exponential 
and logarithmic functions, and functions of several 
variables. Separate computer manual with IBM PC 
disk in Basic illustrating the material is available. MS 


Calculus, T. Calculus for Management and the Life 


980 


and Social Sciences. Donald L. Stancl, Mildred L. 
Stancl. Richard D Irwin, 1988, xv + 684 pp, $35.95. 
[ISBN: 0-256-05714-1] Presents basic concepts and 
techniques of calculus with an emphasis on the use of 
functions to construct models in management, eco- 
nomics, and the sciences. A companion computer 
manual containing an IBM PC formatted disk with 
programs illustrating text material is available. JNC 
Real Analysis, T*(16-17: 2), P, L*. Real Vari- 
ables. Alberto Torchinsky. Addison-Wesley, 1988, 
xii + 403 pp, $43.25. [ISBN: 0-201-15675-x] Covers 
standard topics on Lebesque measure and integration 
(in R™) as well as some of the basics of functional 
analysis. Assumes “a good advanced calculus back- 
ground as well as an elementary knowledge of the 
theory of metric spaces.” Very well motivated with 
extensive set of problems at end of each chapter. BH 


Complex Analysis, P. Lecture Notes in Mathemat- 
tcs-1295: Séminaire d’Analyse, P. Lelong—P. Dol- 
beault—H. Skoda. Ed: P. Lelong, P. Dolbeault, H. 
Skoda. Springer-Verlag, 1987, vii + 283 pp, $25.80 
(P). [ISBN: 0-387-18691-3] Collection of 11 articles 
in several complex variables (in French). Sequel to 
Lecture Notes in Mathematics 1028, 1198. BH 


Complex Analysis, P. Branched Coverings and Al- 
gebraic Functions. Makoto Namba. Pitman Res. 
Notes in Math. Ser., V. 161. Longman Scientific & 
Technical (US Distr: Wiley), 1987, 201 pp, $49.95 
(P). [ISBN: 0-470-20949-6] Covers theory of finite 
branched coverings of projective complex manifolds 
in connection with the theory of algebraic functions 
of several complex variables. Two main problems are 
discussed. First, give conditions for the existence of 
a Galois (Abelian) covering which branches at a divi- 
sor D, and second, describe the set of all finite Galois 
(Abelian) covering which branch at most at D. The 
first of these is Fenchel’s problem, and the second is 
class field theoretic and can be traced back to Hilbert 
and Weil. MR 


Differential Equations, P. Fuchstan Differential 
Equations: With Special Emphasis on the Gauss- 
Schwarz Theory. Masaaki Yoshida. Aspects of 
Math., E11. Freidr Vieweg & Sohn, 1987, xiv + 215 
pp, (P). [ISBN: 3-528-08971-7] A study in linear 
ordinary differential equations and systems of par- 
tial differential equations with finite dimensional so- 
lution spaces. Concentrates on Gauss-Schwarz the- 
ory of the hypergeometric differential equation, in 
particular as part of the theory of orbifold. The main 
question is how to constuct a differential equation (in 
several variables) with a given set of singularities. A 
restricted answer is given. MR 


Differential Equations, P. Oscillation Theory of 
Differential Equations with Deviating Arguments. 
G.S. Ladde, V. Lakshmikantham, B.G. Zhang. Pure 
& Appl. Math., V. 110. Marcel Dekker, 1987, vi 
+ 308 pp, $89.75. [ISBN: 0-8247-7738-7] An ordi- 
nary differential equation with deviating arguments 
(ODEWDA) is one in which the solution depends 
on the past history and not just on the current 
state; [e.g., 2y(t) + y(t) — y(t — w) = 0,2 > 0, has 
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y(t) = 1—sin?t as a solution]. There are six chapters: 
preliminaries, first order linear, first order nonlinear, 
second order, higher order, and systems of differen- 
tial equations; over 300 references are listed. It is 
“the first English language book that offers a system- 
atic study of the theory of oscillation of ODEWDA.” 
RSF 


Partial Differential Equations, P. First Interna- 
tional Symposium on Domain Decomposition Meth- 
ods for Partial Differential Equations. Ed: Roland 
Glowinski, et al. SIAM, 1988, x + 431 pp. [ISBN: 
0-89871-220-3] Proceedings of a symposium in Paris, 
January 1987. Includes 22 papers, and the list of 
contributors. Topics include elliptic problems in 
two- and three-dimensions, mixing finite elements 
and finite differences, preconditioning, fluid dynam- 
ics, structural analysis, incompressible fluid flows, 
vortex subdomains, and transonic flows. RSF 


Partial Differential Equations, T(16-17: 1), L. 
Numerical Solution of Partial Differential Equations 
by the Finite Element Method. Claes Johnson. Cam- 
bridge U Pr, 1987, 278 pp, $69.50; $24.95 (P). [ISBN: 
0-521-345-146; 0-521-347-580] An introduction to the 
finite element method. Covers elliptic, parabolic, and 
hyperbolic equations. Also includes a chapter on fi- 
nite element methods for integral equations and an 
introduction to some nonlinear problems. The ma- 
terial concerning parabolic and hyperbolic problems 
contains some recent developments which have not 
appeared in earlier texts. Includes appropriate exer- 
cises. Prerequisites are advanced calculus and linear 
algebra. RH 


Partial Differential Equations, P. Lecture Notes 
in Mathematics-1297: Numerical Methods for Par- 
tial Differential Equations. Ed: Zhu You-lan, Guo 
Ben-yu. Springer-Verlag, 1987, xi + 244 pp, $25.80 
(P). [ISBN: 0-387-18730-8] Proceedings of a confer- 
ence held in Shanghai, March 1987. Includes sixteen 
papers (in English) of the 75 papers presented (all 
75 are listed by author). Topics include Hamiltonian 
equations, Signorini problems, nonlinear wave equa- 
tions on n-dimensional spheres, parallel algorithms 
and domain decomposition, degenerate second order 
hyperbolic equations, error expansions for finite ele- 
ment approximations, the MKDV equation, nonlin- 
ear fourth order equations, parabolic systems, dis- 
continuous vortical flows, folds of degree four and 
swallowtail catastrophe, viscous splitting in bounded 
domains, inverse scattering, and quasilinear hyper- 
bolic initial-boundary-value problems. RSF 


Partial Differential Equations, P. Sensors and 
Controls in the Analysis of Distributed Systems. A. 
El Jai, A.J. Pritchard. Math. & Its Applic. Halsted 
Pr, 1988, 125 pp, $64.95. [ISBN: 0-470-21023-0] A 
theoretical study of the structure of input and out- 
put operators (i.e., controls and sensors) in infinite- 
dimensional distributed parameter systems (example 
application: heat flow in a furnace). The study, rig- 
orously approached via semigroups, is limited to sys- 
tems described by parabolic partial differential equa- 
tions; a final chapter considers hyperbolic and non- 


1988] 


linear systems. RB 


Numerical Analysis, P. Advanced Boundary Ele- 
ment Methods. Ed: T.A. Cruse. Intern. Union of 
Theoret. & Appl. Mech. Springer-Verlag, 1988, xix 
+ 489 pp, $83.70. [ISBN: 0-387-17454-0] Fifty-two 
papers from a symposium on boundary (as opposed 
to finite) element methods. Diverse mechanical ap- 
plications are considered. BC 


Numerical Analysis, P. Numerical Methods. Ed: 
D. Greenspan, P. Rézsa. North-Holland (US Distr: 
Elsevier Science), 1988, 688 pp, $223.75. [ISBN: 
0-444-70209-1] Proceedings of the Conference on 
Numerical Methods, organized by the Janos Bolyai 
Mathematical Society, held at the campus of the 
Technical University for Heavy Industry in Miskolc, 
Hungary in 1986. Includes 5 invited papers and 36 
refereed papers from the conference arranged into 
the following sections: linear algebra, differential 
equations, applications, nonlinear equations and op- 
timization, further topics in numerical mathematics, 
and program packages. RH 

Functional Analysis, P. Aspects of Positivity in 
Functional Analysis. R. Nagel, U. Schlotterbeck, 
M.P.H. Wolff. Math. Stud., V. 122. North- 
Holland (US Distr: Elsevier Science), 1986, xiii + 
277 pp, $40.75 (P). [ISBN: 0-444-87959-5] Proceed- 
ings of a conference in honor of H.H. Schaefer held 
at Tubingen in 1985. The general theme is positivity 
in functional analysis, with eleven invited addresses 
and thirteen shorter papers. List of participants. JS 


Functional Analysis, T(15: 1). Metric Spaces. 
E.T. Copson. Tracts in Math., No. 57. Cambridge 
U Pr, 1988, vii + 143 pp, $12.95 (P). [ISBN: 0-521- 
35732-2] Leisurely treatment of metric spaces. Ap- 
plications of theory to classical algebra and anal- 
ysis. Knowlege of elements of theory of uniform 
convergence is assumed. Includes open and closed 
sets, complete metric spaces, connected sets, com- 
pactness, functions and mappings, and some appli- 
cations. Banach and Hilbert spaces introduced at the 
end along with general topology. Corrected reprint 
of 1968 original (TR, November 1969). MS 


Functional Analysis, P. Perturbation Bounds for 
Matriz Eigenvalues. Rajendra Bhatia. Res. Notes in 
Math. Ser., V. 162. Longman Scientific & Technical 
(US Distr: Wiley), 1987, 129 pp, $41.95 (P). [ISBN: 
0-470-20917-8] The central question is: If A and B 
are n X n matrices, how close are the eigenvalues of A 
and B given the distance || A — B||? This monograph 
establishes good bounds on the eigenvalue distance 
for yarious classes of matrices. MR 

Analysis, T(17-18: 1, 2), S*, P**, L**. Fourier 
Analysis. T.W. Korner. Cambridge U Pr, 1988, xii 
+ 591 pp, $95. [ISBN: 0-521-25120-6] An exception- 
ally well-written and wide-ranging account of the 
theory and applications of Fourier analysis. Inter- 
spersed with down-to-earth examples and anecdotes 
(such as the author’s description of being scalded by 
the time delay equation of a dormitory shower), this 
book is rewarding even just to browse through. It is 
one of the few $95 books that is actually worth the 
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price. BC 


Analysis, S(18), P**, L. Lectures on Bochner- 
Riesz Means. Katherine Michelle Davis, Yang-Chun 
Chang. London Math. Soc. Lect. Note Ser., V. 
114. Cambridge U Pr, 1987, ix + 150 pp, $22.95 (P). 
[ISBN: 0-521-31277-9] “Self-contained exposition of 
the geometric theory of Bochner-Riesz means.” As- 
sumes graduate course in real analysis and basic fa- 
miliarity with distributions and the Fourier trans- 
form. Wonderfully motivated with many intuitive 
insights. A joy to read. BH 


Analysis, P. Some Applications of Weighted Sobolev 
Spaces. Alois Kufner, Anna-Margarete Sandig. 
Teubner-Texte zur Math., B. 100. BG Teubner 
Leipzig, 1987, 268 pp, 28 M (P). [ISBN: 3-322-00426- 
0] Sequel to first author’s book Weighted Sobolev 
Spaces (Volume 31 in series). Part I studies elliptic 
boundary value problems for domains with conical 
corners and with edges. Part II deals with existence 
theorems for elliptic differential equations and prob- 
lems of the type of degenerate equations and equa- 
tions with singular coefficients. BH 


Analysis, P. Proceedings of the Analysts Confer- 
ence, Singapore 1986. Ed: Stephen T.L. Choy, Ju- 
dith P. Jesudason, P.Y. Lee. Math. Stud., V. 150. 
North-Holland (US Distr: Elsevier Science), 1988, 
xii + 304 pp, Dfl. 175 (P). [ISBN: 0-444-70341-1] 
Collection of seventeen papers primarily in harmonic 
and functional analysis. BH 


Analysis, P. Moduli of Smoothness. Z. Ditzian, V. 
Totik. Ser. in Computat. Math., V. 9. Springer- 
Verlag, 1987, ix + 227 pp, $54.90. [ISBN: 0- 
387-96536-X] The classical modulus of smoothness 
in approximation theory is defined by w’(f,t) = 
supocace || Anf ||. The authors introduce a new 
modulus wi(f,t)p = supocace || Ans ||; and ap- 
ply it to problems in polynomial approximation and 
characterization of k-functionals. Results presented 
are new. Very clear presentation. BH 


Analysis, P. Nonlinear Analysis and Applications. 
Ed: V. Lakshmikantham. Lect. Notes in Pure & 
Appl. Math., V. 109. Marcel Dekker, 1987, xix + 
649 pp, $99.75 (P). [ISBN: 0-8247-7810-3] A collec- 
tion of 84 papers presented at the Seventh Interna- 
tional Conference on Nonlinear Analysis and Appli- 
cations held at the University of Texas at Arlington 
in 1986. AWR 


Algebraic Geometry, S(18), P. Convez Bodies 
and Algebraic Geometry: An Introduction to the 
Theory of Toric Varieties. Tadao Oda. Ergebnisse 
der Math. und ihrer Grenzgebiete, B. 15. Springer- 
Verlag, 1988, viii + 212 pp, $78. [ISBN: 0-387- 
17600-4] A geometric introduction to complex an- 
alytic spaces, concentrating on toric varieties, which 
are also called torus embeddings. Although the full 
strength of algebraic geometry is not required, knowl- 
edge of basic properties of algebraic varieties over the 
complex numbers is necessary. Many results, includ- 
ing those of the author, are less than fifteen years 
old. Discouraging price. GG 


Algebraic Geometry, P. Lecture Notes in Mathe- 
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matics-1293: The Monodromy Groups of Isolated 
Singularities of Complete Intersections. Wolfgang 
Ebeling. Springer-Verlag, 1987, xiv + 153 pp, $17.30 
(P). [ISBN: 0-387-18686-7] Arithmetical characteri- 
zation (in terms of the Milnor lattice) for monodromy 
groups and vanishing cycles of certain intersection 
singularities. A step beyond hypersurfaces. BC 


Algebraic Geometry, P. Etale Cohomology and 
the Weil Conjecture. Eberhard Freitag, Reinhardt 
Kiehl. Ergebnisse der Math. und ihrer Grenzgebiete, 
B. 13. Springer-Verlag, 1988, xviii + 317 pp, $98. 
[ISBN: 0-387-12175-7] A self-contained treatment of 
the etale cohomology necessary for Deligne’s proof 
of the Weil conjectures. Including a chapter on the 
monodromy theory of Lefschetz pencils, a description 
of some applications of the conjectures, and a history 
of the Weil conjectures written by Dieudonné. SG 


Algebraic Geometry, P. Topics on Real and Com- 
plez Singularities: An Introduction. Alexandru 
Dimca. Adv. Lect. in Math. Friedr Vieweg & 
Sohn, 1987, xvii + 242 pp, (P). [ISBN: 3-528-08999-7] 
An introduction to local singularity theory with spe- 
cial attention paid to the complex analytic situation 
and connections with algebraic geometry. Main topic 
covered is the finite determinancy of germs (alias lo- 
cal C-algebras). Presupposes basics of smooth mani- 
folds, commutative algebra, and algebraic geometry. 
Takes the reader from introductory material through 
unpublished research work. Some exercises, exam- 
ples of open problems, bibliography, and index. MR 


Differential Geometry, P. Foliations on Rieman- 
nian Mantfolds. Philippe Tondeur. Universitext. 
Springer-Verlag, 1988, xi + 247 pp, $29.80 (P). 
[ISBN: 0-387-96707-9] Foliations are generalizations 
of integral flows of dynamical systems to higher di- 
mensions. This text represents notes from lectures 
given at the University of Illinois. Chapters 1-4 are 
an introduction to the subject of foliations, while the 
remainder of the book covers the relationship be- 
tween foliations on Riemannian manifolds. No ex- 
ercises. MR 


Differential Geometry, T**(17-18: 1), L. Dif- 
ferential Geometry, Gauge Theories, and Gravity. 
M. Gockeler, T. Schiicker. Mono. on Math. Physics. 
Cambridge U Pr, 1987, xii + 230 pp, $49.50. (ISBN: 
0-521-32960-4] Introduction to those areas of modern 
differential geometry fundamental to particle physics 
and general relativity. Topics include Yang-Mills the- 
ories, gravity, fiber bundles, monopoles, instantons, 
spinors, and anomalies. AM 


Geometry, P**, L**. Shaping Space: A Poly- 
hedral Approach. Ed: Marjorie Senechal, George 
Fleck. Birkhauser Boston, 1988, xx + 284 pp, 
$49.95. [ISBN: 0-8176-3351-0] A polyhedral an- 
thology in the Design Science Collection series edited 
by Arthur L. Loeb. Inspired by the Shaping Space 
Conference at Smith College in April 1984. The- 
ory and history of polyhedra. Applications in sci- 
ence and engineering. For “all scholars and educa- 
tors interested in and working with two- and three- 
dimensional structures and patterns.” Richly illus- 
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trated (photographs, line and half-tone drawings) 
interdisciplinary guide to three-dimensional forms 
focusing on polyhedra. Chapters on model build- 
ing, unsolved problems, polyhedra in the curriculum. 
An abundance of references in art, architecture, sci- 
ence and, even, geometry which will be unfamiliar 
to many mathematicians. Recommended for every 
school, college, and university library. JK 


Algebraic Topology, P. Lecture Notes in Mathe- 
matics-1298: Algebraic Topology, Barcelona 1986. 
Ed: J. Aguadé, R. Kane. Springer-Verlag, 1987, x 
+ 255 pp, $25.80 (P). [ISBN: 0-387-18729-4] Col- 
lected refereed papers from the Second Barcelona 
Conference on Algebraic Topology held in April 1986, 
and subsequently dedicated to the memory of Alex 
Zabrodsky. Consists of 15 main talks and 23 short 
communications on a variety of topics. LW 


Differential Topology, P. A Féte of Topology: Pa- 
pers Dedicated to Itiro Tamura. Ed: Y. Matsumoto, 
T. Mizutani, S. Morita. Academic Pr, 1988, xii + 602 
pp, $60. [ISBN: 0-12-480440-3]} A collection of terse 
and formal papers contributed by 30 authors. The 
papers are arranged into five categories: foliations, 
characteristic classes, singularities and orbifolds, 3- 
and 4-dimensional manifolds, and group actions. LW 


Differential Topology, P. Topics in Differential 
Geometry, V. I-II. Ed: J. Szenthe, L. Tamassy. 
North-Holland (US Distr: Elsevier Science), 1988, 
$315.75 set (ISBN: 0-444-70090-0]. Volume I, 708 
pp; Volume II, 666 pp. Proceedings of a conference 
held at Hajduszoboszlé6 in August 1984. A variety 
of topics covered with emphasis on Finsler geometry, 
Riemannian geometry, and applications. MR 


Differential Topology, S(16-17), P, L. Models 
of the Real Projective Plane: Computer Graphics of 
Steiner and Boy Surfaces. Francois Apéry. Com- 
put. Graphics & Math. Models. Friedr Vieweg & 
Sohn, 1987, xi + 156 pp, (P). [ISBN: 3-528-08955-5] 
Nicely illustrated text (with 64 color plates) on im- 
mersions of P? in R3. Hilbert thought you couldn’t 
do it without introducing singularities. Boy, was he 
wrong. BC 


Differential Topology, P. Stratified Morse The- 
ory. Mark Goresky, Robert MacPherson. Ergebnisse 
der Math. und ihrer Grenzgebiete, B. 14. Springer- 
Verlag, 1988, xv + 272 pp, $75. [ISBN: 0-387-17300- 
5] Morse theory is generalized to stratified spaces, 
i.e., spaces with a decomposition into submanifolds 
(of various dimensions) so that singularities are lo- 
cally constant along each submanifold. Applications 
are given, primarily to the topology of complex an- 
alytic varieties. The proofs of the main theorems, 
announced in 1980, appear here for the first time af- 
ter a period of simplification and geometrization. RB 


Topology, T(18), P, L. Eztensions and Absolutes 
of Hausdorff Spaces. Jack R. Porter, R. Grant 
Woods. Springer-Verlag, 1987, xiii + 856 pp, $89. 
[ISBN: 0-387-96212-3] An extension of a Hausdorff 
space X is a space in which X is dense; the Iliadis 
absolute of X is an extremally disconnected zero- 
dimensional Hausdorff space EX that admits a per- 
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fect irreducible §-continuous surjection onto X. Al- 
though conceptually quite different, extensions and 
absolutes can be constructed using similar tools. 
This tome is primarily a systematic study of ex- 
tensions and absolutes of a Hausdorff space. Each 
chapter begins with a well-written and compelling 
introductory section, presenting the motivation and 
goals of the chapter. The text is nicely organized 
and pleasant to read. Each chapter is followed by 
numerous exercises, so the book would be ideal as a 
text for a graduate-level course. LW 


Operations Research, P. Traffic Processes in 
Queueing Networks: A Markov Renewal Approach. 
Ralph L. Disney, Peter C. Kiessler. Math. Sci., V. 
4. Johns Hopkins U Pr, 1987, xix + 251 pp, $49.50. 
[ISBN: 0-8018-3454-6] Study of queueing traffic pro- 
cess problems, including dependencies between traf- 
fic processes, e.g., cross-correlational properties. Ar- 
rival processes, overflow processes and finite capacity 
queues, Markov renewal processes, point processes, 
reversibility and traffic sets, detailed study of pro- 
cesses in small networks. RM 


Operations Research, T(15-17: 1, 2), S. Prin- 
ciples of Operations Research for Management, Sec- 
ond Edition. Frank S. Budnick, Dennis McLeavey, 
Richard Mojena. Richard D Irwin, 1988, xx + 988 
pp, $42.95. [ISBN: 0-256-02643-2] Textbook for a 
course in management science, operations research, 
or quantitative methods offered at schools of man- 
agement/business administration, colleges, or uni- 
versities. In this edition: coverage of mathemati- 
cal models is unlinked from coverage of their solu- 
tion algorithms for greater flexibility; extended end- 
of-chapter case studies added; answers provided for 
selected exercises. RB 


Optimization, T(15-16: 1), L*. The Mathemat- 
ics of Nonlinear Programming. Anthony L. Per- 
essini, Francis E. Sullivan, J.J. Uhl, Jr. Undergrad. 
Texts in Math. Springer-Verlag, 1988, x + 273 pp, 
$42. [ISBN: 0-387-96614-5] After a brief review of 
unconstrained optimization, the text begins with the 
study of convex sets. Iterative and least squares 
methods are then treated in more detail than usual in 
the prerequisite analysis and linear algebra courses. 
The heart of the book uses geometry, penalty func- 
tions, Lagrange multipliers, and the simplex method 
in constrained optimization problems. A recurring 
theme is the Karush-Kuhn-Tucker existence theorem 
for convex programming. Each chapter concludes 
with a variety of exercises. Well-written, but several 
misprints. GG 


Optimization, T(17: 2), P. Mazimum and Mini- 
mum Principles: A Unified Approach, with Applica- 
tions. M.J. Sewell. Cambridge U Pr, 1987, xvi + 468 
pp, $79.50; $34.50 (P). (ISBN: 0-521-33244-3; 0-521- 
34876-5| Subject matter is optimization focused on 
nonlinear programming and on calculus of variations, 
but a nonstandard approach seeks unifying principles 
in the context of a saddle functional L[z, u] concave 
in 2, convex in u, with z in one inner product space, 
u in another. Chapter 1: Saddle function problems; 
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Chapter 2: Duality and Legendre transformations; 
and Chapter 3: Upper and lower bounds via saddle 
functions. Said to be suitable for a final undergradu- 
ate year course, but students might find it stiff going. 
Last two chapters intended as reference for workers 
in the field. Exceptionally detailed table of contents 
enhances value as a reference book. AWR 

Optimization, T(17), P. Stability and Perfection 
of Nash Equilibria. Eric van Damme. Springer- 
Verlag, 1987, xvii + 318 pp, $99. [ISBN: 0-387- 
17101-0] Solutions to game theory problems usually 
assume rational behavior on the part of one’s oppo- 
nent. Irrational behavior (non-cooperative game the- 
ory) is commonly covered in discussions of Nash equi- 
librium, a concept increasingly found to have certain 
limitations. In this book, an elaboration of the au- 
thor’s “refinements of the Nash equilibrium concept” 
that includes chapters on specific examples, various 
refinements of Nash equilibrium, are discussed. AWR 


Dynamical Systems, S(17-18), P. Dynamical 
Systems Approaches to Nonlinear Problems in Sys- 
tems and Circuits. Ed: Fathi M.A. Salam, Mark 
L. Levi. SIAM, 1988, xvi + 413 pp. [ISBN: 0- 
89871-218-1] Abstracts and papers contributed to 
illuminate areas of applied mathematics and engi- 
neering for which dynamical systems are appropri- 
ate. Specific areas include nonlinear circuits and sys- 
tems, control systems, solids and vibrations (espe- 
cially chaotic motion), and mechanical systems (e.g., 
robot manipulators). Two chapters of background 
papers on the general theory introduce the book. LB- 
E 


Probability, $(16-18), P, L. Counterezamples in 
Probability. Jordan M. Stoyanov. Prob. & Math. 
Stat. Wiley, 1988, xxiii + 313 pp, $81.95. [ISBN: 
0-471-91649-8] First two-thirds presents counterex- 
amples in probability theory, including sections on 
random events, random variables, distributions, ex- 
pectations, and limit theorems. Last third gives 
counterexamples in stochastic processes. Note price! 
RSK 

Probability, P. Ten Lectures on the Probabilistic 
Method. Joel Spencer. CBMS-NSF Reg. Conf. 
Ser. in Appl. Math. SIAM, 1987, v + 78 pp, (P). 
[ISBN: 0-89871-213-0] Notes from the CBMS-NSF 
Conference on Probabilistic Methods in Combin- 
atorics, Colorado, 1986. Topics include random 
graphs, the deletion method, the Gale-Berlekamp 
Switching Game, and the Lovdsz Local Lemma. LC 


Probability, P. Random Graphs ’85. Ed: Michal 
Karonski, Zbigniew Palka. Math. Stud., V. 144. 
North-Holland (US Distr: Elsevier Science), 1987, ix 
+ 354 pp, $97.25 (P). [ISBN: 0-444-70265-2] Papers 
from the Second International Seminar on Random 
Graphs and Probabilistic Methods in Combinatorics, 
Poznan, August 1985. LC 

Probability, T**(15-16: 1, 2), S*, L*. Introduc- 
tion to Probability. J. Laurie Snell. Random House, 
1988, xix + 517 pp, $27. (ISBN: 0-394-34485-5] Well- 
written text at the post-calculus level. Dominant 
feature is its extensive use of computer programs, 
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written in True BASIC, as a pedagogical tool to sim- 
ulate and analyze chance experiments. Another dis- 
tinguishing feature is the inclusion of numerous inter- 
esting historical remarks. Presents discrete and con- 
tinuous probability in a separate but parallel man- 
ner so text could be used for a discrete probability 
course. Concludes with a chapter on finite Markov 
chains. RSK 


Probability, S(15-16), L. Applied Problems in 
Probability Theory. E. Wentzel, L. Ovcharov. 
Transl: Irene Aleksanova. MIR (US Distr: Imported 
Pub), 1986, 432 pp, $13.95. [ISBN: 0-8285-3322-9] 
Revision of the 1983 Russian edition. Problems re- 
lated to various fields, including electrical and radio 
engineering; data transmission; information systems; 
reliability, preventative maintenance, repair and ac- 
curacy of apparatus; transport; and health services. 
Each chaptér begins by sketching (without proofs) 
some of the theory of probability; the bulk of each 
chapter is a collection of problems and their solutions 
which make use of the theory previously presented. 
An impressive collection of problems. LW 


Elementary Statistics, T*(13-14: 1, 2). Statis- 
tics, Fourth Edition. James T. McClave, Frank H. 
Dietrich II. Dellen, 1988, xx + 1014 pp. [ISBN: 0- 
02-379260-4] Expanded and updated revision of the 
authors’ 1985 Third Edition (Second Edition, TR, 
December 1982). Major change is the increased use 
of computer output throughout the text in both ex- 
amples and exercises, allowing a change in emphasis 
from computations to interpretations. Other signif- 
icant changes include an increased emphasis on sta- 
tistical quality control, and a collection of computer 
exercises based on a data set available from the pub- 
lisher. RSK 


Elementary Statistics, T(13: 1). Statistics Ez- 
plained: Bastc Concepts and Methods. R. Kapadia, 
G. Andersson. Math. & Its Applic. Halsted Pr, 
1987, 234 pp, $36.95. [ISBN: 0-470-20966-6] Pre- 
supposes no college mathematics. The usual topics, 
calculator tips, and Minitab used selectively. Many 
newspaper clippings showing statistics as it is used 
daily. Only birth data set and random numbers table 
are included. MS 


Elementary Statistics, S(13). How to Think 
about Statistics. John L. Phillips, Jr. WH Freeman, 
1988, xiv + 198 pp, $9.95 (P); $17.95. [ISBN: 0- 
7167-1923-1; 0-7167-1922-3] Revision of the 1982 Sec- 
ond Edition of the author’s book Statistical Thinking 
(1973 First Edition, TR, August-September 1974). 
Presents the logic behind the major ideas in statis- 
tical thinking. Most chapters include several realis- 
tic situations, called sample applications, where the 
reader is asked to make a decision about what to do. 
Answer section includes both “a good choice” and 
“possible misinterpretations” for each. RSK 


Statistics, T(14-16: 1, 2). Probability and Statis- 
tical Inference, Third Edition. Robert V. Hogg, El- 
liot A. Tanis. Macmillan, 1988, ix + 658 pp. [ISBN: 
0-02-355810-5] For students with some calculus back- 
ground. This new edition has more material on ex- 
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ploratory data analysis, more graphs, and more real 
applications. (First Edition, TR, May 1977.) FLW 


Statistics, P**, L. The Collected Works of John 
W. Tukey, Volume V, Graphics: 1965-1985. Ed: 
William S. Cleveland. Stat./Prob. Ser. Wadsworth, 
1988, lxiv + 464 pp, $44.95. [ISBN: 0-534-05102-2] 
Latest volume of a projected series covering Tukey’s 
many and diverse contributions to statistics (see 
TRs, May 1985 of Series and Volume I; February 
1986 of Volume I; and June-July 1987 of Volumes 
III and IV). Contains 18 papers, including some pre- 
viously unpublished, documenting the inventiveness 
of his pioneering work in this area. Includes com- 
ments by the editor on each of the papers. RSK 


Statistics, S(17), P, L. Introduction to Optimiza- 
tion Methods and their Application in Statistics. B.S. 
Everitt. Chapman & Hall, 1987, vii + 88 pp, $25 (P); 
$49.95. [ISBN: 0-412-29680-2; 0-412-27210-5] This 
short book can be broken into two sections. The first 
part discusses nonlinear mathematical programming 
algorithms independent of context, both direct and 
gradient methods. The second part contains several 
applications to statistical analysis. Good bibliogra- 
phy, a few problems, no solutions. SM 


Statistics, T(16-18). Linear Least Squares Com- 
putations. R.W. Farebrother. Stat.: Textbooks & 
Mono., V. 91. Marcel Dekker, 1988, xiii + 293 pp, 
$79.75. [ISBN: 0-8247-7661-5] Prerequisite: sound 
knowledge of a computer programming language to 
implement the algorithms on computer. Matrix 
arithmetic introduced as required. Several aspects 
of least squares method examined; canonical expres- 
sions related to traditional expressions. Allows read- 
ers to select variants most suitable for their needs; 
eliminates error analysis; adopts an empirical ap- 
proach. MS 


Statistics, P. Statistical Estimation in Large Pa- 
rameter Spaces. A.W. van der Vaart. CWI Tract 
V. 44. Math Centrum, 1988, 205 pp, Dfl. 31.60 
(P). [ISBN: 90-6196-329-X] Theoretical monograph 
concerned with estimation in situations intermediate 
between parametric and nonparametric models. Ob- 
tains bounds on the asymptotic behavior of estima- 
tors, and shows how to construct efficient estimators 
for certain cases. RSK 


Statistics, P, L. Selected Tables in Mathematical 
Statistics, V. 11. Ed: R.E. Odeh, J.M. Daven- 
port. AMS, 1988, xi + 371 pp, $46. [ISBN: 0- 
8218-1911-9] Tables of one-sided and two-sided up- 
per equicoordinate percentage points of the central 
multivariate student ¢ distribution in which there is 
a common variance estimate in the denominators of 
the variates, and the numerators are equicorrelated. 
Applications of tables and examples of their use are 
described in detail. MS 


Computer Literacy, C. Using OS/2. Kris Jamsa. 
Osborne McGraw-Hill, 1988, xv + 758 pp, $19.95 
(P). [ISBN: 0-07-881306-9] A “how-to” book for us- 
ing MicroSofts’ new multi-tasking microcomputer 
operating system, OS/2. A straightforward, no frills 
coverage of concepts, commands, and procedures to 
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make use of OS/2. Most useful as a reference to ac- 
company OS/2’s user manual. PS 


Computer Literacy, L. Encyclopedia of Microcom- 
puters, Volume 1: Access Methods to Assembly Lan- 
guage and Assemblers. Ed: Allen Kent, James G. 
Williams. Marcel Dekker, 1988, viii + 434 pp, $160. 
[ISBN: 0-8247-2700-2] This is the first of a proposed 
ten-volume work of about 500 articles in about 5000 
pages. This volume covers access methods to assem- 
bly language and assemblers. It has a 1988 copy- 
right, but the entry for Apple, for example, does not 
have the Mac Plus, SE, or II mentioned. Two very 
good articles are “Architecture of microprocessors” 
(by Robert C. Stanley), and “Assembly language and 
assemblers” (by Roger Flynn). RSF 


Elementary Computer Science, T*(13: 1). A 
Second Course tin Computer Science with Modula-2. 
Daniel D. McCracken, William I. Salmon. Wiley, 
1987, xiii + 474 pp, $34.10. [ISBN: 0-471-63111-6] 
Strongly emphasizes the use of abstract data types, 
recursion, and program readability. Makes effective 
use of Modula-2 to illustrate the software engineer- 
ing principles discussed in the text. Content con- 
sistent with most recent ACM guidelines for a CS-2 
course. AO 


Elementary Computer Science, T*(13: 1). 
Pascal on the Macintosh: A Graphical Approach. 
David A. Niguidula, Andries van Dam. Addison- 
Wesley, 1987, xxxiv + 686 pp, (P). [ISBN: 0-201- 
16588-0] For the CS1 course. Interactive graphics 
orientation throughout: graphics before program- 
ming, not the other way around. Uses Macintosh and 
Lightspeed Pascal and all their debugging and for- 
matting facilities. Emphasizes problem solving, pro- 
gram development, graphical user interfaces. Over 
300 illustrations and 500 exercises. Later chapters 
have advanced topics. DFA 


Programming, T(13-14). Pascal Plus Data Struc- 
tures, Algorithms, and Advanced Programming, Sec- 
ond Edition. Nell Dale, Susan C. Lilly. DC Heath, 
1988, xx + 750 pp, $26 (P). [ISBN: 0-669-15284-6] 
Good introduction to data structures and advanced 
programming. Emphasis on algorithms, modulariza- 
tion, data encapsulation and abstraction, informa- 
tion hiding, and software engineering make it a pos- 
sible choice for a second semester programming text. 
Includes a new chapter on verifying, debugging, and 
testing, and updates in problem sets, built-in data 
structures discussion, and priority queues since the 
First Edition (TR, February 1986). Has always had 
readable text and nice illustrations. Sorting algo- 
rithms, efficiency considerations, and searching get 
higher visibility than some texts. LB-E 


Programming, S, P. OS/2 Programmer’s Guide. 
Ed Iacobucci. Osborne McGraw-Hill, 1988, xxi + 
1100 pp, $24.95 (P). [ISBN: 0-07-881300-X] Written 
by the leader of the IBM OS/2 design team, this book 
for programmers (using standard edition release 1.0) 
is in three parts: the first part includes background 
material; the second part covers the overall struc- 
ture and functions of OS/2; and the third part ap- 
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plies concepts from part 2. Examples are written 
in assembler; appendices include function calls, er- 
ror codes, and sample programs. Topics presented 
include dynamic linking, the application program- 
ming interface (API), multi-tasking, protected mode, 
memory management, device drivers, I/O subsys- 
tems, session management, and OS/2 commands and 
utilities. RSF 

Programming, S?? An Easy Course in Using the 
HP-28S. John W. Loux, Chris Coffin. Grapevine 
Pub, 1988, 315 pp, $22 (P). [ISBN: 0-931011-18-3] 
Hard to believe that anyone who purchased an HP- 
28S would have any use for this simplistic manual. 
Some helpful hints and shortcuts, but most are too 
little, too late. MR 


Programming, T(14-15: 1), S. The C Trilogy: 
A Complete Library for C Programmers. Eric P. 
Bloom. TAB Books, 1987, viii + 588 pp, $22.95 (P). 
[ISBN: 0-8306-2890-8] This is a primer and reference 
manual for C. In addition to simply teaching the lan- 
guage, it also contains a listing of dozens of useful C 
functions which it calls the C Toolbox Library. GMS 


Programming, P, L*. Computer Games I. Ed: 
David N.L. Levy. Springer-Verlag, 1988, xiii + 456 
pp, $45. [ISBN: 0-387-96496-7|] Part of a collec- 
tion of 50 papers on how computers play games of 
strategy, drawn from widely-scattered sources, as- 
sembled by an International Chess Master and spe- 
cialist in the field. Volume I: backgammon (world 
champion beaten!); chess; checkers; scrabble. Vol- 
ume II: bridge; go; poker; nine other games. These 
solid papers concern aspects including mathemati- 
cal analysis, heuristic strategies, practical considera- 
tions, history. Bibliography, index. RB 
Programming, P. C: Power User’s Guide. Herbert 
Schildt. Osborne McGraw-Hill, 1988, x + 382 pp, 
$22.95 (P). [ISBN: 0-07-881307-7] A text for those 
who want to become extremely competent program- 
mers in C. Covers many advanced and specialized 
topics such as windows, graphics, screen program- 
ming, communications, and I/O programming. For 
the technical specialist who must use C in everyday 
work environment. GMS 


Programming, T(13: 1). Illustrating Pascal. 
Donald Alcock. Cambridge U Pr, 1987, vii + 184 
pp, $12.95 (P). [ISBN: 0-521-33695-3] Complete in- 
troduction to standard Pascal. Presentation is picto- 
rial: the book is entirely hand-printed, with text and 
diagrams attractively interwoven throughout. Pleas- 
ant reading. Few programming exercises. Includes 
bubble sort, Quicksort, stacks and queues, symbol- 
state tables, reverse Polish notation, shortest-route 
technique, double-linked rings, binary trees, hash- 
ing. DFA 

Languages, S(13-15), L. Modula-2: A Complete 
Guide. K.N. King. DC Heath, 1988, xxi + 656 pp, 
(P). [ISBN: 0-669-11091-4] Designed to be used ei- 
ther as a reference or as a textbook. Based on the 
third, corrected edition of Wirth’s Programming in 
Modula-2 (TR, June-July 1983). AO 


Languages, T(18), P. Concurrent Prolog: Col- 


986 


lected Papers. Ed: Ehud Shapiro. Ser. in Logic 
Program. MIT Pr, 1987, $65 set [ISBN: 0-262-19255- 
1]. Volume 1, xxix + 525 pp; Volume 2, xviii + 
653 pp. A republication of 39 papers by authors 
from various countries on concurrent forms of the 
logic programming language Prolog. One of these 
languages (GHC) is the “system language” for the 
Japanese ICOT Fifth Generation Computer Project, 
and much of the material was developed while in- 
teracting at ICOT. After surveying several concur- 
rent logic programming languages, the work focuses 
primarily on two (Concurrent Prolog and its sub- 
set, Flat Concurrent Prolog). Parallel algorithms, 
system programming, high- and low-level implemen- 
tation techniques. Suitable for reference or for the 
comparative study of concurrent logic programming 
languages; can possibly be used for learning (Flat) 
Concurrent Prolog. RB 


Algorithms, P. Distributed Algorithms and Proto- 
cols. Michel Raynal. Transl: Jack Howlett. Wiley, 
1988, xi + 163 pp, $31.95 (P). [ISBN: 0-471-91754-0] 
Protocols for controlling distributed systems. Deals 
with important problems such as nontrivial exclu- 
sion, deadlock, termination, and management of in- 
formation transfers. Includes analyses of the algo- 
rithms and proofs of correctness. RWN 


Algorithms, P. Searching Algorithms. Jiri Wieder- 
mann. Teubner-Texte zur Math., B. 99. BG Teub- 
ner, 1987, 123 pp, 13 M (P). [ISBN: 3-322-00433-3] 
“[A] monograph as well as a handbook giving the 
overview of both the state-of-the-art and trends in 
the field of searching algorithms,” by a Czech au- 
thor. Advanced hashing techniques; various types 
of search trees; multidimensional searching with de- 
grees of matching (e.g., interval match queries); syn- 
thesizing efficient solutions of searching problems via 
class membership problems and dynamization. RB 


Computer Systems, S, P, L. The UNIX Sys- 
tem Guidebook, Second Edition. Peter P. Silvester. 
Springer-Verlag, 1988, xiv + 334 pp, $28.95 (P). 
[ISBN: 0-387-96489-4] An introduction to UNIX 
for those with some computer experience who are 
concerned with applications rather than with com- 
puter systems themselves. Getting started, file sys- 
tem, shells, system kernel, utilities, editing, text pro- 
cessing, compilers, commands, bibliography. The 
Second Edition is expanded, extensively rewritten. 
Tested on Seventh Edition Unix, System V and 4.2 
BSD versions of UNIX; emphasizes System V. (First 
Edition, TR, August-September 1984.) RB 


Computer Systems, P. Hypercube Multiprocessors 
1987. Ed: Michael T. Heath. SIAM, 1987, xvii + 761 
pp. [ISBN: 0-89871-215-7] A collection of 80 papers 
given at the Second Conference on Hypercube Mul- 
tiprocessors. Multiprocessor architecture has neces- 
sitated re-evaluating traditional programming tech- 
niques, algorithms, and operating systems to take 
full advantage of a multiprocessor system’s potential. 
Papers presented illustrate this point, ranging from 
performance studies on hypercubes to many areas of 
their application including combinatorial problems, 
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scientific applications, numerical methods, and spe- 
cial emphasis on matrix computations and partial 
differential equations. PS 


Computer Systems, P. Simulating Computer Sys- 
tems, Techniques and Tools. M.H. MacDougall. Ser. 
in Comput. Syst. MIT Pr, 1987, 292 pp, $32.50. 
[ISBN: 0-262-13229-X] Written for computer and 
communication system designers who want to ana- 
lyze performance of designs. Divided into two parts. 
First, an introduction to discrete-event simulation in 
SMPL, a C-based simulation language. Covers model 
building, implementation in SMPL, and output data 
analysis. A multiprocessor system model and an eth- 
ernet model are carefully created as examples. The 
second part describes the implementation of SMPL. 
C source code for the SMPL language is included. PS 


Computer Systems, P. Software Diversity in 
Computerized Control Systems. Ed: U. Voges. De- 
pendable Computing & Fault-Tolerant Syst., V. 2. 
Springer-Verlag, 1988, 216 pp, $49.50. [ISBN: 0- 
387-82014-0] One of the methods of achieving fault- 
tolerant computer systems is through the use of fault- 
tolerant software. This text reviews the methods of 
developing such software and looks at applications of 
these techniques to such areas as transportation and 
nuclear reactors. GMS 


Computer Systems, T(14: 1), S. The MC68000 
Assembly Language and Systems Programming. 
William Ford, William Topp. DC Heath, 1987, xx 
+ 590 pp, $24. (ISBN: 0-669-16085-7] Material for 
the one-semester assembly language programming 
course, and much more: on high-level language run- 
time environment, data structures, I/O program- 
ming (includes libraries), exception processing, pe- 
ripheral device interrupts. Examples, complete pro- 
grams, exercises. Appendices provide complete in- 
struction set details for the MC68000 and related 
microprocessors. Extensively class tested. DFA 
Computer Systems, S(15-16), P, L. UNIX In- 
ternals: A Systems Operation Handbook. Myril 
Clement Shaw, Susan Soltis Shaw. TAB Books, 1987, 
xiii + 206 pp, $17.95 (P). [ISBN: 0-8306-2951-3] An 
elementary introduction to the inner workings of the 
UNIX operating system. Doesn’t burden the reader 
with too many technical details. Recommended for 
the user without an extensive background in com- 
puter science who wants (or needs) to know more 
about how UNIX does its job. AO 


Computer Systems, S(15-16), P, L. UNIX Sys- 
tem Programming. Keith Haviland, Ben Salama. In- 
tern. Comput. Sci. Ser. Addison-Wesley, 1987, xv + 
339 pp, (P). [ISBN: 0-201-12919-1] An excellent in- 
troduction to the system call interface of the UNIX 
operating system as defined by the AT&T System 
V Interface Definition. Also discusses the standard 
subroutine libraries. AO 


Computer Graphics, P, L. Geometric Model- 
ing: Algorithms and New Trends. Ed: Gerald E. 
Farin. SIAM, 1987, xi + 399 pp. [ISBN: 0-89871- 
206-8] A collection of survey articles and research 
papers organized into four sections: Curves and 
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Surfaces; Solid Modeling and Multivariate Surfaces; 
Shape; and Intersection Algorithms. About 60% of 
the twenty-seven papers are based on presentations 
made at the SIAM Conference on Geometric Mod- 
eling and Robotics held in Albany, New York, July 
15-19, 1985. AO 


Computer Graphics, P. GKS Theory and Prac- 
tice. Ed: P.R. Bono, I. Herman. EurographicSemi- 
nars. Springer-Verlag, 1987, x + 316 pp, $49. [ISBN: 
0-387-18257-8] The Graphics Kernel System (GKS) 
is the first international graphics standard. It is a 
description of a device-independent graphics library. 
This book emphasizes issues that arise in the imple- 
mentation of GKS in a specific environment. It con- 
tains nineteen papers covering concepts and archi- 
tecture, algorithms, specification, certification, and 
programming language interfaces. AM 

Computer Graphics, T(16-17: 1), P, L*. An In- 
troduction to Splines for use in Computer Graphics 
and Geometric Modeling. Richard H. Bartels, John 
C. Beatty, Brian A. Barsky. Morgan Kaufmann, 
1987, xiv + 476 pp, $38.95. [ISBN: 0-934613-27-3] 
An intuitive introduction to parametric spline curves 
and parametric, tensor-product spline surfaces em- 
phasizing those topics most applicable to computer 
graphics. Assumes only a background in elementary 
calculus and linear algebra. AO 


Computer Graphics, P. Intelligent CAD Sys- 
tems I: Theoretical and Methodological Aspects. Ed: 
P.J.W. ten Hagen, T. Tomiyama. EurographicSem- 
inars. Springer-Verlag, 1987, xiv + 360 pp, $59.50. 
(ISBN: 0-387-18281-0] Eighteen papers contributed 
at an April 1987 workshop in The Netherlands. The 
title subject encompasses knowledge-based and ex- 
pert CAD systems. Future workshops will concern 
implementational issues and practical experiences 
and evaluations. DFA 


Computer Graphics, P. Theoretical Foundations 
of Computer Graphics and CAD. Ed: R.A. Earn- 
shaw. NATO ASI Ser. F, V. 40. Springer-Verlag, 
1988, xx + 1241 pp, $125. [ISBN: 0-387-19506-8] 
Proceedings of a 1987 institute on the analysis and 
exposition of the theoretical foundations of computer 
graphics and CAD. Topics include data structures, 
design, image processing, drawing algorithms, the- 
ory and formal methods, modelling and surfaces, im- 
age generation and reconstruction, hardware archi- 
tectures. RM 


Theory of Computation, S(18), P. Lecture Notes 
in Computer Science-294: STACS 88. Ed: R. Cori, 
M. Wirsing. Springer-Verlag, 1988, ix + 404 pp, 
$33.30 (P). [ISBN: 0-387-18834-7] This book contains 
reprints of the 34 papers presented at the Fifth An- 
nual Symposium on the Theoretical Aspects of Com- 
puter Science, Bordeaux, France, February 1988. 
The papers address research topics in four major ar- 
eas: complexity of algorithms, formal language the- 
ory, graph theory, and semantics. GMS 

Artificial Intelligence, S(15-17), L*. Percep- 
trons: An Introduction to Computational Geome- 
try, Ezpanded Edition. Marvin Minsky, Seymour 
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Papert. MIT Pr, 1988, xv + 292 pp, $12.50 (P). 
(ISBN: 0-262-63111-3] Reprint of the 1972 printing 
(complete with its handwritten corrections overwrit- 
ten on original 1969 typesetting), expanded with a 
contemporary prologue providing a twenty year ret- 
rospective: “We found that little of significance had 
changed since 1969, when the book was first pub- 
lished. ...The issues that were then obscure remain 
obscure today because no one yet knows how to tell 
which of the present discoveries are fundamental and 
which are superficial.” LAS 


Artificial Intelligence, T?(17-18: 1), P. Compu- 
tational Complezity and Natural Language. G. Ed- 
ward Barton, Jr., Robert C. Berwick, Eric Sven Ris- 
tad. MIT Pr, 1987, xii + 335 pp, $24.95. [ISBN: 
0-262-02266-4] Study of computational complexity 
theory studied as a classification tool more power- 
ful than weak generativity (classifying by the sets 
of strings capable of generation) for the analysis 
of grammatical formalisms. Analysis of agreement 
and ambiguity, lexical functional grammar, dictio- 
nary and morphological analysis sytems, and gener- 
alized phrase structure grammar (GPSG). RM 


Artificial Intelligence, P. Neural Computers. Ed: 
Rolf Eckmiller, Christoph v.d. Malsburg. NATO 
ASI Ser. F, V. 41. Springer-Verlag, 1988, xiii + 566 
pp, $112. [ISBN: 0-387-18724-3] A collection of 50 
papers and 4 group reports covering topics such as 
neural network architecture, learning and memory, 
fault tolerance, pattern recognition, and motor con- 
trol in brains versus neural computers. Intended as a 
reference book for research in the field of neural com- 
puters and in “the endeavor to transfer concepts of 
brain function and brain architecture to the design 
of self-organizing computers with neural net archi- 
tecture.” Includes a general reference list of intro- 
ductory books on neural computers. PS 


Artificial Intelligence, P. Natural Language Gen- 
eration Systems. Ed: David D. McDonald, Leonard 
Bolc. Symbolic Computat. Springer-Verlag, 1988, xi 
+ 388 pp, $37. [ISBN: 0-387-96691-9] Ten extended 
papers on computer systems for producing natural 
language output, concerning topics such as discourse 
theory, mechanical translation, deliberate (as op- 
posed to spontaneous) writing, and revision. RB 


Artificial Intelligence, P. Real-Time Object 
Measurement and Classification. Ed: Anil K. 
Jain. NATO ASI Series F, V. 42. Springer-Verlag, 
1988, viii + 407 pp, $79.50. [ISBN: 0-387-18766-9] 
Proceedings of a workshop at Raretea, Italy, Septem- 
ber 1987, focusing on robotics-related computer vi- 
sion. 25 papers, including hardware implementation 
of specific vision algorithms, a complete vision sys- 
tem for object tracking and inspection, three-camera 
stereo for future measurement, object recognition by 
neural network, integration of CAD and vision sys- 
tems, pyramid architectures in computer vision. RB 


Computer Science, T(15-18: 1). A Common- 
sense Approach to the Theory of Error Correcting 
Codes. Benjamin Arazi. Ser. in Comput. Syst. MIT 
Pr, 1988, x + 208 pp, $22.50. [ISBN: 0-262-01098- 
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4] Presents this theory without much of the higher- 
level mathematics (finite fields, primitive polynomi- 
als) except in Appendices. Purpose is to provide an 
intuitive feel for the material, using basic linear al- 
gebra, exclusive-OR gates, and linear feedback shift 
registers. Includes most basic concepts in an easy-to- 
read format with concise descriptions. No problem- 
solving. LB-E 

Computer Science, P. Lecture Notes in Computer 
Science-296: Trends in Computer Algebra. Ed: R. 
JanBen. Springer-Verlag, 1988, v + 197 pp, $21.80 
(P). [ISBN: 0-387-18928-9] Proceedings of an inter- 
national symposium sponsored by IBM Germany, 
Bad Neuenahr, May 1987, intended to provide a 
forum for researchers and users of computer alge- 
bra. Eleven papers and two abstracts: languages 
and systems; symbolic computations; computing in 
algebraic structures; applications. RB 

Computer Science, P. The Craft of Software En- 
gineering. Allen Macro, John Buxton. Intern. Com- 
put. Sci. Ser. Addison-Wesley, 1987, xii + 380 pp, 
(P). [ISBN: 0-201-18488-5] A text for practitioners 
in the area of software design and development. It 
addresses such practical “how-to” issues as managing 
software development, estimating cost and timescale, 
team structure, estimating quality, and acceptance 
testing. This text is appropriate for experienced pro- 
fessionals who wish to improve their practical, tech- 
nical, and management skills. GMS 


Computer Science, S(17-18). Lambda-Calculus, 
Combinators, and Functional Programming. G.E. 
Revesz. Tracts in Theoret. Comput. Sci., V. 4. 
Cambridge U Pr, 1988, x + 181 pp, $29.95. [ISBN: 
0-521-34589-8] The lambda calculus is the formal 
mathematical basis for the programming language 
LISP. There has been a renewed interest in this 
branch of mathematics because of its new applica- 
tions to functional programming, denotational se- 
mantics, and artificial intelligence. This text intro- 
duces the reader to this area of mathematics and how 
it can be used in computer science. GMS 


Computer Science, P, L. Annual Review of Com- 
puter Science, Volume 2, 1987. Ed: Joseph F. 
Traub, et al. Annual Reviews, 1987, ix + 565 pp, 
$39. [ISBN: 0-8243-3202-4] Sixteen survey papers re- 
viewing recent developments in artificial intelligence, 
fault-tolerant computing, software engineering, and 
theoretical computer science. AO 


Computer Science, P. Mathematics in Signal Pro- 
cessing. Ed: T.S. Durrani, et al. IMA Conf. Ser., V. 
12. Clarendon Pr, 1987, xvi + 677 pp, $115. (ISBN: 
0-19-853613-5] Proceedings of a conference at Bath, 
United Kingdom, September 1985. Application of 
advanced analytical techniques from mathematical 
sources such as approximation theory, number the- 
ory, linear algebra, optimization theory, time series 
analysis. 46 research papers on signal analysis and 
modelling; spectral analysis, inverse problems; im- 
age reconstruction; numerical algorithms and archi- 
tectures; adaptive techniques. RB 


Computer Science, P. The Characteristics of Par- 
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allel Algorithms. Ed: Leah H. Jamieson, Dennis 
Gannon, Robert J. Douglass. MIT Pr, 1987, x + 440 
pp, $27.50. [ISBN: 0-262-10036-3] Contains reprints 
of papers presented at a workshop on Taxonomy of 
Parallel Algorithms held in Santa Fe, New Mexico, 
January 1987. There are 16 papers divided into three 
sections—classification of parallel algorithms, con- 
currency issues, and impact of parallelism on soft- 
ware development. GMS 


Computer Science, P, L. Research Directions in 
Object-Oriented Programming. Ed: Bruce Shriver, 
Peter Wegner. Ser. in Comput. Syst. MIT Pr, 1987, 
vii + 585 pp, $40. [ISBN: 0-262-19264-0] Sixteen re- 
search papers describing current research in object- 
oriented programming in the areas of langauges, en- 
vironments, databases, and theory. Preliminary ver- 
sions of these papers were presented at a workshop 
held at the IBM T.J. Watson Research Center in 
June 1986. Early drafts of about half of these papers 
were published in the October 1986 issue of SIG- 
PLAN Notices. AO 


Computer Science, P. Lecture Notes in Computer 
Science-292: The Munich Project CIP, Volume II: 
The Program Transformation System CIP-S. CIP 
System Group. Springer-Verlag, 1987, viii + 522 pp, 
$40 (P). [ISBN: 0-387-18779-0] Formal descriptions 
of an implementation of a transformational program- 
ming system, a wide-ranging system in which the 
programmer can work at a variety of levels with au- 
tomatic assistance ih specification, program synthe- 
sis, code generation, verification, modification, and 
documentation. RWN 


Computer Science, P. Radon and Projection 
Transform-Based Computer Vision: Algorithms, A 
Pipeline Architecture, and Industrial Applications. 
J.L.C. Sanz, E.B. Hinkle, A.K. Jain. Ser. in Inform. 
Sci., V. 16. Springer-Verlag, 1988, viii + 123 pp, 
$39. [ISBN: 0-387-18396-5] Describes the hardware 
and algorithms used in a real-time, projection-based 
machine vision system. Also describes several appli- 
cations to industrial visual inspection problems. AO 


Computer Science, S$(18), P. Abstract Interpreta- 
tion of Declarative Languages. Ed: Samson Abram- 
sky, Chris Hankin. Books in Comput. Sci. Halsted 
Pr, 1987, 284 pp, $64.95. [ISBN: 0-470-20971-2] De- 
voted to a new research area in computer science 
called abstract interpretation, a technique which in- 
volves the application of non-standard semantics to 
the compilation of programs. The algorithms de- 
veloped are applied to declarative languages such as 
PROLOG. The book requires a great deal of sophis- 
ticated knowledge in such advanced areas as compiler 
design and denotational semantics. GMS 


Computer Science, T(13). An Introduction to 
Computer Science with Modula-2. J. Mack Adams, 
Philippe J. Gabrini, Barry L. Kurtz. DC Heath, 
1988, xix + 571 pp, (P). [ISBN: 0-669-12171-1] A 
well-organized and well-written introduction to com- 
puter science. Presents an excellent problem solving 
methodology and clearly illustrates it in each exam- 
ple throughout the text. Also places emphasis on 
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the role of documentation in programming—a topic 
needing emphasis during introductory courses. Cov- 
ers the usual other material: recursion (using sev- 
eral examples with debugging traces), iteration, ele- 
mentary data structures, control structures. Modu- 
lar programming’s importance is both stressed and 
justified—the choice of Modula-2 reinforces this. 
Other topics include simple sorts, some algorithm 
analysis, abstract data types, numerical methods, 
and concurrency. PS 


Computer Science, P. Lecture Notes in Com- 
puter Science-291: Graph-Grammars and Their Ap- 
plication to Computer Science. Ed: H. Ehrig, et 
al. Springer-Verlag, 1987, viii + 609 pp, $52.10 
(P). {ISBN: 0-387-18771-5] Proceedings of a 1986 
workshop on graph grammars (grammar methods 
for specifying and generating graphs and rewriting 
mechanisms). Contains both tutorial papers and 
research contributions, with applications to paral- 
lelism, concurrency, modelling, complexity, etc. RM 


Applications, T(15-17), P, L. Fuzzy Mathemat- 
tcal Techniques with Applications. Abraham Kan- 
del. Addison-Wesley, 1986, xiv + 274 pp. [ISBN: 0- 
201-11752-5] A systematic introduction to the math- 
ematical theory and applications of imprecise rela- 
tionships. Fuzzy sets transcend classical probability 
in scope: imprecision (fuzziness) includes not only 
sets but also probabilities, statistics, relations, and 
logics. Applications conclude with a fuzzy relational 
data base as a model of a human expert. Includes 
1000-item bibliography. LAS 

Applications. Mathematical Modelling: Classroom 
Notes in Applied Mathematics. Ed: Murray S. 
Klamkin. SIAM, 1987, xiv + 338 pp, (P). [ISBN: 0- 
89871-204-1] A collection of the articles which have 
appeared in the “Classroom Notes in Applied Mathe- 
matics” section of SIAM Reviews since 1975 together 
with an extensive list of supplemental references or- 
ganized by subject area. AO 


Applications (Economics), P. Econometrics of 
Planning and Efficiency. Ed: Jati K. Sengupta, 
Gopal K. Kadekodi. Adv. Stud. in Theoret. & 
Appl. Econom., V. 11. Kluwer Academic, 1988, ix 
+ 193 pp, $66.50. [ISBN: 90-247-3602-1] A collection 
of ten papers concerning stochastic planning, a com- 
bination of econometrics with aspects of probability 
and operations research. Three parts: methodology, 
planning models, and analysis of efficiency. SM 


Applications (Electrical Engineering), T(16- 
17: 1). An Introduction to Signal Detection and Es- 
timation. H. Vincent Poor. Texts in Elect. Engin. 
Springer-Verlag, 1988, x + 549 pp, $58. (ISBN: 0- 
387-96667-6] Provides good introduction to (or re- 
view of) hypothesis testing while pre-supposing a 
working knowledge of applied probability and ran- 
dom processes. Excellent exercises at the end of each 
chapter require both theoretical and application un- 
derstanding. Separates subject into problems of de- 
tection and estimation. Fine explanations. LB-E 

Applications (Engineering), T(15-16: 2, 3). 
Advanced Engineering Mathematics. Stanley I. 
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Grossman, William R. Derrick. Harper & Row, 1988, 
xviii + 1176 pp. [ISBN: 0-06-042534-2] In six mostly 
independent parts for a year-long course covering or- 
dinary and partial differential equations, vector cal- 
culus, linear algebra, numerical methods, and com- 
plex variables. In addition to the usual applications 
in electrical circuits and vibratory motion, topics 
in economics, engineering, and physics are included. 
Not wordy, but very clearly written. Stress on the 
need for solving problems. Instructor’s manual avail- 
able. JK 


Applications (Engineering), T(15: 1, 2). Ad- 
vanced Engineering Mathematics. Michael D. Green- 
berg. Prentice-Hall, 1988, xii + 946 pp. [ISBN: 0- 
13-010505-8] Traditional material on applied mathe- 
matics for students in engineering or science. Twenty 
chapters in five (mostly independent) parts. For one 
or more semesters. Prerequisites are calculus and at 
least part of a course in ordinary differential equa- 
tions. Written to be readable by students. Useful 
as a reference and for self study within and with- 
out a course. Except for the finite-difference solu- 
tion of partial differential equations, topics in nu- 
merical analysis are omitted. Drill-type exercises. 
Some problems on applications, others extend the 
text. Answers to selected exercises. Solutions man- 
ual available. JK 


Applications (Fluid Mechanics), P, L. Annual 
Reviews of Fluid Mechanics, Volume 20, 1988. Ed: 
John L. Lumley, Milton van Dyke, Helen L. Reed. 
Annual Reviews, 1988, 551 pp, $34. [ISBN: 0-8243- 
0720-8] Seventeen survey papers on various topics 
(e.g., fractals, surf-zones, sand transport, digital im- 
age processing) together with cumulative author and 
title indices to the entire twenty volume series. LAS 


Applications (Information Theory), P. Seif- 
Organization and Assoctative Memory, Second Edt- 
tion. Teuvo Kohonen. Ser. in Inform. Sci., V. 
8. Springer-Verlag, 1988, xv + 312 pp, $39.50 (P). 
[ISBN: 0-387-18314-0] An exposition of mathemati- 
cal aspects of patterning processing. This edition in- 
cludes new chapters on neural computing and optical 
associative memories. (First Edition, TR, November 
1984.) SG 


Applications (Medicine), P. Mathematics and 
Computer Science in Medical Imaging. Ed: Max A. 
Viergever, Andrew Todd-Pokropek. NATO ASI Ser. 
F, V. 39. Springer-Verlag, 1988, viii + 546 pp, $98. 
[ISBN: 0-387-18672-7] A summary of the proceed- 
ings of a NATO Advanced Study Institute held in II 
Ciocco, Italy, September 21-October 4, 1986. Covers 
fundamental ideas in medical imaging and the asso- 
ciated mathematical problems. Aimed at physicists, 
engineers, computer scientists, mathematicians, and 
informed clinicians working in the area of medical 
imaging. Includes eleven introductory and tutorial 
papers and nineteen more specialized papers. RH 


Applications (Physical Science), P. Multipro- 
cessing in Meteorological Models. Ed: G.-R. Hoff- 
mann, D.F. Snelling. Topics in Atmospheric & 
Oceanic Sci. Springer-Verlag, 1988, xvi + 438 pp, 
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$70. [ISBN: 0-387-18457-0] Predicting the weather 
is harder than von Neumann imagined, but parallel 
processing should extend the range and accuracy of 
weather models. Contains 28 papers from two work- 
shops, including an excellent article on “Computer 
Requirements for Atmospheric Modelling.” BC 


Applications (Physics), S(13), L. Seven Ideas 
that Shook the Universe. Nathan Spielberg, Bryon D. 
Anderson. Wiley, 1987, viii + 263 pp, $22.95. [ISBN: 
0-471-85974-5] A descriptive, non-mathematical ap- 
proach to seven themes in physics: Copernican as- 
tronomy, Newtonian mechanics, the energy concept, 
entropy, relativity, quantum theory, and conserva- 
tion principles and symmetries. Describes the his- 
torical and philosophical context in which the ideas 
arose, and discusses the relationship between these 
ideas and our understanding of the physical world. 
The quantitative material is presented mostly in the 
form of graphs and diagrams, and very few formulas 
are included. RH 

Applications (Physics), S(14-16), L. The Ez- 
panding Universe. Arthur Eddington. Cambridge 
U Pr, 1987, xxiii + 128 pp, $12.95 (P). [ISBN: 
0-521-34976-1] Re-issue in the Cambridge Science 
Classics series of a set of public lectures delivered 
by Eddington in 1932 in Cambridge, Massachusetts, 
in which he attempts to convey in personal, non- 
mathematical terms a cosmological model linking 
relativity with quantum theory in a manner con- 
sistent with Hubble’s observations. Lively, enlight- 
ening, closely reasoned—a vivid mix of science and 
art. LAS 

Applications (Physics), S*(16-17), L**. £El- 
ementary Particles and the Laws of Physics: The 
1986 Dirac Memorial Lectures. Richard P. Feynman, 
Steven Weinberg. Cambridge U Pr, 1987, x + 110 
pp, $12.95. [ISBN: 0-521-340004] The first two lec- 
tures in a series established at Cambridge University 
to commemorate P.A.M. Dirac. Feynman speaks on 
the reason for antiparticles, Weinberg on the final 
laws of physics. Super exposition by master inter- 
preters of modern physics. LAS 


Applications (Physics), P. Probabilistic Meth- 
ods in Mathematical Physics. Ed: Kiyosi Ité, 
Nobuyuki Ikeda. Academic Pr, 1987, vii + 441 pp, 
$59.50. [ISBN: 0-12-375660-X] Proceedings of the 
Taniguchi International Symposium in Katata and 
Kyoto, 1985. MU 

Applications (Physics), T(13: 1), L. Beyond the 
Mechanical Universe: From Electricity to Modern 
Physics. Richard P. Olenick, Tom M. Apostol, David 
L. Goodstein. Cambridge U Pr, 1986, xiv + 574 pp, 
$27.95. (ISBN: 0-521-30430-X] The second volume 
of a college physics text which accompanies the ac- 
claimed PBS series The Mechanical Universe ...and 
Beyond, a Caltech-based project to “infuse new life 
in freshman physics.” This volume corresponds to 
the last half of the television series: electricity; mag- 
netism; relativity; particles; quantum mechanics. An 
offering institution would presumably provide a “live, 
flesh-and-blood college physics teacher” and a labo- 
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ratory component. Text may be used alone with- 
out the television series; teacher supplements avail- 
able. RB 


Applications (Physics), P. Functional Integrals 
and Collective Ezcttations. V.N. Popov. Mono. on 
Math. Physics. Cambridge U Pr, 1987, viii + 216 
pp, $49.50. [ISBN: 0-521-30777-5] Begins with an in- 
troduction to functional integral techniques in equi- 
librium statistical physics and the expression of par- 
tition functions and Green functions in terms of func- 
tional integrals. Later sections cover superfluid Bose 
systems, plasma and superfluid Fermi systems, crys- 
tals, heavy atoms, and model Hamiltonians. MU 


Applications (Physics), S, P, L*. Nonlinear 
Functional Analysis and tts Applications, IV: Appli- 
cations to Mathematical Physics. Eberhard Zeidler. 
Transl: Juergen Quandt. Springer-Verlag, 1988, xxiii 
+ 975 pp, $129. [ISBN: 0-387-96499-1] The fourth of 
a five-volume set that includes historic background 
of modern ideas and attempts to bridge the gaps be- 
tween mathematicians and physicists. Definite tilt 
of author to the notion that mathematicians should 
know a good deal about the assumptions underly- 
ing mathematical models results in a book in which 
physics and mathematics do appear unified. Nu- 
merous quotations that introduce each chapter are 
themselves fascinating reading; an impressive, eru- 


dite, readable book. AWR 


Applications (Simulation), T(17-18), S, P. 
Neural and Brain Modeling. Ronald J. MacGregor. 
Academic Pr, 1987, xii + 643 pp, $89. [ISBN: 0- 
12-464260-8] Part I: an extensive review (presum- 
ably the first) of the literature in neural and brain 
modeling, surveying engineering, mathematical, and 
computér-based modeling methodology. Part II: a 
computer model (54 FORTRAN programs) which 
simulates the dynamics of neurons and neuronal net- 
works, accompanied by over 200 pages of instruc- 
tions, examples and exercises, for access by students, 
scholars, and researchers. RB 


Reviewers 


RJA: Richard J. Allen, St. Olaf; DFA: David F. Appleyard, 
Carleton; RB: Richard Brown, Carleton; LB-E: Lisa Brown- 
Ewing, Macalester; JNC: Judith N. Cederberg, St. Olaf; LC: 
Laura Chihara, St. Olaf; BC: Barry Cipra, St. Olaf; CEC: 
Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Car- 
leton; RSF: Robert S. Fisk, St. Olaf; SG: Steven Galovich, 
Carleton; GG: George Gilbert, St. Olaf; BH: Bruce Hanson, 
St. Olaf; RH: Rhonda Hatcher, St. Olaf; RK: Roger Kirch- 
ner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: Joseph 
Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; AM: 
Alan Magnuson, St. Olaf; SM: Steve McKelvey, St. Olaf; RM: 
Richard Molnar, Macalester; RWN: Richard W. Nau, Car- 
leton; AO: Arnold Ostebee, St. Olaf; MR: Matthew Richey, 
St. Olaf; AWR: A. Wayne Roberts, Macalester; GMS: G. 
Michael Schneider, Macalester; PS: Paul Schoening, St. Olaf; 
JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; 
KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; 
MS: Myriam Steinback, Macalester; MU: Milton Ulmer, Car- 
leton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wal- 
lace, St. Olaf; LW: Lynne Walling, St. Olaf; FLW: Frank L. 
Wolf, Carleton; MZ: Michael Zielinski, St. Olaf. 
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New... 


Writing Mathematics Well, by Leonard Gillman 


64 pp., 1987, ISBN-0-88385-443-0 


Catalog Number - WMW __ List: $6.00 MAA Member: $4.50 


Good writing conveys more than the 
author originally had in mind, while poor 
writing conveys less. Well-written papers 
are more quickly accepted and put into 
print and more widely read and appreci- 
ated than poorly written ones—and for 
notes, monographs, and books the qual- 
ity of writing is more important than it is 
for papers. 

In Writing Mathematics Well, Leo- 
nard Gillman tells his readers how to 
develop a clear and effective style. All 
aspects of mathematical writing are cov- 
ered, from general organization and 
choice of title, to the presentation of 
results, to fine points on using words and 
symbols, to revision, and finally, to the 
mechanics of putting your manuscript into 
print. No book can by itself make you a 
better writer, but this one will alert you to 
the opportunities for better and more 


forceful writing. It does this both by pre- 
cept and by example. 

A book to be read for its sharpness 
and wit as well as for enlightenment, Writ- 
ing Mathematics Well should be on the 
shelf of anyone who writes or intends to 
write mathematics. It will amuse and 
delight the already careful writer and it will 
help reform and refine the sensibilities of 
those who may be somewhat careless 
about their writing. 
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The Mathematical Association 
of America 

1529 Eighteenth St. NW 
Washington, D.C. 20036 
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A Series of Successes from 


D. C. Heath 


THE LARSON/HOSTETLER 
PRECALCULUS SERIES 


Roland E. Larson and 
Robert P. Hostetler, 

both of The Pennsylvania State University, 
The Behrend College 
Instructor’s Guide 

Study and Solutions Guide 
Complete Solutions Guide 
Transparencies 

HeathTest for Precalculus 
(IBM PC and Apple Il Series) 
Computer Activities for the 
Precalculus Series 


ALGEBRA AND 
TRIGONOMETRY 
Second Edition 

1989 Cloth 729 pages 


COLLEGE ALGEBRA 
Second Edition 
1989 Cloth 545 pages 


TRIGONOMETRY 
Second Edition 
1989 Cloth 332 pages est. 


PRECALCULUS 
Second Edition 
1989 Cloth 638 pages est. 


Want to Know More? 
For details or sample copies 
call us toll free at: 


1 (800) 235-3565. 


D. C. Heath 


and Company 


125 Spring Street 
Lexington, MA 02173 


THE HARSHBARGER/ 
REYNOLDS SERIES 


Ronald J. Harshbarger and 
James J. Reynolds, 
both of The Pennsylvania State University 


Instructor’s Guide 

Selected Solutions Guide 
Computerized Testing Program 
(IBM PC} 


MATHEMATICAL APPLICA- 
TIONS FOR MANAGEMENT, 
LIFE, AND SOCIAL SCIENCES 
Third Edition 

1989 Cloth 800 pages est. 


FINITE MATHEMATICS FOR 
MANAGEMENT, LIFE, AND 
SOCIAL SCIENCES 

Third Edition 

1989 Cloth 512 pages est. 


APPLIED CALCULUS FOR 
MANAGEMENT, LIFE, AND 
SOCIAL SCIENCES 

Third Edition 

1989 Cloth 448 pages est. 


New to 

THE NOVAK SERIES FOR 
DEVELOPMENTAL 
MATHEMATICS 


DEVELOPMENTAL 
MATHEMATICS 
Second Edition 


David Novak, Simmons College 
1989 Paper 992 pages est. 


Instructor’s Guide with Tests 
Student Guide to Margin Exercises 
Answer Key 

Computerized Testing Program 
(IBM PC) 

The Graphic Gradebook 
Videotapes 


Announcing the successor to muMATH* 


Derive™ 
A Mathematical Assistant 


for PC-compatible computers 
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COMMAND: Author Build Calculus Declare Expand Factor Help Jump soLve Manage 
Options ETI Quit Remove Simplify Transfer moVe Window approX 

Enter option 

User B: DERIVEAD .MTH Free: 797% Derive Algebra 


¢ Computer algebra, including calculus, vectors 
and matrices 


e 2-dimensional display of formulas 

e 2- and 3-dimensional function plotting 

e Exact and approximate arithmetic to thousands 
of digits 

¢ Easy menu-driven interface 

¢ Requires only 512 kilobytes of RAM memory and 
one floppy drive 

¢ Ideal for educators, students and professionals 

¢ $200 plus shipping: Call or write for information: 


Lv] Soft Warchouse: 


3615 Harding Avenue, Suite 505 « Honolulu, Hi 96816 
(808) 734-5801 after noon PST 


©1988 Soft Warehouse, Inc. Handcrafted software for the mind. 
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